Public databases offer a large amount of information about proteins and the chemical compounds that interact with them: structure, function, physicochemical properties, interactions, etc. In general, the information they contain overlaps by 80%, so if only one database is used, 20% of the information is lost.
This experiment unifies the information from several of these databases and builds a matrix that can be used in virtual screening and in the prediction of target-compound interactions.
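
As a rough illustration of the idea, the sketch below merges interaction records from two sources and builds a binary target-by-compound matrix. It is not the experiment's actual pipeline: the function names (`unify_sources`, `build_matrix`), the record format (pairs of target and compound identifiers), and the toy identifiers are all assumptions made for this example.

```python
def unify_sources(*sources):
    """Return the union of (target, compound) pairs from all sources.

    Hypothetical helper: each source is assumed to be an iterable of
    (target_id, compound_id) tuples. Pairs present in more than one
    source are kept only once.
    """
    merged = set()
    for records in sources:
        for target, compound in records:
            merged.add((target, compound))
    return merged


def build_matrix(pairs):
    """Build a binary interaction matrix from (target, compound) pairs.

    Rows are targets and columns are compounds, both sorted by identifier.
    A cell is 1 if the pair was reported by at least one source, else 0.
    """
    targets = sorted({t for t, _ in pairs})
    compounds = sorted({c for _, c in pairs})
    row = {t: i for i, t in enumerate(targets)}
    col = {c: j for j, c in enumerate(compounds)}
    matrix = [[0] * len(compounds) for _ in targets]
    for t, c in pairs:
        matrix[row[t]][col[c]] = 1
    return targets, compounds, matrix


# Toy records, invented for illustration only.
source_a = [("P1", "C1"), ("P1", "C2"), ("P2", "C1")]
source_b = [("P1", "C1"), ("P2", "C3")]

pairs = unify_sources(source_a, source_b)
targets, compounds, matrix = build_matrix(pairs)

for name, values in zip(targets, matrix):
    print(name, values)
```

In this toy case the pair ("P2", "C3") appears only in the second source, so a matrix built from the first source alone would miss it. A real matrix would also need identifiers mapped to a common scheme across databases before merging, which this sketch does not attempt.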