{"id":368836,"date":"2023-12-05T07:49:33","date_gmt":"2023-12-05T12:49:33","guid":{"rendered":"https:\/\/platohealth.ai\/artidock-from-receptor-ai-next-generation-ai-docking-that-beats-diffdock-and-alphafold-latest\/"},"modified":"2023-12-05T07:49:56","modified_gmt":"2023-12-05T12:49:56","slug":"artidock-from-receptor-ai-next-generation-ai-docking-that-beats-diffdock-and-alphafold-latest","status":"publish","type":"post","link":"https:\/\/platohealth.ai\/artidock-from-receptor-ai-next-generation-ai-docking-that-beats-diffdock-and-alphafold-latest\/","title":{"rendered":"ArtiDock from Receptor.AI: Next-generation AI Docking That Beats DiffDock and AlphaFold-latest","gt_translate_keys":[{"key":"rendered","format":"text"}]},"content":{"rendered":"

Receptor.AI\u00a0has announced ArtiDock, the best-in-class model for \u201cAI docking,” which predicts the binding poses of small molecule ligands in protein binding pockets with unprecedented speed and accuracy.<\/p>\n

The company\u00a0performed a comprehensive comparison of ArtiDock with the best modern AI docking techniques and with the most widely used conventional docking programs, Vina and Gold.<\/p>\n

ArtiDock not only beats the previous best-performer,\u00a0DiffDock<\/a>\u00a0by a large margin but also performs on par with classical docking programs and rivals the recently announced next generation of\u00a0AlphaFold-latest<\/a>\u00a0that is capable of predicting protein-ligand complexes.<\/p>\n

How does it work?<\/h2>\n

The key to success is a proprietary technique of data augmentation combined with a fast and lightweight model architecture. ArtiDock is trained on a mixture of artificial and real complexes and is aware of a much larger set of combinations of intermolecular interactions than are present in the resolved structures. This gives it a dramatic boost in accuracy and predictive power.<\/p>\n

Instead of relying on only about 20.000+\u00a0resolved protein-ligand complexes, millions of artificial binding pockets, which closely follow the experimentally observed statistical properties of the real ones, are generated. The algorithmic approach to such generation is currently being published.<\/p>\n

Benchmark<\/h2>\n

The benchmark included the commonly used\u00a0Astex docking dataset<\/a>\u00a0and the novel\u00a0PoseBuster dataset<\/a>, which is dedicated to challenging the quality of AI docking algorithms. The usual RMSD metric was used, as well as an additional PoseBusters-Valid set of metrics. The latter also assesses steric clashes and the conformer quality of the ligand.<\/p>\n

The ArtiDock outperforms all its rivals (both AI and docking) with a large margin for the Astex dataset using either RMSD or PB-Valid metrics, as shown in\u00a0Figure 1.\u00a0<\/p>\n

\n

\"Comparative<\/p>

Figure 1.<\/figcaption><\/figure>\n

For the PoseBuster dataset, ArtiDock beats all other AI techniques, including DiffDock, and performs on par or slightly better than the docking techniques, as shown in\u00a0Figure 2.\u00a0\u00a0While its precision approaches the AlphaFold-latest model, ArtiDock stands out with its remarkable throughput, being 600 times faster. This is backed by a blend of data augmentation and a fast, lightweight model architecture. Trained on a mix of artificial and real complexes, it\u2019s aware of a broader range of PPI interactions and shallow pockets, making it efficient in docking even for binding interfaces lacking data on ligands and their binding modes.<\/p>\n

\n

\"Comparative<\/p>

Figure 2.<\/figcaption><\/figure>\n

Figure 3\u00a0shows the comparison of individual PoseBuster structural metrics for ArtiDock, DiffDock, and AlphaFold-latest, the newly announced AlphaFold version, which is capable of predicting protein-ligand complexes. It is clearly seen that ArtiDock significantly outperforms DiffDock for all the metrics, which are not predicted with 100% accuracy. An exception is the distance to inorganic cofactors, where the results of DiffDock are marginally better. ArtiDock even outperforms AlphaFold-latest for tetrahedral chirality.<\/p>\n

\n

\"Percentage<\/p>

Figure 3.<\/figcaption><\/figure>\n

It is worth emphasizing that although AlphaFold-latest provides the best overall prediction quality, this precision comes at the cost of extremely slow inference speed.<\/p>\n

Figure 4\u00a0shows the inference speeds of all the studied techniques. ArtiDock is somewhat slower than some of the “quick and dirty” AI techniques, but it is still two orders of magnitude faster than DiffDock, Vina, and Gold while being superior to them in quality. The AlphaFold-latest is expectedly the worst performed here. It is three orders of magnitude slower than ArtiDock, which makes it unusable for virtual screening\u2014the area where the technology of\u00a0Receptor.AI<\/a>\u00a0shines.<\/p>\n

\n

\"Approximate<\/p>

Figure 4.<\/figcaption><\/figure>\n

Conclusion<\/h2>\n

The ArtiDock model has a large potential for improvement despite already being impressive. The company is actively working on further increases in quality and inference speed.<\/p>\n

Thus, as of the present moment, the\u00a0Receptor.AI<\/a>\u00a0ligand pose prediction model appears to be the most capable\u00a0on the market in terms of the balance between prediction quality and high throughput capabilities.<\/p>\n

Receptor.AI was featured and profiled in The State of AI in the Biopharma Industry<\/a>, a landmark technology-scouting solution for the life sciences industry.<\/p>\n