Guidelines for Method ComparisonRead the first pre-print from the Small Molecule Steering Committee

This dataset has not yet been certified by approved reviewers. It may contain issues related to data completeness and quality.

Dataset

polaris/posebusters-v1

A diverse datasset of recent high-quality protein–ligand complexes which contain drug-like molecules for AI-based docking method evaluation.

Created on: September 06, 2024Dataset size: 70 KBNumber of datapoints: 428
Public

Tags

docking

Modalities

MOLECULE_3D
PROTEIN_3D
MOLECULE

Details

README

logo

Background

The PoseBusters dataset set is a new set of carefully-selected publicly-available crystal complexes from the PDB. It is a diverse set of recent high-quality protein–ligand complexes which contain drug-like molecules. It only contains complexes released since 2021 and therefore does not contain any complexes present in the PDBbind General Set v2020 used to train many of the methods.

Buttenschoen et al. lists the steps used to select the 308 unique proteins and 308 unique ligands in the PoseBusters dataset set. The complexes were downloaded from the PDB as MMTF files and PyMOL was used to remove solvents and all occurrences of the ligand of interest before saving the proteins with the cofactors in PDB files and the ligands in SDF files.

Data source

Other links

User Attributes

These are custom, user-defined attributes that are not required by the Polaris data model.

AttributeValue
year2024