ASAP Discovery x OpenADMET CompetitionTake part in the first prospective benchmark on Polaris.

Benchmark

molecularml/moleculeace-CHEMBL204-Ki
Created on: July 24, 2024Train size: 2,201Test size: 553
Public
Single Task
Regression

Participants

No participants yet.

Tags

ADMET
singletask

Related dataset

Leaderboard

Test set
Task
#
NameContributors
mean_squared_error
References
No results.

Details

README

Background

This dataset comprises collected and curated bioactivity data for the target Serine protease S1A subfamilyfrom ChEMBL assay CHEMBL204, utilized to evaluate the performance of various machine learning algorithms on activity cliffs. We employed classical machine learning methods combined with common molecular descriptors, as well as neural networks based on unstructured molecular data such as molecular graphs or SMILES strings.

Activity cliffs are molecules with small differences in structure but large differences in potency. Activity cliffs play an important role in drug discovery, but the bioactivity of activity cliff compounds are notoriously difficult to predict.

Description of readouts

  • exp_mean [nM]: Inhibition [Inhibitory Constant, Ki]
  • y: Negative of log transform of the bioactivity value.
  • split: Train-test split based on activity cliff.

Data resource: