
This dataset has not yet been certified by approved reviewers. It may contain issues related to data completeness and quality.

Dataset

tdcommons/cyp2c9-substrate-carbonmangels

CYP2C9 substrate.

Created on: July 22, 2024
Dataset size: 27 KB
Number of datapoints: 669
Public

Tags

ADME

Modalities

MOLECULE

Details

README

Background

Cytochrome P450 2C9 (CYP2C9) plays a major role in the oxidation of both xenobiotic and endogenous compounds. Substrates are drugs that are metabolized by the enzyme. TDC used a dataset from [1], which merged information on substrates and nonsubstrates from six publications.

Description of readout

Task Description: Binary Classification. Given a drug SMILES string, predict whether it is a substrate of the enzyme.
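To make the shape of the task concrete, here is a minimal sketch of a record and a majority-class baseline, using only the Python standard library. The `Record` class, the field names `smiles` and `is_substrate`, and the helper functions are hypothetical names chosen for illustration, not the dataset's actual column names or any library's API. The toy rows use simple valid SMILES with made-up labels and are not drawn from the dataset.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    smiles: str        # molecule as a SMILES string (model input)
    is_substrate: int  # 1 = substrate, 0 = non-substrate (label)


# Placeholder rows: the labels are invented for illustration only
# and are NOT taken from the dataset.
toy_records = [
    Record("CCO", 0),
    Record("c1ccccc1", 0),
    Record("CC(=O)O", 1),
    Record("CCN", 0),
]


def majority_baseline(train):
    """Return a classifier that always predicts the most common training label."""
    most_common_label = Counter(r.is_substrate for r in train).most_common(1)[0][0]
    return lambda smiles: most_common_label


def accuracy(predict, records):
    """Fraction of records whose predicted label matches the stored label."""
    hits = sum(predict(r.smiles) == r.is_substrate for r in records)
    return hits / len(records)


predict = majority_baseline(toy_records)
toy_accuracy = accuracy(predict, toy_records)
print(f"majority-class accuracy on toy rows: {toy_accuracy:.2f}")
```

A baseline like this ignores the molecule entirely, so it is useful mainly as a floor: any real model that reads the SMILES string should beat it on held-out data, and on an imbalanced label distribution accuracy alone can look deceptively high.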

Data resource

References: [1] Selecting relevant descriptors for classification by Bayesian estimates: a comparison with decision trees and support vector machines approaches for disparate data sets.

[2] admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties.