Introducing Certified DatasetsReviewed against basic checks, certified datasets are more visible on Polaris.

This dataset has not yet been certified by approved reviewers. It may contain issues related to data completeness and quality.

Dataset

tdcommons/cyp2c9-substrate-carbonmangels

CYP2C9 substrate.

Created on: July 22, 2024Dataset size: 27 KBNumber of datapoints: 669
Public

Tags

ADME

Modalities

MOLECULE

Details

README

Background

CYP P450 2C9 plays a major role in the oxidation of both xenobiotic and endogenous compounds. Substrates are drugs that are metabolized by the enzyme. TDC used a dataset from [1], which merged information on substrates and nonsubstrates from six publications.

Description of readout

Task Description: Binary Classification. Given a drug SMILES string, predict if it is a substrate to the enzyme.

Data resource

References: [1] Selecting relevant descriptors for classification by bayesian estimates: a comparison with decision trees and support vector machines approaches for disparate data sets.

[2] admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties.