Background
QM9 is a well-known dataset in the field of 3D GNNs. It consists of 19 graph-level quantum properties associated to an energy-minimized 3D conformation of the molecules. It is considered a simple dataset since all the molecules have at most 9 heavy atoms. We chose QM9 in our ToyMix since it is very similar to the larger proposed quantum datasets, PCQM4M_multitask and PM6_83M, but with smaller molecules.
Assay information
Computed geometric, energetic, electronic, and thermodynamic properties for 134k stable small organic molecules made up of CHONF. All properties were calculated at the B3LYP/6-31G(2df,p) level of quantum chemistry. For the predominant stoichiometry, C7H10O2, there are 6,095 constitutional isomers among the 134k molecules, with reported energies, enthalpies, and free energies of atomization at the more accurate G4MP2 level of theory.
Benchmarking
The goal of this benchmark is to have the best predictive model for atomic coordinates and calculated properties.
Data resource
Reference: Quantum chemistry structures and properties of 134 kilo molecules