Mutagenesis-Bonds Data Set

Description

The problem consists of predicting the mutagenicity of the molecules, that is, determining whether a molecule is mutagenic or non-mutagenic. The dataset for mutagenesis consists of 188 molecules, of which 125 are mutagenic (active) and 63 are non-mutagenic (inactive). From a MIL perspective different transformations are considered, concretely, mutagenesis-bonds representas all atom-bond tuples of a compound molecules as a bag.

Dataset

The original data set is partitioned using 10-fold cross-validation procedure five times. Thus, five different partitions of 10-fold cross validation are available

 

10-fold cross validation
Files
Procedure 1 mutagenesis_bonds-10-proc1.arff
Procedure 2 mutagenesis_bonds-10-proc2.arff
Procedure 3 mutagenesis_bonds-10-proc3.arff
Procedure 4 mutagenesis_bonds-10-proc4.arff
Procedure 5 mutagenesis_bonds-10-proc5.arff