Mutagenesis-Bonds Data Set
Description
The task is to predict the mutagenicity of molecules, that is, to determine whether a molecule is mutagenic or non-mutagenic. The mutagenesis data set consists of 188 molecules, of which 125 are mutagenic (active) and 63 are non-mutagenic (inactive). From a multiple-instance learning (MIL) perspective, several transformations of the data are considered; specifically, mutagenesis-bonds represents all atom-bond tuples of a compound molecule as a bag.
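The bag representation described above can be pictured with a small Python sketch. The field layout of an atom-bond tuple is not given here, so the `AtomBond` fields, the `Bag` class, and the example molecule are all hypothetical placeholders, not the actual attributes of the data set.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Hypothetical instance layout (assumption): the real attributes of an
# atom-bond tuple are not listed in this description.
AtomBond = Tuple[str, str, int]  # (first atom, second atom, bond type)


@dataclass
class Bag:
    """One molecule seen as a bag: many instances, a single label."""
    molecule_id: str
    instances: List[AtomBond]
    mutagenic: bool  # the label belongs to the bag, not to any instance


# Made-up molecule used only to show the shape of the data.
example = Bag(
    molecule_id="mol_example",
    instances=[("c", "c", 7), ("c", "h", 1), ("c", "n", 1)],
    mutagenic=True,
)

print(example.molecule_id, len(example.instances), example.mutagenic)
```

The point of the sketch is that the class label is attached to the molecule as a whole, while the learner only sees the collection of unlabeled atom-bond instances inside it.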
Dataset
The original data set is partitioned using a 10-fold cross-validation procedure, repeated five times. Thus, five different 10-fold cross-validation partitions are available:
10-fold cross validation | Files
Procedure 1 | mutagenesis_bonds-10-proc1.arff
Procedure 2 | mutagenesis_bonds-10-proc2.arff
Procedure 3 | mutagenesis_bonds-10-proc3.arff
Procedure 4 | mutagenesis_bonds-10-proc4.arff
Procedure 5 | mutagenesis_bonds-10-proc5.arff
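As a rough illustration of what five repetitions of 10-fold cross-validation over 188 molecules look like, the sketch below builds the partitions from bag indices. It is not the procedure used to produce the files above: the function name `repeated_kfold`, the seed, and the plain (non-stratified) shuffling are assumptions made for the example.

```python
import random


def repeated_kfold(n_bags, n_folds=10, n_repeats=5, seed=0):
    """Return n_repeats partitions, each a list of n_folds index lists.

    Splitting is done over bags (molecules), so all instances of a
    molecule stay in the same fold. Stratification by class is NOT
    applied here; that is a simplifying assumption of this sketch.
    """
    rng = random.Random(seed)
    partitions = []
    for _ in range(n_repeats):
        indices = list(range(n_bags))
        rng.shuffle(indices)
        # Deal the shuffled indices round-robin into the folds.
        folds = [indices[i::n_folds] for i in range(n_folds)]
        partitions.append(folds)
    return partitions


partitions = repeated_kfold(188)
for number, folds in enumerate(partitions, start=1):
    print(f"Procedure {number}:", [len(fold) for fold in folds])
```

With 188 bags and 10 folds, each fold holds 18 or 19 molecules. In each round one fold serves as the test set and the remaining nine as the training set.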