Classification

Based on the training cross-validation score, the test average precision score and the test Precision-Recall curve, we select the best classifier for each experiment. All these metrics are considered equally important in the process of selection of the best classifier for use in the sampling process.

Testing evaluations

Experiment 1- hu.MAP

Experiment 2 - DIP

Variant 1 - MIPS train, TAP-MS test

Variant 2 - TAP-MS train, MIPS test

Experiment 3 - CombYeast

Training Evaluations

current_result_tables

Final pipelines

Experiment 1: GradientBoostingClassifier
Experiment 2 - variant1: Logistic Regression
Experiment 2 - variant2: Bernoulli Naive Bayes
Experiment 3 : GradientBoostingClassifier

It is interesting to note that the superior GradientBoostingClassifier performs the best only when train and test datasets have similar biases, such as in Experiments 1 and 3 where the complexes are split into train and test while ensuring equal size distributions.

Note that in the 2nd experiment, the classifier is trained on one dataset and tested on a dataset which comes from another source, meaning that the datasets can have different inherent biases.

<- Back to methods

Prediction results ->

Google Sites

Report abuse