Association of Computing Machinery Open Access Agreement Publications-Archive

Robust Self-training Strategy for Various Molecular Biology Prediction Tasks∗

Document Type

Article

Abstract

Molecular biology prediction tasks suffer the limited labeled data problem since it normally demands a series of professional experiments to label the target molecule. Self-training is one of the semi-supervised learning paradigms that utilizes both labeled and unlabeled data. It trains a teacher model on labeled data, and uses it to generate pseudo labels for unlabeled data. The labeled and pseudo-labeled data are then combined to train a student model. However, the pseudo labels generated from the teacher model are not sufficiently accurate. Thus, we propose a robust self-training strategy by exploring robust loss function to handle such noisy labels, which is model and task agnostic, and can be easily embedded with any prediction tasks. We have conducted molecular biology prediction tasks to gradually evaluate the performance of proposed robust self-training strategy. The results demonstrate that the proposed method consistently boosts the prediction performance, especially for molecular regression tasks, which have gained a 41.5% average improvement.

Publication Date

8-10-2022

Language

English

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Recommended Citation

Ma, Hehuan; Jiang, Feng; Rong, Yu; Guo, Yuzhi; and Huang, Junzhou, "Robust Self-training Strategy for Various Molecular Biology Prediction Tasks∗" (2022). Association of Computing Machinery Open Access Agreement Publications-Archive. 14.
https://mavmatrix.uta.edu/utalibraries_acmoapubs/14

Download

COinS

Association of Computing Machinery Open Access Agreement Publications-Archive

Robust Self-training Strategy for Various Molecular Biology Prediction Tasks∗

Document Type

Abstract

Publication Date

Language

License

Recommended Citation

Search

Browse

Author & Creator Corner

Association of Computing Machinery Open Access Agreement Publications-Archive

Robust Self-training Strategy for Various Molecular Biology Prediction Tasks∗

Authors

Document Type

Abstract

Publication Date

Language

License

Recommended Citation

Share

Search

Browse

Author & Creator Corner