Explainable Machine Learning (XML) approaches are crucial for medical information processing tasks, particularly for multi-omics data analytics. The XML system not only provides better performance but also explains the inside of the finding better. Here, we proposed an end-to-end explainable system for analyzing high dimensional RNA-seq data using an unsupervised gene selection approach and supervised methods, including Deep Neural Network (DNN), Deep Convolutional Neural Network (DCNN), Support Vector Machine (SVM), and Random Forest (RF). The proposed approaches evaluate with publicly available datasets for five different cancers and Kawasaki disease (KD) classification. The deep learning-based approaches yield the 99.62% and 99. 25% average testing accuracy for cancer and KD classification tasks. Additionally, we introduce an explainable system that demonstrates the ability to select cancer and disease-specific gene sets, which could be used for further analysis to discover the biological inside of the cancers and KD diseases.
|