"Don't reinvent the wheel"
This is a curated collection of papers about deep learning applications
in biology and disease, each providing code for learning or reproducibility purposes.
Feel free to visit this repository from time to time for inspiration. Good luck with your research!
- Guidelines or reviews for deep learning applications
- Genome
- Transcriptome
- Epigenome
- Proteome
- Metabolomics
- Microbiome
- Multiomics
- Radiomics
- Benchmarks
- Research group and people
- List of Abbreviations
- Contact
-
[2021 Nature Reviews Genetics] Navigating the pitfalls of applying machine learning in genomics[paper]
-
[2022 Nature Reviews Genetics] Obtaining genetics insights from deep learning via explainable artificial intelligence[paper]
-
[2021 Nature Reviews Molecular Cell Biology] A guide to machine learning for biologists[paper]
-
[pytorch-examples] Simple examples to introduce PyTorch[code]
- [2023 Nature Genetics | MLP/CNN/ResNet(one-dimensional) ]: Inference of chronic obstructive pulmonary disease with deep learning on raw spirograms identifies new genetic loci and improves risk models[paper][code]
-
[2021 Nature Biotechnology | CNN + GVAE + Gene ranking (GSEA) ]: Prediction of drug efficacy from transcriptional profiles with deep learning[paper][code]
-
[2023 Nature Biotechnology | GNN + Matrix Factorization ]: Supervised discovery of interpretable gene programs from single-cell data[paper][code]
-
[2022 Nature Communications | DAE + MLP classification + DaNN ]: Deep transfer learning of cancer drug responses by integrating bulk and single-cell RNA-seq data[paper][code]
- [2017 PNAS | CNN + LSTM ]: De novo peptide sequencing by deep learning[paper][code]
- [2022 Briefings in Bioinformatics | CNN + GAT ]: Learning spatial structures of proteins improves protein–protein interaction prediction[paper][code]
- [2022 Nature Medicine | MLP + ResNet ]: Metabolomic profiles predict individual multidisease outcomes[paper][code]
- [2023 Nature Methods | Stochastic Kinetic Model ]: Dictys: dynamic gene regulatory network dissects developmental continuum with single-cell multiomics[paper][code]
- [2024 Genome Biology | VAE + GAN + Contrastive Learning ]: TMO-Net: an explainable pretrained multi-omics model for multi-task learning in oncology[paper][code]
-
[2023 Nature Communications]: Systematic comparison of tools used for m6A mapping from nanopore direct RNA sequencing[paper][code]
-
[2023 Genome Biology]: Comprehensive benchmark and architectural analysis of deep learning models for nanopore sequencing basecalling[paper][code]
1.[Singapore] Jonathan Göke:We’re a computational genomics group searching for answers to these questions. We develop algorithms and machine learning methods and investigate genomics and transcriptomics data to better understand human diseases such as cancer. Our group specifically works on third generation long read RNA sequencing technology. We are located at the Genome Institute of Singapore. Read more about us, our research and publications, or available scholarships (PhD, internships) and positions. You can find more information about methods from the team on our github page [home]
- MLP: Multilayer Perceptron
- ResNet: Residual neural network
- CNN: Convolutional Neural Network
- LSTM: Long Short Term Memory networks
- AE: Autoencoder
- DAE: Denoising autoencoder
- VAE: Variational Autoencoder
- GVAE: Graph Variational Autoencoder
- GAN: Generative Adversarial Network
- DaNN: Domain-Adversarial Training of Neural Networks
- GCN: Graph Convolutional Networks
- GAT: Graph Attention Network
Email:mumdark 酩酊
Twitter: @酩酊