lucascamillomd/cpgpt-models
Model weights, configurations, and vocabularies for CpGPT: A Foundation Model for DNA Methylation.
README
license: mit tags: DNA-methylation epigenetics foundation-model aging biology CpGPT Model Checkpoints Model weights, configurations, and vocabularies for CpGPT: A Foundation Model for DNA Methylation. Contents Pre-trained Models | Model | Size | Parameters | Model Name | |-------|------|------------|------------| | CpGPT-2M | 30MB | ~2.5M | small | | CpGPT-100M | 1.1GB | ~101M | large | Download Dependencies You will also need the DNA embeddings for your species of interest: Human:…
Source attribution
- HuggingFace — lucascamillomd/cpgpt-models
Related resources
Deep learning library for Chemistry based on Tensorflow
Unified framework for state-of-the-art pre-trained bio foundation models across genomics and transcriptomics, providing standardized interfaces and pipelines for DNA, RNA, and single-cell models including Evo 2, Geneformer, scGPT, and UCE with streamlined inference, benchmarking, and fine-tuning workflows (213+ stars, 2024-2025)
MIRit is an R package that provides several methods for investigating the relationships between miRNAs and genes in different biological conditions. In particular, MIRit allows to explore the functions of dysregulated miRNAs, and makes it possible to identify miRNA-gene regulatory axes that control biological pathways, thus enabling the users to unveil the complexity of miRNA biology. MIRit is an all-in-one framework that aims to help researchers in all the central aspects of an integrative miRNA-mRNA analyses, from differential expression analysis to network characterization.
G-quadruplexes (G4s) are unique nucleic acid secondary structures predominantly found in guanine-rich regions and have been shown to be involved in various biological regulatory processes. G4SNVHunter is an R package designed to rapidly identify genomic sequences with G4-forming propensity and to accurately screen user-provided single nucleotide variants—as well as other small-scale variants such as indels and MNVs—for their potential to destabilize these structures. This allows researchers to then screen these critical variants for deeper study, digging into how they might influence biological functions—think gene regulation, for instance—by impairing G4 formation propensity.
scvi-tools/tabula-sapiens-heart-scanvi
by scvi-toolsScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition, to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…
scvi-tools/tabula-sapiens-blood-stereoscope
by scvi-toolsStereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.