Open Science Index

Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

Filters

Domain

text-generation 30
fill-mask 17
text-classification 9
feature-extraction 8
image-text-to-text 5
question-answering 5
image-feature-extraction 3
sentence-similarity 3
tabular-classification 3
token-classification 3
image-segmentation 2
other 2
(None) 73

Language

Python 79
(None) 93

License

(None) 172

Source

huggingface 172

Type (1)

Software tool 3084
Database 2418
AI model 172

172 of 5,674 resources

Showing 101–150

LexBwmn/ACE-V1

by LexBwmn

object-detection

ACE-V1.1: Brain Tumor Detection. Caution: MEDICAL RESEARCH USE ONLY. ACE-V1.1 is NOT a cleared medical device. It must not be used for primary diagnosis or clinical decision-making. All outputs must be verified by a qualified professional.

↓0 · 3 weeks ago

SandboxAQ/AQAffinity

by SandboxAQ

↓0 · 3 months ago

Dr-BERT/DrBERT-7GB

by Dr-BERT

In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.

↓1.4K · 2 years ago

Dr-BERT/DrBERT-4GB

by Dr-BERT

In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.

↓309 · 2 years ago

OpenMed/OpenMed-NER-ChemicalDetect-ElectraMed-33M

by OpenMed

token-classification

Specialized model for Chemical Entity Recognition - Identifies chemical compounds and substances in biomedical literature

↓52 · 9 months ago

ConvergeBio/virtual-cell-patient

by ConvergeBio

feature-extraction

A patient-level disease classification model trained on single-cell RNA-seq data. Given a matrix of gene expression profiles (one row per cell), the model produces a disease-category prediction for the patient.

↓69 · 2 weeks ago

zeroentropy/zerank-1-small-reranker

by zeroentropy

In search engines, rerankers are crucial for improving the accuracy of your retrieval system.

↓12.7K · 2 months ago

EPFLiGHT/Apertus-70B-MeditronFO

by EPFLiGHT

text-generation

Apertus-70B-MeditronFO is a 70B-parameter medical specialist LLM, produced by supervised fine-tuning of Apertus-70B-Instruct on the Fully Open Meditron Corpus.

↓397 · 6 days ago

jackxinning/Leanly_AI

by jackxinning

question-answering

↓6.2K · 3 weeks ago

smgjch/Meow-Omni-1

by smgjch

Meow-Omni 1 is the world’s first Multimodal Large Language Model (MLLM) specifically engineered for Computational Ethology. It natively co-embeds four distinct modalities—Text, Video, Audio, and Biological Time-Series—to decode the latent intentions of non-verbal species.

↓252 · 1 week ago

microsoft/NatureLM-8x7B-Inst

by microsoft

Nature Language Model (NatureLM) is a sequence-based science foundation model designed for scientific discovery. Pre-trained with data from multiple scientific domains, NatureLM offers a unified, versatile model that enables various applications including…

↓240 · 11 months ago

microsoft/NatureLM-8x7B

by microsoft

Nature Language Model (NatureLM) is a sequence-based science foundation model designed for scientific discovery. Pre-trained with data from multiple scientific domains, NatureLM offers a unified, versatile model that enables various applications including…

↓33 · 11 months ago

birder-project/vit_reg4_so150m_p14_ls_dino-v2-bio

by birder-project

image-feature-extraction

vit_reg4_so150m_p14_ls_dino-v2-bio is a Bio-DINO image encoder for natural photographs of living organisms. It uses a SoViT-150M/14 Vision Transformer with 4 register tokens and 133.6M backbone parameters, trained with a DINOv2-style self-supervised objective on approximately 31 million curated images…

↓3.9K · 2 days ago

scvi-tools/tabula-sapiens-heart-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-heart-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-fat-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-TOXCAST-101

by ibm-research

biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

↓5 · 1 year ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-TOX21-101

by ibm-research

biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

↓13 · 1 year ago

Dr-BERT/DrBERT-4GB-CP-PubMedBERT

by Dr-BERT

In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.

↓357 · 2 years ago

Dr-BERT/DrBERT-4GB-CP-CamemBERT

by Dr-BERT

In recent years, pre-trained language models (PLMs) achieve the best performance on a wide range of natural language processing (NLP) tasks. While the first models were trained on general domain data, specialized ones have emerged to more effectively treat specific domains.

↓0 · 2 years ago

Sevenlee/kkk

by Sevenlee

image-segmentation

↓0 · 3 years ago

UEG/interface

by UEG

text-classification

↓0 · 3 years ago

sagawa/ReactionT5v1-forward

by sagawa

This is a ReactionT5 model pre-trained to predict the products of reactions.

↓63 · 1 year ago

UmbrellaInc/Prototype-Virus-1B

by UmbrellaInc

question-answering

↓8 · 2 months ago

ByteDance-Seed/bamboo_mixer

by ByteDance-Seed

This repository contains the official model for the paper "A Unified Predictive and Generative Solution for Liquid Electrolyte Formulation".

↓0 · 9 months ago

SaeedLab/SpeCollate

by SaeedLab

feature-extraction

↓3 · 2 months ago

InstaDeepAI/instanovoplus-v1.1.0

by InstaDeepAI

text-generation

InstaNovoPlus is a diffusion-based model for de novo peptide sequencing from mass spectrometry data. This model leverages multinomial diffusion for accurate, database-free peptide identification for large-scale proteomics experiments.

↓4 · 7 months ago

InstaDeepAI/winnow-helaqc-model

by InstaDeepAI

Winnow recalibrates confidence scores and provides FDR control for de novo peptide sequencing (DNS) workflows. This repository contains the calibrator trained on HeLa Single Shot data as referenced in our paper: De novo peptide sequencing rescoring and FDR estimation with Winnow.

↓0 · 2 weeks ago

InstaDeepAI/winnow-general-model

by InstaDeepAI

Winnow recalibrates confidence scores and provides FDR control for de novo peptide sequencing (DNS) workflows. This repository hosts a pretrained, general-purpose calibrator that maps raw InstaNovo model confidences and complementary features (mass error, retention time, chimericity, beam features,…

↓0 · 2 weeks ago

andrewdalpino/ESM2-150M-Protein-Biological-Process

by andrewdalpino

text-classification

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

↓3 · 11 months ago

PurvaTijare/PPTStab

by PurvaTijare

tabular-regression

PPTStab: Prediction and Designing of thermostable proteins with a desired melting temperature

↓0 · 1 year ago

scvi-tools/tabula-sapiens-fat-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-fat-scvi

by scvi-tools

ScVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. The learned low-dimensional latent representation of the data can be used for visualization and clustering.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-eye-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-eye-condscvi

by scvi-tools

CondSCVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in DestVI.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-eye-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-eye-scvi

by scvi-tools

ScVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. The learned low-dimensional latent representation of the data can be used for visualization and clustering.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bone_marrow-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bone_marrow-condscvi

by scvi-tools

CondSCVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in DestVI.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bone_marrow-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bone_marrow-scvi

by scvi-tools

ScVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. The learned low-dimensional latent representation of the data can be used for visualization and clustering.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-blood-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-blood-condscvi

by scvi-tools

CondSCVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in DestVI.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-blood-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-blood-scvi

by scvi-tools

ScVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. The learned low-dimensional latent representation of the data can be used for visualization and clustering.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bladder-stereoscope

by scvi-tools

Stereoscope is a variational inference model for single-cell RNA-seq data that can learn a cell-type specific rate of gene expression. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in Stereoscope.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bladder-condscvi

by scvi-tools

CondSCVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space. The predictions of the model are meant to be afterward used for deconvolution of a second spatial transcriptomics dataset in DestVI.

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bladder-scanvi

by scvi-tools

ScANVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. In addition to scVI, ScANVI is a semi-supervised model that can leverage labeled data to learn a cell-type classifier in the latent space…

↓0 · 2 months ago

scvi-tools/tabula-sapiens-bladder-scvi

by scvi-tools

ScVI is a variational inference model for single-cell RNA-seq data that can learn an underlying latent space, integrate technical batches and impute dropouts. The learned low-dimensional latent representation of the data can be used for visualization and clustering.

↓0 · 2 months ago

songlab/tokenizer-dna-mlm

by songlab

↓0 · 2 years ago
