Find open-source science resources

!image/png

2.1K1 year ago

mradermacher/zerank-2-GGUF

by mradermacher

For a convenient overview and download list, visit our model page for this model.

4411 week ago

blaze999/Medical-NER

by blaze999

token-classification

This model is a fine-tuned version of DeBERTa on the PubMED Dataset.

48.7K2 years ago

microsoft/BiomedCLIP-PubMedBERT_256-vit_base_patch16_224

by microsoft

zero-shot-image-classification

BiomedCLIP is a biomedical vision-language foundation model that is pretrained on PMC-15M, a dataset of 15 million figure-caption pairs extracted from biomedical research articles in PubMed Central, using contrastive learning.

975.3K1 year ago

seyonec/ChemBERTa-zinc-base-v1

by seyonec

Deep learning for chemistry and materials science remains a novel field with lots of potiential. However, the popularity of transfer learning based methods in areas such as NLP and computer vision have not yet been effectively developed in computational chemistry + machine learning.

254.2K5 years ago

zhihan1996/DNA_bert_4

by zhihan1996

46710 months ago

andrewdalpino/ESM2-35M-Protein-Cellular-Component

by andrewdalpino

An Evolutionary-scale Model (ESM) for protein function prediction from amino acid sequences using the Gene Ontology (GO). Based on the ESM2 Transformer architecture, pre-trained on UniRef50, and fine-tuned on the AmiGO dataset, this model predicts the GO subgraph for a particular protein sequence -…

1511 months ago

google/medgemma-4b-it

by google

image-text-to-text

515.9K6 months ago

gbyuvd/drugtargetpred-chemselfies

by gbyuvd

This model is a BERT-like sequence classifier for 221 human protein drug targets, fine-tuned from gbyuvd/chemselfies-base-bertmlm on a dataset derived ChemBL34 (Zdrazil et al. 2023). It predicts potential drug targets using chemical structures represented as SELFIES (Self-Referencing Embedded…

91 year ago

ibm-research/biomed.omics.bl.sm.ma-ted-458m.tcr_epitope_bind

by ibm-research

T-cell receptor (TCR) binding to immunogenic peptides (epitopes) presented by major histocompatibility complex (MHC) molecules is a critical mechanism in the adaptive immune system, essential for antigen recognition and triggering immune responses.

441 year ago

zeroentropy/zerank-2-reranker

by zeroentropy

text-ranking

In search engines, rerankers are crucial for improving the accuracy of your retrieval system.

150.1K2 weeks ago

prov-gigapath/prov-gigapath

by prov-gigapath

image-feature-extraction

58.6K1 year ago

zeroentropy/zembed-1-embedding

by zeroentropy

feature-extraction

In retrieval systems, embedding models determine the quality of your search.

356.2K2 months ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-MUV-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

141 year ago

google/medgemma-1.5-4b-it

by google

image-text-to-text

205.5K1 month ago

ibm-research/biomed.omics.bl.sm.ma-ted-458m.moleculenet_clintox_fda

by ibm-research

Drugs must satisfy stringent criteria for both efficacy and safety. This model predicts the likelihood of FDA approval for small-molecule drugs, represented using SMILES (Simplified Molecular Input Line Entry System) strings.

261 year ago

ibm-research/biomed.omics.bl.sm.ma-ted-458m.protein_solubility

by ibm-research

Protein solubility is a critical factor in both pharmaceutical research and production processes, as it can significantly impact the quality and function of a protein. This is an example for finetuning ibm/biomed.omics.bl.sm-ted-458m for protein solubility prediction (binary classification) based…

641 year ago

DISCO-Design/DISCO

by DISCO-Design

other

DISCO (DIffusion for Sequence-structure CO-design) is a multimodal generative model that simultaneously co-designs protein sequences and 3D structures, conditioned on and co-folded with arbitrary biomolecules — including small-molecule ligands, DNA, and RNA.

1714 hours ago

PocketDoc/Dans-PersonalityEngine-V1.2.0-24b

by PocketDoc

Dans-PersonalityEngine-V1.2.0-24b ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⠀⠄⠀⡂⠀⠁⡄⢀⠁⢀⣈⡄⠌⠐⠠⠤⠄⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⠀⡄⠆⠀⢠⠀⠛⣸⣄⣶⣾⡷⡾⠘⠃⢀⠀⣴⠀⡄⠰⢆⣠⠘⠰⠀⡀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠃⠀⡋⢀⣤⡿⠟⠋⠁⠀⡠⠤⢇⠋⠀⠈⠃⢀⠀⠈⡡⠤⠀⠀⠁⢄⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠁⡂⠀⠀⣀⣔⣧⠟⠋⠀⢀⡄⠀⠪⣀⡂⢁⠛⢆⠀⠀⠀⢎⢀⠄⢡⠢⠛⠠⡀⠀⠄⠀⠀ ⠀⠀⡀⠡⢑⠌⠈⣧⣮⢾⢏⠁⠀⠀⡀⠠⠦⠈⠀⠞⠑⠁⠀⠀⢧⡄⠈⡜⠷⠒⢸⡇⠐⠇⠿⠈⣖⠂⠀ ⠀⢌⠀⠤⠀⢠⣞⣾⡗⠁⠀⠈⠁⢨⡼⠀⠀⠀⢀⠀⣀⡤⣄⠄⠈⢻⡇⠀⠐⣠⠜⠑⠁⠀⣀⡔⡿⠨⡄…

361 year ago

prov-gigatime/GigaTIME

by prov-gigatime

image-to-image

2705 months ago

nvidia/geneformer_V2_316M

by nvidia

## Description: Geneformer is a foundational transformer model pretrained on a large-scale corpus of single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology.

285 months ago

biohub/esm3-sm-open-v1

by biohub

3.8K1 year ago

ibm-research/biomed.omics.bl.sm.ma-ted-458m.moleculenet_clintox_tox

by ibm-research

Drugs must satisfy stringent criteria for both efficacy and safety. This model predicts the likelihood of failure in clinical toxicity trials for small-molecule drugs, represented using SMILES (Simplified Molecular Input Line Entry System) strings.

241 year ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-BACE-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-BACE-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

1.6K3 months ago

gbyuvd/synthaccess-chemselfies

by gbyuvd

ChemFIE-SA is a BERT-like sequence classifier for predicting synthesis accessibility given a SELFIES string of a compound, fine-tuned from gbyuvd/chemselfies-base-bertmlm on DeepSA's expanded dataset from Wang et al. 2023.

91 year ago

Prior-Labs/tabpfn_3

by Prior-Labs

tabular-classification

### Model Overview TabPFN-3 is a transformer-based foundation model that uses in-context-learning to solve tabular prediction problems in a forward pass. Inference code can be found at https://github.com/PriorLabs/TabPFN. More details can be found in the Model Report.

8K1 week ago

zhihan1996/DNA_bert_5

by zhihan1996

48410 months ago

ibm-research/biomed.sm.mv-te-84m

by ibm-research

# ibm-research/biomed.sm.mv-te-84m biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of molecules in a foundation model…

12.1K3 months ago

Prior-Labs/tabpfn_2_6

by Prior-Labs

tabular-classification

### Model Overview TabPFN-2.6 is a transformer-based foundation model that uses in-context-learning to solve tabular prediction problems in a forward pass. Inference code can be found at https://github.com/PriorLabs/tabPFN.

11.3K1 month ago

PocketDoc/Dans-PersonalityEngine-V1.3.0-24b

by PocketDoc

Dans-PersonalityEngine-V1.3.0-24b Dans-PersonalityEngine-V1.3.0-24b ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⠀⢀⠀⠄⠀⡂⠀⠁⡄⢀⠁⢀⣈⡄⠌⠐⠠⠤⠄⡀⠀⠀⠀⠀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⠀⡄⠆⠀⢠⠀⠛⣸⣄⣶⣾⡷⡾⠘⠃⢀⠀⣴⠀⡄⠰⢆⣠⠘⠰⠀⡀⠀⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠀⠀⠀⠀⠃⠀⡋⢀⣤⡿⠟⠋⠁⠀⡠⠤⢇⠋⠀⠈⠃⢀⠀⠈⡡⠤⠀⠀⠁⢄⠀⠀⠀⠀ ⠀⠀⠀⠀⠀⠁⡂⠀⠀⣀⣔⣧⠟⠋⠀⢀⡄⠀⠪⣀⡂⢁⠛⢆⠀⠀⠀⢎⢀⠄⢡⠢⠛⠠⡀⠀⠄⠀⠀ ⠀⠀⡀⠡⢑⠌⠈⣧⣮⢾⢏⠁⠀⠀⡀⠠⠦⠈⠀⠞⠑⠁⠀⠀⢧⡄⠈⡜⠷⠒⢸⡇⠐⠇⠿⠈⣖⠂⠀…

1201 year ago

osmapi/Nidum-Gemma-2B-Uncensored-GGUF

by osmapi

Welcome to the repository for Nidum-Limitless-Gemma-2B-GGUF, an advanced language model that provides unrestricted and versatile responses across a wide range of topics. This version is designed for maximum flexibility, allowing you to run it on both CPU and GPU.

1.5K1 year ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-ESOL-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-ESOL-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

221 year ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-HIV-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-HIV-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

151 year ago

zhihan1996/DNA_bert_6

by zhihan1996

26.9K10 months ago

InstaDeepAI/instanovo-v1.0.0

by InstaDeepAI

# InstaNovo: De novo Peptide Sequencing Model ## Model Description

362 weeks ago

thelamapi/next-ocr

by thelamapi

image-text-to-text

![Language: Multilingual]()

3.3K2 months ago

empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit

by empirischtech

A domain-optimized reasoning model built on DeepSeek-R1-Distill-Qwen-32B, refined through a multi-stage pipeline of GPTQ quantization-aware training and QLoRA fine-tuning. Achieves 84% on MedQA — within 4 points of GPT-4o — in a ~20GB package that fits on a single L40/L40s GPU.

8191 month ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-CLINTOX-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-CLINTOX-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

161 year ago

zhihan1996/DNA_bert_3

by zhihan1996

1.9K10 months ago

FreedomIntelligence/HuatuoGPT-o1-7B

by FreedomIntelligence

HuatuoGPT-o1-7B

5031 year ago

ibm-research/biomed.omics.bl.sm.ma-ted-458m.dti_bindingdb_pkd

by ibm-research

Accurate prediction of drug-target binding affinity is essential in the early stages of drug discovery. This is an example of finetuning ibm/biomed.omics.bl.sm-ted-400 the task. Prediction of binding affinities using pKd, the negative logarithm of the dissociation constant, which reflects the…

1721 year ago

rajveer43/gemma-4-E4B-medical-legal-finance-qa

by rajveer43

Fine-tuned version of google/gemma-4-E4B-it across three professional domains — Medical, Legal, and Finance — using QLoRA (4-bit NF4) with Optuna-tuned hyperparameters, trained on Kaggle T4 GPU.

1K1 month ago

ncfrey/ChemGPT-19M

by ncfrey

# ChemGPT 19M ChemGPT is based on the GPT-Neo model and was introduced in the paper Neural Scaling of Deep Chemical Models.

6.8K3 years ago

ibm-research/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-QM7-101

by ibm-research

# ibm/biomed.sm.mv-te-84m-MoleculeNet-ligand_scaffold-QM7-101 biomed.sm.mv-te-84m is a multimodal biomedical foundation model for small molecules created using MMELON (Multi-view Molecular Embedding with Late Fusion), a flexible approach to aggregate multiple views (sequence, image, graph) of…

191 year ago

cambridgeltl/SapBERT-from-PubMedBERT-fulltext

by cambridgeltl

feature-extraction

datasets: - UMLS

1.8M2 years ago

andrewdalpino/ESM2-150M-Protein-Molecular-Function

by andrewdalpino

1511 months ago

nvidia/geneformer_V1_10M

by nvidia

155 months ago

google/medasr

by google

automatic-speech-recognition

12.3K3 weeks ago

ctheodoris/Geneformer

by ctheodoris

# Geneformer Geneformer is a foundational transformer model pretrained on a large-scale corpus of human single cell transcriptomes to enable context-aware predictions in settings with limited data in network biology.

20.2K1 month ago

littleworth/protgpt2-distilled-small

by littleworth

A compact protein language model distilled from ProtGPT2 using complementary-regularizer distillation---a method that combines uncertainty-aware position weighting with calibration-aware label smoothing to achieve 54% better perplexity than standard knowledge distillation at 9.4x compression.

52 months ago