Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

19 of 5,674 resources

Stamps Seurat, SingleCellExperiment, and SummarizedExperiment objects with a persistent metadata passport. For Seurat objects the passport is stored in the misc slot; for SingleCellExperiment and SummarizedExperiment objects it is stored in the metadata slot. Tracks animal info, experiment details, lineage (parent/child relationships), RDS registry numbers, processing logs, and custom fields. Includes an interactive Shiny gadget to fill and update the passport, and a read mode to print the full passport to console. The passport persists inside the RDS file with no external files needed.

3
1 month ago
R
MIT

E(3)-equivariant neural network interatomic potentials achieving DFT accuracy with up to 1000× less training data than invariant models; foundational architecture behind MACE and Allegro (Harvard, MIT, Nature Communications 2022)

914
4 days ago
Python
MIT

Computational pipeline library for Python, widely used in science and bioinformatics.

175
4 years ago
Python
MIT

CluMSID is a tool that aids the identification of features in untargeted LC-MS/MS analysis by the use of MS2 spectra similarity and unsupervised statistical methods. It offers functions for a complete and customisable workflow from raw data to visualisations and is interfaceable with the xcms family of preprocessing packages.

10
4 years ago
R
MIT

Provides functionality for producing geometric representations of protein and RNA structures, and biological interaction networks.

1.2K
2 days ago
Jupyter Notebook
MIT

A library for building, manipulating, analyzing, and automatically designing molecules, including a genetic algorithm.

284
4 months ago
Python
MIT

Interactive and hardware-agnostic SDK for laboratory automation, enabling programmatic control of liquid handlers, plate readers, and other lab instruments across multiple vendors; foundational infrastructure for self-driving laboratories and AI-driven experimental execution (447+ stars)

450
3 days ago
Python
MIT

Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.

214
6 years ago
Python
MIT

LLM papers for scientific discovery

345
6 months ago
MIT

ChemFormula provides a class for working with chemical formulas. It allows parsing chemical formulas, calculating formula weights, and generating formatted output strings (e.g. in HTML, LaTeX, or Unicode).

33
6 months ago
Python
MIT
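
The formula handling described in the entry above (parsing, formula weight, formatted output) can be illustrated with a minimal sketch. This is not the ChemFormula API: the function names `parse_formula`, `formula_weight`, and `to_unicode` are hypothetical, the atomic-weight table covers only four elements, and only flat formulas without brackets or charges are assumed.

```python
import re

# Assumption: a tiny table of standard atomic weights, for illustration only.
ATOMIC_WEIGHTS = {"H": 1.008, "C": 12.011, "N": 14.007, "O": 15.999}

# One element symbol (capital letter, optional lowercase letter) plus an optional count.
TOKEN_RE = re.compile(r"([A-Z][a-z]?)(\d*)")
FORMULA_RE = re.compile(r"(?:[A-Z][a-z]?\d*)+")

# Translation table from ASCII digits to Unicode subscript digits.
SUBSCRIPTS = str.maketrans("0123456789", "₀₁₂₃₄₅₆₇₈₉")


def parse_formula(formula):
    """Return a dict mapping element symbol to atom count (hypothetical helper)."""
    if not FORMULA_RE.fullmatch(formula):
        raise ValueError(f"unsupported formula: {formula!r}")
    counts = {}
    for symbol, digits in TOKEN_RE.findall(formula):
        # A missing count means a single atom, e.g. the O in H2O.
        counts[symbol] = counts.get(symbol, 0) + (int(digits) if digits else 1)
    return counts


def formula_weight(formula):
    """Sum atomic weights over all atoms; raises KeyError for unknown elements."""
    return sum(ATOMIC_WEIGHTS[el] * n for el, n in parse_formula(formula).items())


def to_unicode(formula):
    """Render the digits of a flat formula as Unicode subscripts."""
    return formula.translate(SUBSCRIPTS)


if __name__ == "__main__":
    print(parse_formula("C6H12O6"))
    print(round(formula_weight("C6H12O6"), 3))
    print(to_unicode("C6H12O6"))
```

A real library would also need to handle nested brackets, charges, isotopes, and hydrates, which this sketch deliberately leaves out.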

Equivariant graph attention Transformer (ICLR 2023)

282
1 year ago
Python
MIT

First agentic LLM for autonomous data science with end-to-end pipeline from data to analyst-grade reports

4.2K
1 month ago
Python
MIT

A Python script that converts positional information from a SAM dataset into interval format with 0-based start and 1-based end. The CIGAR string of each SAM record is used to compute the end coordinate.

37
3 months ago
Python
MIT
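
The coordinate conversion described in the entry above can be sketched as follows. This is an illustration rather than the listed script's code: the function names `reference_span` and `sam_to_interval` are hypothetical. It relies on two facts from the SAM format: POS is 1-based, and only the CIGAR operations M, D, N, = and X consume reference bases.

```python
import re

# CIGAR operations that advance the position on the reference sequence.
REF_CONSUMING = set("MDN=X")

CIGAR_TOKEN_RE = re.compile(r"(\d+)([MIDNSHP=X])")
CIGAR_FULL_RE = re.compile(r"(?:\d+[MIDNSHP=X])+")


def reference_span(cigar):
    """Number of reference bases covered by an alignment (hypothetical helper)."""
    if not CIGAR_FULL_RE.fullmatch(cigar):
        # Covers "*" (CIGAR unavailable) as well as malformed strings.
        raise ValueError(f"cannot interpret CIGAR string: {cigar!r}")
    return sum(
        int(length)
        for length, op in CIGAR_TOKEN_RE.findall(cigar)
        if op in REF_CONSUMING
    )


def sam_to_interval(pos, cigar):
    """Convert a 1-based SAM POS and CIGAR into (0-based start, 1-based end)."""
    start = pos - 1  # shift the 1-based POS to a 0-based start
    end = start + reference_span(cigar)  # equals the 1-based last aligned base
    return start, end


if __name__ == "__main__":
    # 5 soft-clipped bases, 10 matches, 2 inserted, 3 deleted, 20 matches.
    print(sam_to_interval(100, "5S10M2I3D20M"))
```

Soft clips (S), hard clips (H), insertions (I), and padding (P) do not move the reference position, so they are ignored when computing the end coordinate.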

Deep learning library for chemistry based on TensorFlow

6.8K
2 months ago
Python
MIT

Experiments with expanded ensembles to explore chemical space.

199
6 months ago
Python
MIT

Co-create PowerPoint presentations with Generative AI from documents or topics

358
2 weeks ago
Python
MIT

A Swiss Army knife for genome arithmetic.

1K
1 year ago
C
MIT

OpenChem is a deep learning toolkit for Computational Chemistry with PyTorch backend.

745
2 years ago
Python
MIT

lumpy: a general probabilistic framework for structural variant discovery.

342
3 months ago
C
MIT