Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

16 of 5,674 resources

E(3)-equivariant neural network interatomic potentials achieving DFT accuracy with up to 1000× less training data than invariant models, foundational architecture behind MACE and Allegro (Harvard, MIT, Nature Communications 2022)

9144 days ago
Python
MIT

Computation Pipeline library for python widely used in science and bioinformatics.

1754 years ago
Python
MIT

A library for building, manipulating, analyzing and automatic design of molecules, including a genetic algorithm.

2844 months ago
Python
MIT

Interactive and hardware-agnostic SDK for laboratory automation, enabling programmatic control of liquid handlers, plate readers, and other lab instruments across multiple vendors; foundational infrastructure for self-driving laboratories and AI-driven experimental execution (447+ stars)

4502 days ago
Python
MIT

Automatic Filtering, Trimming, Error Removing and Quality Control for fastq data.

2146 years ago
Python
MIT

ChemFormula provides a class for working with chemical formulas. It allows parsing chemical formulas, calculating formula weights, and generating formatted output strings (e.g. in HTML, LaTeX, or Unicode).

336 months ago
Python
MIT

Equivariant graph attention Transformer (ICLR2023)

2821 year ago
Python
MIT

First agentic LLM for autonomous data science with end-to-end pipeline from data to analyst-grade reports

4.2K1 month ago
Python
MIT

A Python script that converts positional information from a SAM dataset into interval format with 0-based start and 1-based end. CIGAR string of SAM format is used to compute the end coordinate.

373 months ago
Python
MIT

Deep learning library for Chemistry based on Tensorflow

6.8K2 months ago
Python
MIT

Experiments with expanded ensembles to explore chemical space.

1996 months ago
Python
MIT

Co-create PowerPoint presentations with Generative AI from documents or topics

3582 weeks ago
Python
MIT

OpenChem is a deep learning toolkit for Computational Chemistry with PyTorch backend.

7452 years ago
Python
MIT

Standalone browser-based Gene Ontology network viewer for exploring, filtering, searching, and exporting GO term and gene annotation neighborhoods from locally preprocessed GO OBO and GAF data.

NanoSV is a software package that can be used to identify structural genomic variations in long-read sequencing data, such as data produced by Oxford Nanopore Technologies’ MinION, GridION or PromethION instruments, or Pacific Biosciences RSII or Sequel sequencers.

Tool to generate a count matrix for expression data in Galaxy. generate_count_matrix reads in one or more input text files with expression counts and produces a single combined file. Each input will have a column in the matrix containing expression values. The column containing gene (or feature) names should be identical for all input count files.