Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

2,418 of 5,684 resources

Showing 1,1011,150

MetaCyc is a database of non-redundant, experimentally elucidated metabolic pathways and enzymes. It also contains reactions, chemical compounds, and genes. It stores predominantly qualitative information rather than quantitative data, although it does contain some quantitative data such as enzyme kinetics data. MetaCyc is [curated](http://www.biocyc.org/glossary.shtml?sid=biocyc14-3908554027#Curation) from the scientific experimental literature according to an [extensive process](https://metacyc.org/MetaCycUserGuide.shtml#TAG:__tex2page_sec_4)]. The majority of pathways occurring in it are from microorganisms and plants. MetaCyc stores thousands of additional enzyme-catalyzed reactions that have not yet been assigned an EC number

Medical Subject Headings vocabulary is the set of predicates used in the MeSH RDF dump

The MEROPS database is an information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them.

Chemical reactions in the Merck Index. This website doesn't exist anymore

A semantic space of small molecules

The midlevel energy ontology (MENO) is a BFO-based midlevel ontology. It comprises the concepts for energy qualities, energy-based dispositions and energy-driven transformation and transfer processes and their interrelations. It has the goal to provide an upper level structure for these concepts for energy-related domain ontologies.

The mission of MediaDive is to transform poorly structured media recipes into a standardized database. The contents of the database include mined thousands of PDF and HTML documents. To ensure the quality of the media and continous improvement of the database, we developed an internal editor interface. Experts at the DSMZ are creating new media and curating the existing media using this interface. [adapted from https://mediadive.dsmz.de/about]

The mission of MediaDive is to transform poorly structured media recipes into a standardized database. The contents of the database include mined thousands of PDF and HTML documents. To ensure the quality of the media and continous improvement of the database, we developed an internal editor interface. Experts at the DSMZ are creating new media and curating the existing media using this interface. [adapted from https://mediadive.dsmz.de/about]

The mission of MediaDive is to transform poorly structured media recipes into a standardized database. The contents of the database include mined thousands of PDF and HTML documents. To ensure the quality of the media and continous improvement of the database, we developed an internal editor interface. Experts at the DSMZ are creating new media and curating the existing media using this interface. [adapted from https://mediadive.dsmz.de/about]

This vocabulary is used internally inside MedGen to assign temporary identifiers to terms that will later get put in UMLS. Mappings between MedGen, MedGen CIDs, and UMLS can be found [here](https://ftp.ncbi.nlm.nih.gov/pub/medgen/MedGenIDMappings.txt.gz).

The MDL number contains a unique identification number for each reaction and variation. The format is RXXXnnnnnnnn. R indicates a reaction, XXX indicates which database contains the reaction record. The numeric portion, nnnnnnnn, is an 8-digit number. [wikipedia]

A controlled vocabulary to support the study of transcription in the mouse brain

Some IDs may represent experiment sets, e.g. https://www.mavedb.org/#/experiment-sets/urn:mavedb:00000011 Others represent genomic regions (specifically deep mutational scans thereof) e.g. https://www.mavedb.org/#/experiment-sets/urn:mavedb:00000011-a

MatrixDB is a freely available database focused on interactions established by extracellular matrix proteins, proteoglycans and polysaccharides

MathModDB is a database of mathematical models developed by the Mathematical Research Data Initiative (MaRDI). MathModDB defines a data model with classes (Mathematical Model, Mathematical Formulation, Research Field, Research Problem, Quantity [Kind], Computational Task, Publication), object properties/relations, data properties and annotation properties as an ontology. This ontology is populated with individuals/data from various fields of applied mathematics, making it a knowledge graph. [from homepage]

This vocabulary and grammar defines which types of objects are admissible to the MathAlgoDB - the algorithm knowledge graph - and by which properties they can relate. All in all five classes, "problem", "algorithm", "benchmark", "software", "publication", are defined, as well as a minimal but intuitively intelligible number of properties. As opposed to the more liberal WikiData, MathAlgoDB relies on the strict adherence to the ontology to provide a reliable machine-readable database of (numerical) algorithm knowledge. [from homepage]

MarkerDB is a freely available electronic database that attempts to consolidate information on all known clinical and pre-clinical biomarkers into a single resource. It provides identifiers for known clinical and pre-clinical biomarkers. Each entry provides detailed annotations, including biomarker descriptions, associated conditions, specificity, sensitivity, molecular structures, chromosomal locations, and clinical approval status.

MARC List for Languages provides three-character lowercase alphabetic strings that serve as the identifiers of languages and language groups. The codes are usually based on the first three letters of the English form or, in some cases, vernacular form of the corresponding language name. The codes are varied where necessary to resolve conflicts and are not intended to be abbreviations of a language name. When the name of a language is changed in the list, the original code is generally retained. The codes in this list are equivalent to those of ISO 639-2 (Bibliographic) codes and some codes from ISO 639-5, although the language name labels may differ. They are linked to the equivalent codes in ISO 639-2 and ISO 639-5 and the corresponding two-character codes in ISO 639-1. The list contains over 480 discrete codes. It is also searchable at: MARC Code List for Languages.