Open Science Index

Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

Filters

Domain (1)

Software (90)
Infrastructure (75)
ImmunoOncology (63)
Annotation (25)
Genetics (15)
Visualization (14)
Microarray (13)
DataImport (12)
DNAMethylation (12)
GeneExpression (10)
Sequencing (9)
BiologicalQuestion (8)
(None) (2)

Language (1)

R (15)

License (1)

Artistic-2.0 (15)
GPL-3 (11)
GPL-2 (9)
GPL (>= 2) (4)
MIT + file LICENSE (3)
Apache License (>= 2) (1)
CC0 (1)
GPL (1)
GPL (>= 3) (1)
GPL-3 + file LICENSE (1)
LGPL (1)
LGPL (>= 2.1) (1)

Source

bioconductor (15)
github (1)

Type

Software tool (15)

15 of 5,674 resources

VariantFiltering

Filter genetic variants using different criteria such as inheritance model, amino acid change consequence, minor allele frequencies across human populations, splice site strength, conservation, etc.

★47 months ago

VariantTools

Explore, diagnose, and compare variant calls using filters.

systemPipeR

systemPipeR is a workflow management environment for reproducible data analysis that integrates R with command-line software. It enables researchers to design, execute, and report complex workflows on local machines and HPC systems. The framework combines R-based analysis with external tools through a Common Workflow Language (CWL) interface, manages workflow dependencies and restart capabilities, and automatically generates reproducible scientific analysis reports. The companion package systemPipeRdata provides ready-to-use workflow templates that simplify workflow setup and customization. Alternatively, workflow templates can be loaded from dedicated GitHub repositories.

SummarizedExperiment

The SummarizedExperiment container contains one or more assays, each represented by a matrix-like object of numeric or other mode. The rows typically represent genomic ranges of interest and the columns represent samples.

SplicingGraphs

This package allows the user to create, manipulate, and visualize splicing graphs and their bubbles based on a gene model for a given organism. Additionally it allows the user to assign RNA-seq reads to the edges of a set of splicing graphs, and to summarize them in different ways.

regioneReloaded

regioneReloaded is a package for the simultaneous analysis of associations between genomic region sets, enabling clustering of data and the creation of publication-ready graphs. It takes over and expands on all the features of its predecessor, regioneR, adding new plotting functions. It also incorporates a strategy to improve p-value calculations and to normalize z-scores from multiple analyses so that they can be compared directly.

regioneR

regioneR offers a statistical framework based on customizable permutation tests to assess the association between genomic region sets and other genomic features.
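The idea of a permutation test on region sets can be illustrated with a minimal Python sketch. This is not regioneR's API (regioneR is an R package); the function names, the single linear "genome", and the uniform re-placement strategy are all assumptions made for illustration. Regions are half-open `(start, end)` tuples.

```python
import random

def count_overlaps(a, b):
    """Count regions in `a` that overlap at least one region in `b`."""
    return sum(any(s < e2 and s2 < e for (s2, e2) in b) for (s, e) in a)

def randomize(regions, genome_length, rng):
    """Re-place each region at a uniformly random start, keeping its width."""
    shuffled = []
    for s, e in regions:
        width = e - s
        start = rng.randrange(0, genome_length - width + 1)
        shuffled.append((start, start + width))
    return shuffled

def permutation_test(a, b, genome_length, n_perm=1000, seed=0):
    """Return (observed overlaps, z-score, one-sided empirical p-value)."""
    rng = random.Random(seed)
    observed = count_overlaps(a, b)
    permuted = [
        count_overlaps(randomize(a, genome_length, rng), b)
        for _ in range(n_perm)
    ]
    mean = sum(permuted) / n_perm
    var = sum((x - mean) ** 2 for x in permuted) / (n_perm - 1)
    sd = var ** 0.5
    z = (observed - mean) / sd if sd > 0 else float("nan")
    # Add one to numerator and denominator so the p-value is never exactly 0.
    p = (sum(x >= observed for x in permuted) + 1) / (n_perm + 1)
    return observed, z, p
```

The randomization step and the evaluation statistic are the two pieces that a "customizable" permutation test would let the user swap; here they are passed implicitly as `randomize` and `count_overlaps`.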

nucleoSim

This package can generate a synthetic map with reads covering the nucleosome regions, as well as a synthetic map with forward and reverse reads emulating next-generation sequencing. Synthetic hybridization data for tiling arrays can also be generated. The user can choose among three distributions for read positioning: Normal, Student, and Uniform. In addition, a visualization tool is provided to explore the synthetic nucleosome maps.

gwascat

Represent and model data in the EMBL-EBI GWAS catalog.

GenomicRanges

The ability to efficiently represent and manipulate genomic annotations and alignments is central to analyzing high-throughput sequencing data (a.k.a. NGS data). The GenomicRanges package defines general-purpose containers for storing and manipulating genomic intervals and variables defined along a genome. More specialized containers for representing and manipulating short alignments against a reference genome, or a matrix-like summarization of an experiment, are defined in the GenomicAlignments and SummarizedExperiment packages, respectively. Both packages build on top of the GenomicRanges infrastructure.

GenomicFiles

This package provides infrastructure for parallel computations distributed 'by file' or 'by range'. User defined MAPPER and REDUCER functions provide added flexibility for data combination and manipulation.
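The "by file" pattern with user-defined mapper and reducer functions can be sketched in a few lines of Python. This is a concept illustration only, not the GenomicFiles API; `map_reduce_by_file` and `count_lines` are hypothetical names, and a thread pool stands in for whatever parallel backend a real implementation would use.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce

def count_lines(path):
    """Example mapper (hypothetical): number of lines in one file."""
    with open(path) as handle:
        return sum(1 for _ in handle)

def map_reduce_by_file(paths, mapper, reducer, workers=4):
    """Apply `mapper` to each file in parallel, then fold results with `reducer`.

    `paths` must be non-empty, since no initial value is given to reduce().
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = list(pool.map(mapper, paths))  # results keep input order
    return reduce(reducer, mapped)
```

For example, `map_reduce_by_file(paths, count_lines, lambda x, y: x + y)` would total the line counts across all files; a "by range" variant would distribute genomic ranges across workers instead of files.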

GenomicFeatures

Extract the genomic locations of genes, transcripts, exons, introns, and CDS, for the gene models stored in a TxDb object. A TxDb object is a small database that contains the gene models of a given organism/assembly. Bioconductor provides a small collection of TxDb objects in the form of ready-to-install TxDb packages for the most commonly studied organisms. Additionally, the user can easily make a TxDb object (or package) for the organism/assembly of their choice by using the tools from the txdbmaker package.

GenomeInfoDb

Contains data and functions that define and allow translation between different chromosome sequence naming conventions (e.g., "chr1" versus "1"), including a function that attempts to place sequence names in their natural, rather than lexicographic, order.
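As a rough illustration of the two ideas in this description (translating between naming styles, and natural rather than lexicographic ordering), here is a small Python sketch. It is not GenomeInfoDb's implementation or API; the function names and the simplified rules (only the "chr" prefix and the M/MT mitochondrial alias are handled) are assumptions.

```python
def strip_chr(name):
    """Drop a leading 'chr' prefix if present."""
    return name[3:] if name.lower().startswith("chr") else name

def to_prefixed(name):
    """'1' -> 'chr1', 'MT' -> 'chrM' (simplified rule set)."""
    core = strip_chr(name)
    return "chr" + ("M" if core == "MT" else core)

def to_unprefixed(name):
    """'chr1' -> '1', 'chrM' -> 'MT' (simplified rule set)."""
    core = strip_chr(name)
    return "MT" if core == "M" else core

def natural_key(name):
    """Sort key: numbered chromosomes first (numerically), then X, Y, M, then the rest."""
    core = strip_chr(name)
    if core.isdigit():
        return (0, int(core), "")
    specials = {"X": 0, "Y": 1, "M": 2, "MT": 2}
    if core in specials:
        return (1, specials[core], "")
    return (2, 0, core)
```

Sorting with `sorted(names, key=natural_key)` places "chr2" before "chr10", whereas a plain lexicographic sort would put "chr10" first.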

cpvSNP

Gene set analysis methods exist to combine SNP-level association p-values into gene sets, calculating a single association p-value for each gene set. This package implements two such methods that require only the calculated SNP p-values, the gene set(s) of interest, and a correlation matrix (if desired). One method (GLOSSI) requires independent SNPs and the other (VEGAS) can take into account correlation (LD) among the SNPs. Built-in plotting functions are available to help users visualize results.

BSgenome

Infrastructure shared by all the Biostrings-based genome data packages.
