Open Science Index

Find open-source science resources

Cross-domain directory aggregating tools, AI models, datasets, and research resources from bio.tools, Bioconductor, HuggingFace, curated GitHub awesome-lists, and more.

Filters

Domain

Microarray6
Infrastructure3
ImmunoOncology2
DataImport1
DifferentialExpression1
MotifAnnotation1
MultipleComparison1
Preprocessing1
SequenceMatching1
SNP1

Language

R18

License(1)

Artistic-2.0530
GPL-3496
MIT + file LICENSE369
CC-BY-4.0271
GPL-2231
GPL (>= 2)177
CC0-1.0123
CC-BY-3.087
GPL (>= 3)78
GPL-3 + file LICENSE68
GPL (>=2)47
file LICENSE38
(None)2555

Source

bioconductor18
github1

Type

Software tool18

Filters

Domain

Microarray6
Infrastructure3
ImmunoOncology2
DataImport1
DifferentialExpression1
MotifAnnotation1
MultipleComparison1
Preprocessing1
SequenceMatching1
SNP1

Language

R18

License(1)

Artistic-2.0530
GPL-3496
MIT + file LICENSE369
CC-BY-4.0271
GPL-2231
GPL (>= 2)177
CC0-1.0123
CC-BY-3.087
GPL (>= 3)78
GPL-3 + file LICENSE68
GPL (>=2)47
file LICENSE38
(None)2555

Source

bioconductor18
github1

Type

Software tool18

18 of 5,674 resources

arrayQualityMetrics

This package generates microarray quality metrics reports for data in Bioconductor microarray data containers (ExpressionSet, NChannelSet, AffyBatch). One and two color array platforms are supported.

★13 weeks ago

siggenes

MultipleComparison

Identification of differentially expressed genes and estimation of the False Discovery Rate (FDR) using both the Significance Analysis of Microarrays (SAM) and the Empirical Bayes Analyses of Microarrays (EBAM).

seqLogo

SequenceMatching

seqLogo takes the position weight matrix of a DNA sequence motif and plots the corresponding sequence logo as introduced by Schneider and Stephens (1990).

RLMM

A classification algorithm, based on a multi-chip, multi-SNP approach for Affymetrix SNP arrays. Using a large training sample where the genotype labels are known, this aglorithm will obtain more accurate classification results on new data. RLMM is based on a robust, linear model and uses the Mahalanobis distance for classification. The chip-to-chip non-biological variation is removed through normalization. This model-based algorithm captures the similarities across genotype groups and probes, as well as thousands other SNPs for accurate classification. NOTE: 100K-Xba only at for now.

Rhtslib

This package provides version 1.18 of the 'HTSlib' C library for high-throughput sequence analysis. The package is primarily useful to developers of other R packages who wish to make use of HTSlib. Motivation and instructions for use of this package are in the vignette, vignette(package="Rhtslib", "Rhtslib").

qpcrNorm

The package contains functions to perform normalization of high-throughput qPCR data. Basic functions for processing raw Ct data plus functions to generate diagnostic plots are also available.

PWMEnrich

MotifAnnotation

A toolkit of high-level functions for DNA motif scanning and enrichment analysis built upon Biostrings. The main functionality is PWM enrichment analysis of already known PWMs (e.g. from databases such as MotifDb), but the package also implements high-level functions for PWM scanning and visualisation. The package does not perform "de novo" motif discovery, but is instead focused on using motifs that are either experimentally derived or computationally constructed by other tools.

preprocessCore

A library of core preprocessing routines.

oligo

A package to analyze oligonucleotide arrays (expression/SNP/tiling/exon) at probe-level. It currently supports Affymetrix (CEL files) and NimbleGen arrays (XYS files).

mdqc

MDQC is a multivariate quality assessment method for microarrays based on quality control (QC) reports. The Mahalanobis distance of an array's quality attributes is used to measure the similarity of the quality of that array against the quality of the other arrays. Then, arrays with unusually high distances can be flagged as potentially low-quality.

MassSpecWavelet

Peak Detection in Mass Spectrometry data is one of the important preprocessing steps. The performance of peak detection affects subsequent processes, including protein identification, profile alignment and biomarker identification. Using Continuous Wavelet Transform (CWT), this package provides a reliable algorithm for peak detection that does not require any type of smoothing or previous baseline correction method, providing more consistent results for different spectra. See <doi:10.1093/bioinformatics/btl355} for further details.

lumi

The lumi package provides an integrated solution for the Illumina microarray data analysis. It includes functions of Illumina BeadStudio (GenomeStudio) data input, quality control, BeadArray-specific variance stabilization, normalization and gene annotation at the probe level. It also includes the functions of processing Illumina methylation microarrays, especially Illumina Infinium methylation microarrays.

logicFS

Identification of interactions between binary variables using Logic Regression. Can, e.g., be used to find interesting SNP interactions. Contains also a bagging version of logic regression for classification.

goseq

Detects Gene Ontology and/or other user defined categories which are over/under represented in RNA-seq data.

flagme

DifferentialExpression

Fragment-level analysis of gas chromatography-massspectrometry metabolomics data.

BufferedMatrix

A tabular style data object where most data is stored outside main memory. A buffer is used to speed up access to data.

affyio

Routines for parsing Affymetrix data files based upon file format information. Primary focus is on accessing the CEL and CDF file formats.

affxparser

Package for parsing Affymetrix files (CDF, CEL, CHP, BPMAP, BAR). It provides methods for fast and memory efficient parsing of Affymetrix files using the Affymetrix' Fusion SDK. Both ASCII- and binary-based files are supported. Currently, there are methods for reading chip definition file (CDF) and a cell intensity file (CEL). These files can be read either in full or in part. For example, probe signals from a few probesets can be extracted very quickly from a set of CEL files into a convenient list structure.

Submit a resource bio.tools Awesome Bioinformatics