Services: Core Data Resources
For information about what Core Data Resources are and how they are selected see the Data Platform pages.
Name of service | Tag | Related links* | Key collection | |
---|---|---|---|---|
ArrayExpress |
ArrayExpress is a MIAME-standard compliant resource that stores functional genomics experiments performed using RNA-Seq/ChIP-Seq and array-based technologies. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
BRENDA |
A comprehensive enzyme information system. It is the world’s largest and most widely information system on all aspects of enzymes, including function, structure, mutants, properties like stability, purification. Data download is possible as an integrated text filed or via SOAP. |
bio.toolsFAIRsharingTeSS | CDR | |
CATH/Gene3D |
A classification of protein structures and sequences that groups protein domains into superfamilies. |
bio.toolsFAIRsharingTeSS | CDR | |
Cellosaurus |
A knowledge resource on cell lines. |
bio.toolsFAIRsharing | CDR | |
ChEBI |
ChEBI (Chemical Entities of Biological Interest) is a dictionary of small molecular entities. It is manually annotated and provides a chemical ontology to describe small molecules, including their biological and chemical roles. |
bio.toolsFAIRsharingTeSS | CDR | |
ChEMBL |
ChEMBL is a database of bioactive compounds that focuses on interactions between small molecules and their macromolecular targets, including medicinal chemistry, clinical development and therapeutics data. |
bio.toolsFAIRsharingTeSS | CDR | |
Ensembl |
Produces and maintains automatic and manually curated annotation on eukaryotic genomes. It is integrated with important molecular resources, for example UniProt, and can be accessed programmatically or through a web browser. |
bio.toolsFAIRsharingTeSS | CDR | |
Ensembl Genomes |
Provides access to genome-scale data from bacteria, protists, fungi, plants and invertebrate metazoa, through a unified set of interactive and programmatic interfaces based on the Ensembl software platform. |
FAIRsharingTeSS | CDR | |
EuropePMC |
Europe PubMedCentral (EuropePMC) contains over 3 million full text life science research articles, of which over 900 000 are open access, and combines these with 30 million abstracts from PubMed and other sources. |
bio.toolsFAIRsharing | CDR | |
Human Protein Atlas (HPA) |
Database with millions of high-resolution images. |
bio.toolsFAIRsharing | CDR | |
IntAct |
IntAct provides a freely available, open source database system and analysis tools for molecular interaction data. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
InterPro |
InterPro classifies proteins into families and predicts the presence of important domains and sites. |
bio.toolsFAIRsharingTeSS | CDR | |
Orphadata |
Orphadata provides the scientific community with comprehensive, quality data sets related to rare diseases and orphan drugs from the Orphanet knowledge base, in reusable formats. |
bio.toolsFAIRsharing | CDR | |
PRIDE |
PRIDE (The Proteomics Identifications Database) is a standards-compliant, public repository for proteomics data. It contains protein and peptide identifications and their associated supporting evidence. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
Protein Data Bank in Europe (PDBe) |
The Protein Data Bank in Europe (PDBe) is the European part of the wwPDB for the collection, organisation and dissemination of data on biological macromolecular structures. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
Reactome |
An open-source, curated and peer reviewed pathway database. Its goal is to provide tools for the visualization, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modeling and systems biology. |
bio.toolsFAIRsharingTeSS | CDR | |
Rhea |
Rhea is a comprehensive and non-redundant resource of expert-curated biochemical reactions described using species from the ChEBI (Chemical Entities of Biological Interest) ontology of small molecules. |
bio.toolsFAIRsharing | CDR | |
SILVA |
SILVA provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya). |
bio.toolsFAIRsharing | CDR | |
STRING |
STRING is a database of known and predicted protein-protein interactions. The database contains information from numerous sources, including experimental repositories, computational prediction methods and public text collections. |
bio.toolsFAIRsharingTeSS | CDR | |
The European Genome-phenome Archive (EGA) |
The European Genome-phenome Archive (EGA) allows users to explore datasets from numerous genotype experiments, including case-control, population and family studies, that are supplied by a range of data providers. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
The European Nucleotide Archive (ENA) |
The European Nucleotide Archive (ENA) contains all the nucleotide sequences in the public domain and consolidates data from EMBL-Bank, the European Trace Archive and the Sequence Read Archive. |
bio.toolsFAIRsharingTeSS | CDR EDD | |
UniProtKB |
UniProt produces and maintains automatic and manually curated annotation of all publicly available protein sequences and serves these to users through various interfaces. |
bio.toolsFAIRsharingTeSS | CDR |