OBO Foundry

OntoBee AberOWL OLS BioPortal Bioregistry

SO is a collaborative ontology project for the definition of sequence features used in biological sequence annotation. SO was initially developed by the Gene Ontology Consortium. Contributors to SO include the GMOD community, model organism database groups such as WormBase, FlyBase, Mouse Genome Informatics group, and institutes such as the Sanger Institute and the EBI. Input to SO is welcomed from the sequence annotation community. SO is also part of the Open Biomedical Ontologies library. Our aim is to develop an ontology suitable for describing the features of biological sequences. For questions, please send mail to the SO developers mailing list. For new term suggestions, please use the Term Tracker.

The Sequence Ontology is a set of terms and relationships used to describe the features and attributes of biological sequence. SO includes different kinds of features which can be located on the sequence. Biological features are those which are defined by their disposition to be involved in a biological process. Examples are binding_site and exon. Biomaterial features are those which are intended for use in an experiment such as aptamer and PCR_product. There are also experimental features which are the result of an experiment. SO also provides a rich set of attributes to describe these features such as “polycistronic” and “maternally imprinted”.

The Sequence Ontologies are provided as a resource to the biological community. They have the following obvious uses:

To provide for a structured controlled vocabulary for the description of primary annotations of nucleic acid sequence, e.g. the annotations shared by a DAS server (BioDAS, Biosapiens DAS), or annotations encoded by GFF3.
To provide for a structured representation of these annotations within databases. Were genes within model organism databases to be annotated with these terms then it would be possible to query all these databases for, for example, all genes whose transcripts are edited, or trans-spliced, or are bound by a particular protein. One such genomic database is Chado.
To provide a structured controlled vocabulary for the description of mutations at both the sequence and more gross level in the context of genomic databases.

The Sequence Ontology is part of OBO. It has close links to other ontology projects such as the RNAO consortium, and the Biosapiens polypeptide features.

Publications

The Sequence Ontology: a tool for the unification of genome annotations.

Evolution of the Sequence Ontology terms and relationships.

Products

so.owl	Main SO OWL release
so.obo	Main SO release in OBO Format
so/subsets/SOFA.owl	Sequence Ontology Feature Annotation (SOFA) subset (OWL)	This subset includes only locatable sequence features and is designed for use in such outputs as GFF3
so/subsets/SOFA.obo	Sequence Ontology Feature Annotation (SOFA) subset (OBO Format)	This subset includes only locatable sequence features and is designed for use in such outputs as GFF3

ID Space: so
PURL: http://purl.obolibrary.org/obo/so.owl
License: CC BY 4.0
Homepage: http://www.sequenceontology.org/; https://en.wikipedia.org/wiki/Sequence_Ontology
Contact: Karen Eilbeck; 0000-0002-0831-6427; @keilbeck
Repository: https://github.com/The-Sequence-Ontology/SO-Ontologies
Tracker: https://github.com/The-Sequence-Ontology/SO-Ontologies/issues
Domain: chemistry and biochemistry
Mailing List: https://sourceforge.net/p/song/mailman/song-devel/
Stars
Contributors
Last Commit
OBO Dashboard

View Edit PURL Config

Generated by: _layouts/ontology_detail.html
See metadata guide

Sequence types and features ontology