Genotype Ontology

Last uploaded: October 8, 2023
Preferred Name

genotype

Synonyms
Definitions

1. Scope of 'Genetic State': 'Genetic state' is considered quite broadly in GENO to describe two general kinds of 'states'. First, is traditional notion of 'allelic state' - defined as the complement of alleles present at a particular location or locations in a genome (i.e. across all homologous chromosomes containing this location). Here, a genotype can describe allelic state at a specific locus in a genome (an 'allelic genotype'), or describe the allelic state across the entire genome ('genomic genotype'). Second, this concept can also describe states of genomic features 'extrinsic' to their intrinsic sequence, such as the expression status of a gene as a result of being specifically targeted by experimental interventions such as RNAi, morpholinos, or CRISPRs. 2. Genotype Subtypes: In GENO, we use the term 'intrinsic' for genotypes describing variation in genomic sequence, and 'extrinsic' for genotypes describing variation in gene expression (e.g. resulting from the targeted experimental knock-down or over-expression of endogenous genes). We use the term 'effective genotype' to describe the total intrinsic and extrinsic variation in a cell or organism at the time a phenotypic assessment is performed. Two more precise conccepts are subsumed by the notion of an 'intrinsic genotype': (1) 'allelic genotypes', which specify allelic state at a single genomic location; and (2) 'genomic genotypes', which specify allelic state across an entire genome. In both cases, allelic state is typically specified in terms of a differential between a reference and a set of 1 or more known variant features. 3. The Genotype Partonomy: 'Genomic genotypes' describing sequence variation across an entire genome are 'decomposed' in GENO into a partonomy of more granular levels of variation. These levels are defined to be meaningful to biologists in their attempts to relate genetic variation to phenotypic features. They include 'genomic variation complement' (GVC), 'variant single locus complement' (VSLC), 'allele', 'haplotype', 'sequence alteration', and 'genomic background' classes. For example, the components of the zebrafish genotype "fgf8a<ti282a/ti282a>; fgf3<t24149/+>[AB]", described at zfin.org/ZDB-FISH-150901-9362, include the following elements: - GVC: fgf8a<ti282a/ti282a>; fgf3<t24149/+> (total intrinsic variation in the genome) - Genomic Background: AB (the reference against which the GVC is variant) - VSLC1: fgf8a<ti282a/ti282a> (homozygous complement of gene alleles at one known variant locus) - VSLC2: fgf3<t24149/+> (heterozygous complement of gene alleles at another known variant locus) - Allele 1: fgf8a<ti282a> (variant version of the fgf8a gene, present in two copies) - Allele 2: fgf3<t24149> (variant version of the fgf3 gene, present in one copy) - Allele 3: fgf3<+> (wild-type version of the fgf3 gene, present in one copy) - Sequence Alteration1: <ti282a> (the specific mutation within the fgf8a gene that makes it variant) - Sequence Alteration2: <t24149> (the specific mutation within the fgf3 gene that makes it variant) A graphical representation of this decomposition that maps each element to a visual depiction of the portion of a genome it denotes can be found here: https://github.com/monarch-initiative/GENO-ontology/blob/develop/README.md One reason that explicit representation of these levels is important is because it is at these levels that phenotypic features are annotated to genetic variations in different clinical and model organism databases For example, ZFIN typically annotates phenotypes to effective genotypes, MGI to intrinsic genotypes, Wormbase to variant alleles, and ClinVar to haplotypes and sequence alterations. The ability to decompose a genotype into representations at these levels allows us to "propagate phenotypes" up or down the partonomy (e.g. infer associations of phenotypes annotated to a genotype to its more granular levels of variation and the gene(s) affected). This helps to supporting integrated analysis of G2P data. A specification of the genetic state of an organism, whether complete (defined over the whole genome) or incomplete (defined over a subset of the genome). Genotypes typically describe this genetic state as a diff between some variant component and a canonical reference.

ID

http://purl.obolibrary.org/obo/GENO_0000536

comment

1. Scope of 'Genetic State': 'Genetic state' is considered quite broadly in GENO to describe two general kinds of 'states'. First, is traditional notion of 'allelic state' - defined as the complement of alleles present at a particular location or locations in a genome (i.e. across all homologous chromosomes containing this location). Here, a genotype can describe allelic state at a specific locus in a genome (an 'allelic genotype'), or describe the allelic state across the entire genome ('genomic genotype'). Second, this concept can also describe states of genomic features 'extrinsic' to their intrinsic sequence, such as the expression status of a gene as a result of being specifically targeted by experimental interventions such as RNAi, morpholinos, or CRISPRs. 2. Genotype Subtypes: In GENO, we use the term 'intrinsic' for genotypes describing variation in genomic sequence, and 'extrinsic' for genotypes describing variation in gene expression (e.g. resulting from the targeted experimental knock-down or over-expression of endogenous genes). We use the term 'effective genotype' to describe the total intrinsic and extrinsic variation in a cell or organism at the time a phenotypic assessment is performed. Two more precise conccepts are subsumed by the notion of an 'intrinsic genotype': (1) 'allelic genotypes', which specify allelic state at a single genomic location; and (2) 'genomic genotypes', which specify allelic state across an entire genome. In both cases, allelic state is typically specified in terms of a differential between a reference and a set of 1 or more known variant features. 3. The Genotype Partonomy: 'Genomic genotypes' describing sequence variation across an entire genome are 'decomposed' in GENO into a partonomy of more granular levels of variation. These levels are defined to be meaningful to biologists in their attempts to relate genetic variation to phenotypic features. They include 'genomic variation complement' (GVC), 'variant single locus complement' (VSLC), 'allele', 'haplotype', 'sequence alteration', and 'genomic background' classes. For example, the components of the zebrafish genotype "fgf8a; fgf3[AB]", described at zfin.org/ZDB-FISH-150901-9362, include the following elements: - GVC: fgf8a; fgf3 (total intrinsic variation in the genome) - Genomic Background: AB (the reference against which the GVC is variant) - VSLC1: fgf8a (homozygous complement of gene alleles at one known variant locus) - VSLC2: fgf3 (heterozygous complement of gene alleles at another known variant locus) - Allele 1: fgf8a (variant version of the fgf8a gene, present in two copies) - Allele 2: fgf3 (variant version of the fgf3 gene, present in one copy) - Allele 3: fgf3<+> (wild-type version of the fgf3 gene, present in one copy) - Sequence Alteration1: (the specific mutation within the fgf8a gene that makes it variant) - Sequence Alteration2: (the specific mutation within the fgf3 gene that makes it variant) A graphical representation of this decomposition that maps each element to a visual depiction of the portion of a genome it denotes can be found here: https://github.com/monarch-initiative/GENO-ontology/blob/develop/README.md One reason that explicit representation of these levels is important is because it is at these levels that phenotypic features are annotated to genetic variations in different clinical and model organism databases For example, ZFIN typically annotates phenotypes to effective genotypes, MGI to intrinsic genotypes, Wormbase to variant alleles, and ClinVar to haplotypes and sequence alterations. The ability to decompose a genotype into representations at these levels allows us to "propagate phenotypes" up or down the partonomy (e.g. infer associations of phenotypes annotated to a genotype to its more granular levels of variation and the gene(s) affected). This helps to supporting integrated analysis of G2P data.

definition

A specification of the genetic state of an organism, whether complete (defined over the whole genome) or incomplete (defined over a subset of the genome). Genotypes typically describe this genetic state as a diff between some variant component and a canonical reference.

definition source

Core definition above adapted from the GA4GH VMC data model definition here: https://docs.google.com/document/d/12E8WbQlvfZWk5NrxwLytmympPby6vsv60RxCeD5wc1E/edit#heading=h.4e32jj4jtmsl (retrieved 2018-04-09). Note however that the VMC genotype concept likely is not intended to cover 'effective' and 'extrinsic' genotype concepts defined in GENO.

editor note

As information artifacts, genotypes specify the state of a genome be defining a diff between some canonical reference and a variant or alternate sequence that replaces the corresponding portion of the reference. We can consider a genotype then as a collection of these reference and variant features, along with some rule for operating on them and resolve a final single sequence. This is valid ontologically because we commit only to sequence features being GDCs - which allows for their concretization in either biological or informational patterns. Accordingly, a particular gene allele, such as shh, can be part of a genome in a biological sense and part of a genotype in an informational sense. This idea underpins the 'genotype partonomy' at the core of the GENO model that decomposes a complete genotype into its more fundamental parts, including alleles and allele complements, as described in the comment above.

label

genotype

prefixIRI

GENO:0000536

prefLabel

genotype

subClassOf

http://purl.obolibrary.org/obo/IAO_0000030

Delete Subject Author Type Created
No notes to display
Create mapping

Delete Mapping To Ontology Source
http://purl.obolibrary.org/obo/GENO_0000536 SLSO SAME_URI
http://vocabs.lter-europe.net/EnvThes/AF_c3_2342 ENVTHES LOOM
http://purl.obolibrary.org/obo/SO_0001027 SO LOOM
http://vocabs.lter-europe.net/EnvThes/21403 ENVTHES LOOM
http://www.ebi.ac.uk/efo/EFO_0000513 EFO LOOM
https://w3id.org/biolink/vocab/Genotype BIOLINK LOOM
http://purl.bioontology.org/ontology/LNC/LP29254-7 LOINC LOOM
http://purl.obolibrary.org/obo/MCO_0000383 MCO LOOM
http://purl.obolibrary.org/obo/LifO_0000002 LIFO LOOM
http://phenomebrowser.net/ontologies/mesh/mesh.owl#G05.380 RH-MESH LOOM
http://ncicb.nci.nih.gov/xml/owl/EVS/Thesaurus.owl#C16631 NCIT LOOM
http://sbmi.uth.tmc.edu/ontology/ochv#C0017431 OCHV LOOM
http://purl.jp/bio/4/id/200906050296267914 IOBC LOOM
http://www.projecthalo.com/aura#Genotype AURA LOOM
http://www.bioontology.org/pma.owl#PMA_225 PMA LOOM
http://sbmi.uth.tmc.edu/ontology/ochv#C1285573 OCHV LOOM
http://purl.obolibrary.org/obo/OGI_0000082 OGI LOOM
http://purl.obolibrary.org/obo/OGI_0000082 OGSF LOOM
http://purl.bioontology.org/ontology/RCTV2/4L1..00 RCTV2 LOOM
http://purl.obolibrary.org/obo/NCIT_C16631 PREMEDONTO LOOM
http://regenbase.org/ontology#RB_0000487 RB LOOM
http://www.loria.fr/~coulet/sopharm/SOPHARM_40000 SOPHARM LOOM
http://semanticscience.org/resource/SIO_001079 HHEARP LOOM
http://semanticscience.org/resource/SIO_001079 SIO LOOM
http://semanticscience.org/resource/SIO_001079 CHEAR LOOM
http://semanticscience.org/resource/SIO_001079 BIOMO LOOM
http://semanticscience.org/resource/SIO_001079 SCO LOOM
http://semanticscience.org/resource/SIO_001079 HHEAR LOOM
http://semanticscience.org/resource/SIO_001079 HASCO LOOM
http://doe-generated-ontology.com/OntoAD#C0017431 ONTOAD LOOM
http://scai.fraunhofer.de/AlzheimerOntology#Genetics NIO LOOM
http://semanticscience.org/resource/Genotype CHEAR LOOM
https://www.codamono.com/biointerchange/gfvo#Genotype GFVO LOOM
http://purl.bioontology.org/ontology/SNMI/F-E0280 SNMI LOOM
http://purl.obolibrary.org/obo/ODGI.owl#Genotype OGDI LOOM
http://www.ebi.ac.uk/efo/EFO_0000513 ENM LOOM
http://mged.sourceforge.net/ontologies/MGEDOntology.owl#Genotype CSEO LOOM
http://mged.sourceforge.net/ontologies/MGEDOntology.owl#Genotype MO LOOM
http://purl.obolibrary.org/obo/HIO_0000070 HIO LOOM
https://github.com/sap218/coid/blob/master/coid.owl#COID_00818 COID LOOM
http://purl.obolibrary.org/obo/SCDO_0004061 SCDO LOOM
http://data.loterre.fr/ark:/67375/BLH-QNPFGGTQ-Q BIODIVTHES LOOM
http://purl.obolibrary.org/obo/OMIT_0007033 OMIT LOOM
http://purl.bioontology.org/NEMO/ontology/NEMO.owl#NEMO_8906000 NEMO LOOM
http://semanticscience.org/resource/genotype MATERIALSMINE LOOM
http://purl.org/obo/owl/SO#SO_0001207 ONSTR LOOM
http://purl.bioontology.org/ontology/MESH/D005838 MESH LOOM
http://sbmi.uth.tmc.edu/ontology/ochv#5452 OCHV LOOM
http://bioontology.org/projects/ontologies/birnlex#birnlex_2023 BIRNLEX LOOM
http://purl.obolibrary.org/obo/ExO_0000107 EXO LOOM
http://www.w3.org/2001/sw/hcls/ns/transmed/TMO_0080 TMO LOOM
http://uri.neuinfo.org/nif/nifstd/birnlex_2023 NIFSTD LOOM
http://phenomebrowser.net/ontologies/mesh/mesh.owl#D005838 RH-MESH LOOM
http://purl.obolibrary.org/obo/MESH_D005838 BERO LOOM
http://purl.bioontology.org/ontology/CSP/1258-5784 CRISP LOOM
http://purl.bioontology.org/ontology/LNC/LP345004-8 LOINC LOOM
http://jermontology.org/ontology/JERMOntology#genotype JERM LOOM
http://www.pepathway.org/peo/1.2#Genetics PE-O LOOM
http://purl.obolibrary.org/obo/SO_0001027 EUPATH LOOM
http://purl.obolibrary.org/obo/SO_0001027 NIFSTD LOOM
http://purl.obolibrary.org/obo/SO_0001027 GENEPIO LOOM