<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="https://ontobee.org/ontology/view/GENO?iri=http://purl.obolibrary.org/obo/GENO_0000961"?>
<rdf:RDF xmlns="http://www.w3.org/2002/07/owl#"
     xml:base="http://www.w3.org/2002/07/owl"
     xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
     xmlns:owl="http://www.w3.org/2002/07/owl#"
     xmlns:oboInOwl="http://www.geneontology.org/formats/oboInOwl#"
     xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
     xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
     xmlns:ns2="http://purl.obolibrary.org/obo/"
     xmlns:foaf="http://xmlns.com/foaf/0.1/"
     xmlns:dc="http://purl.org/dc/elements/1.1/">
    


    <!-- 
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Annotation properties
    //
    ///////////////////////////////////////////////////////////////////////////////////////
     -->

    <AnnotationProperty rdf:about="http://purl.obolibrary.org/obo/IAO_0000116"/>
    <AnnotationProperty rdf:about="http://purl.obolibrary.org/obo/IAO_0000115"/>
    


    <!-- 
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Datatypes
    //
    ///////////////////////////////////////////////////////////////////////////////////////
     -->

    


    <!-- 
    ///////////////////////////////////////////////////////////////////////////////////////
    //
    // Classes
    //
    ///////////////////////////////////////////////////////////////////////////////////////
     -->

    


    <!-- http://purl.obolibrary.org/obo/GENO_0000872 -->

    <Class rdf:about="http://purl.obolibrary.org/obo/GENO_0000872">
        <rdfs:label xml:lang="en">genomic sequence set</rdfs:label>
    </Class>
    


    <!-- http://purl.obolibrary.org/obo/GENO_0000961 -->

    <Class rdf:about="http://purl.obolibrary.org/obo/GENO_0000961">
        <rdfs:label xml:lang="en">copy number complement</rdfs:label>
        <rdfs:subClassOf rdf:resource="http://purl.obolibrary.org/obo/GENO_0000872"/>
        <rdfs:comment>The notion of a &#39;complement&#39; is useful as a special case of a set, where the members necessarily comprise an exhaustive collection of *all* objects that make up some well-defined set. Here, a &#39;copy number complement&#39; represents &#39;represents the set of *all* copies of a specified sequence in a particular genome. Note that sequences can be duplicated in a set (i.e. contain more than one member representing the same sequence). In the &#39;copy number complement&#39; example, each set member is a copy of this same biological sequence.</rdfs:comment>
        <rdfs:comment>The count of how many of a particular sequences are found in a genome is the sequences &#39;copy number&#39;. In diploid organisms, the normal copy number for sequences at most locations is 2 (a notable exception being those on the X-chromosome where normal copy number is 1). Variations in copy number occur if this count increases due to a duplication of the gene/region, or decreases due to a deletion of a gene/region. A driving use case for representing copy number is to support associations between variation in copy number of a particular sequence, and phenotypes or diseases that can result.

A &#39;complement&#39; refers to an exhaustive collection of *all* objects that make up some well-defined set. Such a set may contain 0, 1, or more than one members. The notion of a complement is useful for defining many biologically-relevant sets of sequence features, such as &#39;copy number complements&#39; representing the set of all copies of a particular sequence in a genome. 

The fact that we are counting how many copies of the same *sequence* exist in a genome here, as opposed to how many of the same *feature*, is what sets sequence-level concepts like &#39;copy number complement&#39; apart from feature-level concepts like &#39;single locus complement&#39;. To illustrate the difference, consider a duplication event that creates a new copy of the human APOE gene on a different chromosome. This creates an entirely new sequence feature at a distinct locus from that of the original APOE gene.  The &#39;copy number complement&#39; for sequence defined by the APOE gene locus would have a count of three, as this sequence is present three times in the genome.  But the &#39;single locus complement&#39; at the APOE gene locus would still have a count of two - because the duplicated copy is at a different location in the genome, and therefore does not represent a copy of the APOE locus.</rdfs:comment>
        <ns2:IAO_0000115>A set representing the complement of all copies of a particular biological sequence (typically at the scale of complete genes or larger) present in a particular genome.</ns2:IAO_0000115>
        <ns2:IAO_0000116>The identity of a &#39;copy number complement&#39; instance is determined by the sequence defining its members, and their count (the number of times this sequence appears in a particular genome). In reality the sequence of each copy may not be identical, given the tendency of large regions to accumulate subtle variations. What matters is that they share a common origin/alignment with a defining location in a reference genome.

We represent the notion of copy number at the &quot;sequence level&quot; (as opposed to the &quot;sequence feature level&quot;)  because we are concerned only with the number of copies of a sequence in a genome, and not the location of the features bearing this sequence. Consider a copy number complement comprised of three copies of the sequence defined by the location Chr8 100000-200000 on a GRCh38.2 reference genome.  In one person&#39;s genome, this sequence may appear at its normal location on Chromosome 8, as well as in duplications on chromosomes 5, and 12. In another genome the sequence might appear three times as well, but on chromosomes 8, 9, and 15.  When representing causal associations linking copy number to disease, it is important that these are considered to be *the same* copy number complement - because what a curator associates with a disease is the presence of three copies of some sequence in a genome, independent of their location. The &quot;sequence level&quot; representation here supports this use case.  By contrast, a &quot;feature level&quot; representation, where identity of a copy number complement would be based on the identity of member *features*), does not - because we have two sets comprised of entirely different features (based on location being tied to their identity).</ns2:IAO_0000116>
    </Class>
</rdf:RDF>



<!-- Generated by the OWL API (version 3.2.4.1806) http://owlapi.sourceforge.net -->



