Class: NcbiTaxon
A taxonomy term from NCBI Taxonomy. NcbiTaxon instances are identified by NCBITaxon CURIEs (e.g. NCBITaxon:511145) and stored in ontology_class_set alongside other OntologyClass instances. No NMDC-minted identifiers are assigned. Support for additional taxonomic authorities (GTDB, LPSN, SeqCode) would be represented by separate classes or by widening this class's id_prefixes in a follow-on PR.
URI: nmdc:NcbiTaxon
classDiagram
class NcbiTaxon
click NcbiTaxon href "../NcbiTaxon"
OntologyClass <|-- NcbiTaxon
click OntologyClass href "../OntologyClass"
NcbiTaxon : alternative_identifiers
NcbiTaxon : alternative_names
NcbiTaxon : definition
NcbiTaxon : description
NcbiTaxon : id
NcbiTaxon : is_obsolete
NcbiTaxon : is_root
NcbiTaxon : name
NcbiTaxon : relations
NcbiTaxon --> "*" OntologyRelation : relations
click OntologyRelation href "../OntologyRelation"
NcbiTaxon : type
Inheritance
- NamedThing
- OntologyClass
- NcbiTaxon
- OntologyClass
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| alternative_names | * String |
A list of alternative names used to refer to the entity | OntologyClass |
| relations | * OntologyRelation |
OntologyClass | |
| definition | 0..1 String |
The definition of the ontology term as provided by the ontology | OntologyClass |
| is_obsolete | 0..1 Boolean |
A boolean value indicating whether the ontology term is obsolete | OntologyClass |
| is_root | 0..1 Boolean |
A boolean value indicating whether the ontology term is a root term; it is no... | OntologyClass |
| id | 1 Uriorcurie |
A unique identifier for a thing | NamedThing |
| name | 0..1 String |
A human readable label for an entity | NamedThing |
| description | 0..1 String |
a human-readable description of a thing | NamedThing |
| alternative_identifiers | * Uriorcurie |
A list of alternative identifiers for the entity | NamedThing |
| type | 1 Uriorcurie |
the class_uri of the class that has been instantiated | NamedThing |
Comments
- Follows the OrthologyGroup pattern: a concrete, directly-instantiated semantic subclass of OntologyClass with no additional slots, no typecode, and no dedicated Database collection. Instances go into ontology_class_set and are referenced from sample slots such as classified_as.
- Sub-species information (strains, cultivars, lab-specific isolates) is below the resolution of NCBI Taxonomy. Strain identity should be captured via dedicated text slots on OrganismSample (e.g. isolate_name, source_mat_id, subspecf_gen_lin) or a future Strain class — not by minting NcbiTaxon instances for strains.
- Existing taxonomy slots (samp_taxon_id, host_taxid) already use ControlledIdentifiedTermValue with range OntologyClass. The intent is that these slots point to NcbiTaxon instances specifically, but LinkML does not currently support constraining the term slot of a ControlledIdentifiedTermValue to a particular OntologyClass subclass.
See Also
- https://github.com/microbiomedata/nmdc-schema/issues/2959
- https://github.com/microbiomedata/nmdc-schema/issues/2971
Identifier and Mapping Information
Valid ID Prefixes
Instances of this class should have identifiers with one of the following prefixes:
- NCBITaxon
Schema Source
- from schema: https://w3id.org/nmdc/nmdc
Mappings
| Mapping Type | Mapped Value |
|---|---|
| close | biolink:OrganismTaxon |
LinkML Source
Direct
name: NcbiTaxon
id_prefixes:
- NCBITaxon
description: A taxonomy term from NCBI Taxonomy. NcbiTaxon instances are identified
by NCBITaxon CURIEs (e.g. NCBITaxon:511145) and stored in ontology_class_set alongside
other OntologyClass instances. No NMDC-minted identifiers are assigned. Support
for additional taxonomic authorities (GTDB, LPSN, SeqCode) would be represented
by separate classes or by widening this class's id_prefixes in a follow-on PR.
comments:
- 'Follows the OrthologyGroup pattern: a concrete, directly-instantiated semantic
subclass of OntologyClass with no additional slots, no typecode, and no dedicated
Database collection. Instances go into ontology_class_set and are referenced from
sample slots such as classified_as.'
- Sub-species information (strains, cultivars, lab-specific isolates) is below the
resolution of NCBI Taxonomy. Strain identity should be captured via dedicated text
slots on OrganismSample (e.g. isolate_name, source_mat_id, subspecf_gen_lin) or
a future Strain class — not by minting NcbiTaxon instances for strains.
- Existing taxonomy slots (samp_taxon_id, host_taxid) already use ControlledIdentifiedTermValue
with range OntologyClass. The intent is that these slots point to NcbiTaxon instances
specifically, but LinkML does not currently support constraining the term slot of
a ControlledIdentifiedTermValue to a particular OntologyClass subclass.
from_schema: https://w3id.org/nmdc/nmdc
see_also:
- https://github.com/microbiomedata/nmdc-schema/issues/2959
- https://github.com/microbiomedata/nmdc-schema/issues/2971
close_mappings:
- biolink:OrganismTaxon
is_a: OntologyClass
slot_usage:
id:
name: id
comments:
- Validation is intentionally limited to NCBITaxon CURIEs. If GTDB, LPSN, or SeqCode
support is added, widen this pattern in lockstep with id_prefixes and example
data.
pattern: ^NCBITaxon:\d+$
class_uri: nmdc:NcbiTaxon
Induced
name: NcbiTaxon
id_prefixes:
- NCBITaxon
description: A taxonomy term from NCBI Taxonomy. NcbiTaxon instances are identified
by NCBITaxon CURIEs (e.g. NCBITaxon:511145) and stored in ontology_class_set alongside
other OntologyClass instances. No NMDC-minted identifiers are assigned. Support
for additional taxonomic authorities (GTDB, LPSN, SeqCode) would be represented
by separate classes or by widening this class's id_prefixes in a follow-on PR.
comments:
- 'Follows the OrthologyGroup pattern: a concrete, directly-instantiated semantic
subclass of OntologyClass with no additional slots, no typecode, and no dedicated
Database collection. Instances go into ontology_class_set and are referenced from
sample slots such as classified_as.'
- Sub-species information (strains, cultivars, lab-specific isolates) is below the
resolution of NCBI Taxonomy. Strain identity should be captured via dedicated text
slots on OrganismSample (e.g. isolate_name, source_mat_id, subspecf_gen_lin) or
a future Strain class — not by minting NcbiTaxon instances for strains.
- Existing taxonomy slots (samp_taxon_id, host_taxid) already use ControlledIdentifiedTermValue
with range OntologyClass. The intent is that these slots point to NcbiTaxon instances
specifically, but LinkML does not currently support constraining the term slot of
a ControlledIdentifiedTermValue to a particular OntologyClass subclass.
from_schema: https://w3id.org/nmdc/nmdc
see_also:
- https://github.com/microbiomedata/nmdc-schema/issues/2959
- https://github.com/microbiomedata/nmdc-schema/issues/2971
close_mappings:
- biolink:OrganismTaxon
is_a: OntologyClass
slot_usage:
id:
name: id
comments:
- Validation is intentionally limited to NCBITaxon CURIEs. If GTDB, LPSN, or SeqCode
support is added, widen this pattern in lockstep with id_prefixes and example
data.
pattern: ^NCBITaxon:\d+$
attributes:
alternative_names:
name: alternative_names
description: A list of alternative names used to refer to the entity. The distinction
between name and alternative names is application-specific. This should not
be used for identifers which have their own slots (e.g., bioproject:PRJNA406974)
from_schema: https://w3id.org/nmdc/nmdc
exact_mappings:
- dcterms:alternative
- skos:altLabel
rank: 1000
alias: alternative_names
owner: NcbiTaxon
domain_of:
- OntologyClass
- Study
- Biosample
range: string
multivalued: true
relations:
name: relations
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: relations
owner: NcbiTaxon
domain_of:
- OntologyClass
range: OntologyRelation
multivalued: true
inlined: true
inlined_as_list: true
definition:
name: definition
description: The definition of the ontology term as provided by the ontology.
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: definition
owner: NcbiTaxon
domain_of:
- OntologyClass
range: string
is_obsolete:
name: is_obsolete
description: A boolean value indicating whether the ontology term is obsolete.
comments:
- If true (the ontology term is declared obsolete via the ontology source itself),
the term is no longer considered a valid term to use in an annotation at NMDC,
and it no longer has ontology_relation_set records.
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: is_obsolete
owner: NcbiTaxon
domain_of:
- OntologyClass
range: boolean
is_root:
name: is_root
description: A boolean value indicating whether the ontology term is a root term;
it is not a subclass of any other term.
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: is_root
owner: NcbiTaxon
domain_of:
- OntologyClass
range: boolean
id:
name: id
description: A unique identifier for a thing. Must be either a CURIE shorthand
for a URI or a complete URI
notes:
- The identifiers for terms from external ontologies can't have their ids constrained
to the nmdc namespace
comments:
- Validation is intentionally limited to NCBITaxon CURIEs. If GTDB, LPSN, or SeqCode
support is added, widen this pattern in lockstep with id_prefixes and example
data.
examples:
- value: nmdc:mgmag-00-x012.1_7_c1
description: https://github.com/microbiomedata/nmdc-schema/pull/499#discussion_r1018499248
from_schema: https://w3id.org/nmdc/nmdc
structured_aliases:
- literal_form: workflow_execution_id
predicate: NARROW_SYNONYM
contexts:
- https://bitbucket.org/berkeleylab/jgi-jat/macros/nmdc_metadata.yaml
- literal_form: data_object_id
predicate: NARROW_SYNONYM
contexts:
- https://bitbucket.org/berkeleylab/jgi-jat/macros/nmdc_metadata.yaml
rank: 1000
identifier: true
alias: id
owner: NcbiTaxon
domain_of:
- NamedThing
range: uriorcurie
required: true
pattern: ^NCBITaxon:\d+$
name:
name: name
description: A human readable label for an entity
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: name
owner: NcbiTaxon
domain_of:
- PersonValue
- NamedThing
- Protocol
range: string
description:
name: description
description: a human-readable description of a thing
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
slot_uri: dcterms:description
alias: description
owner: NcbiTaxon
domain_of:
- ImageValue
- NamedThing
- Protocol
range: string
alternative_identifiers:
name: alternative_identifiers
description: A list of alternative identifiers for the entity.
from_schema: https://w3id.org/nmdc/nmdc
rank: 1000
alias: alternative_identifiers
owner: NcbiTaxon
domain_of:
- NamedThing
- MetaboliteIdentification
range: uriorcurie
multivalued: true
pattern: ^[a-zA-Z0-9][a-zA-Z0-9_\.]+:[a-zA-Z0-9_][a-zA-Z0-9_\-\/\.,\(\)\=\#]*$
type:
name: type
description: the class_uri of the class that has been instantiated
notes:
- makes it easier to read example data files
- required for polymorphic MongoDB collections
examples:
- value: nmdc:Biosample
- value: nmdc:Study
from_schema: https://w3id.org/nmdc/nmdc
see_also:
- https://github.com/microbiomedata/nmdc-schema/issues/1048
- https://github.com/microbiomedata/nmdc-schema/issues/1233
- https://github.com/microbiomedata/nmdc-schema/issues/248
structured_aliases:
- literal_form: workflow_execution_class
predicate: NARROW_SYNONYM
contexts:
- https://bitbucket.org/berkeleylab/jgi-jat/macros/nmdc_metadata.yaml
rank: 1000
slot_uri: rdf:type
designates_type: true
alias: type
owner: NcbiTaxon
domain_of:
- EukEval
- FunctionalAnnotationAggMember
- PeptideQuantification
- ProteinQuantification
- GenomeFeature
- FunctionalAnnotation
- AttributeValue
- NamedThing
- OntologyRelation
- FailureCategorization
- Protocol
- CreditAssociation
- Doi
- ProvenanceMetadata
- MobilePhaseSegment
- PortionOfSubstance
- MagBin
- MetaboliteIdentification
range: uriorcurie
required: true
class_uri: nmdc:NcbiTaxon