Class: Isolate (IsolateInterface)
Metadata for microbial isolate samples submitted to NMDC. Captures core biological identity slots (taxonomy, strain, culture collection) that NMDC stores and queries. Slots are sourced from nmdc-schema OrganismSample and Organism classes; depends on nmdc-schema PR #2977 and #2975 merging.
URI: nmdc_sub_schema:IsolateInterface
classDiagram
class IsolateInterface
click IsolateInterface href "../IsolateInterface/"
DhMultiviewCommonColumnsMixin <|-- IsolateInterface
click DhMultiviewCommonColumnsMixin href "../DhMultiviewCommonColumnsMixin/"
DhInterface <|-- IsolateInterface
click DhInterface href "../DhInterface/"
IsolateInterface : analysis_type
IsolateInterface --> "1..*" AnalysisTypeEnum : analysis_type
click AnalysisTypeEnum href "../AnalysisTypeEnum/"
IsolateInterface : classified_as
IsolateInterface : collection_date
IsolateInterface : gc_content
IsolateInterface : isolate_known_contaminants
IsolateInterface : isolate_name
IsolateInterface : isolate_single_colony
IsolateInterface --> "1" YesNoEnum : isolate_single_colony
click YesNoEnum href "../YesNoEnum/"
IsolateInterface : organism_genus
IsolateInterface : organism_species
IsolateInterface : ploidy
IsolateInterface --> "0..1" PloidyEnum : ploidy
click PloidyEnum href "../PloidyEnum/"
IsolateInterface : samp_name
IsolateInterface : source_mat_id
IsolateInterface : strain_name
Inheritance
- DhInterface
- IsolateInterface [ DhMultiviewCommonColumnsMixin]
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| isolate_single_colony | 1 YesNoEnum |
Was the sample isolated from a single colony or equivalent axenic/pure cultur... | direct |
| isolate_known_contaminants | 0..1 recommended String |
List any organisms known or suspected to grow in co-culture or that may be co... | direct |
| collection_date | 1 String |
The date of sampling | direct |
| source_mat_id | 0..1 String |
A globally unique identifier assigned to the parent sample or sample that is ... | direct |
| organism_genus | 1 String |
Genus of the organism | direct |
| organism_species | 1 String |
Species of the organism | direct |
| strain_name | 1 String |
Strain or cultivar name of the organism | direct |
| isolate_name | 0..1 String |
Isolate or mutant name | direct |
| classified_as | 1 String |
Taxonomic classification of this organism | direct |
| gc_content | 0..1 Float |
Estimated GC content as a percentage | direct |
| ploidy | 0..1 PloidyEnum |
The ploidy level of the genome (e | direct |
| analysis_type | 1..* AnalysisTypeEnum |
Select all the data types associated or available for this biosample | DhMultiviewCommonColumnsMixin |
| samp_name | 1 String |
A local identifier or name that for the material sample collected | DhMultiviewCommonColumnsMixin |
Usages
| used by | used in | type | used |
|---|---|---|---|
| SampleData | isolate_data | range | IsolateInterface |
Identifier and Mapping Information
Annotations
| property | value |
|---|---|
| excel_worksheet_name | Isolate |
Schema Source
- from schema: https://example.com/nmdc_submission_schema
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | nmdc_sub_schema:IsolateInterface |
| native | nmdc_sub_schema:IsolateInterface |
LinkML Source
Direct
name: IsolateInterface
annotations:
excel_worksheet_name:
tag: excel_worksheet_name
value: Isolate
description: 'Metadata for microbial isolate samples submitted to NMDC. Captures core
biological identity slots (taxonomy, strain, culture collection) that NMDC stores
and queries. Slots are sourced from nmdc-schema OrganismSample and Organism classes;
depends on nmdc-schema PR #2977 and #2975 merging.'
title: Isolate
from_schema: https://example.com/nmdc_submission_schema
is_a: DhInterface
mixins:
- DhMultiviewCommonColumnsMixin
slots:
- isolate_single_colony
- isolate_known_contaminants
- collection_date
- source_mat_id
- organism_genus
- organism_species
- strain_name
- isolate_name
- classified_as
- gc_content
- ploidy
slot_usage:
isolate_single_colony:
name: isolate_single_colony
slot_group: organism_section
isolate_known_contaminants:
name: isolate_known_contaminants
slot_group: organism_section
collection_date:
name: collection_date
notes:
- JGI requires all three date components, so the full YYYY-MM-DD is mandatory
on the isolate interface. The export layer splits it into Collection Year, Collection
Month, and Collection Day columns.
comments:
- Use the date the sample was collected from its natural habitat, not the date
of purchase or culturing. Full date (YYYY-MM-DD) is required.
examples:
- value: '2017-01-20'
- value: '2013-03-25'
aliases:
- Collection Year
- Collection Month
- Collection Day
pattern: ^[12]\d{3}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])$
Induced
name: IsolateInterface
annotations:
excel_worksheet_name:
tag: excel_worksheet_name
value: Isolate
description: 'Metadata for microbial isolate samples submitted to NMDC. Captures core
biological identity slots (taxonomy, strain, culture collection) that NMDC stores
and queries. Slots are sourced from nmdc-schema OrganismSample and Organism classes;
depends on nmdc-schema PR #2977 and #2975 merging.'
title: Isolate
from_schema: https://example.com/nmdc_submission_schema
is_a: DhInterface
mixins:
- DhMultiviewCommonColumnsMixin
slot_usage:
isolate_single_colony:
name: isolate_single_colony
slot_group: organism_section
isolate_known_contaminants:
name: isolate_known_contaminants
slot_group: organism_section
collection_date:
name: collection_date
notes:
- JGI requires all three date components, so the full YYYY-MM-DD is mandatory
on the isolate interface. The export layer splits it into Collection Year, Collection
Month, and Collection Day columns.
comments:
- Use the date the sample was collected from its natural habitat, not the date
of purchase or culturing. Full date (YYYY-MM-DD) is required.
examples:
- value: '2017-01-20'
- value: '2013-03-25'
aliases:
- Collection Year
- Collection Month
- Collection Day
pattern: ^[12]\d{3}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])$
attributes:
isolate_single_colony:
name: isolate_single_colony
description: Was the sample isolated from a single colony or equivalent axenic/pure
culture?
title: single colony isolation
comments:
- Answer yes if the sample originated from a single colony pick, single-cell isolation,
or equivalent process expected to yield a genetically homogeneous culture.
examples:
- value: 'yes'
from_schema: https://example.com/nmdc_submission_schema
aliases:
- Is the sample isolated from a single colony or equivalent?
rank: 30
alias: isolate_single_colony
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: YesNoEnum
required: true
isolate_known_contaminants:
name: isolate_known_contaminants
description: List any organisms known or suspected to grow in co-culture or that
may be contaminating the culture, along with an estimate of the percentage.
title: known or suspected organisms
comments:
- Leave blank if no contaminants are known or suspected.
examples:
- value: Bacillus subtilis ~5%
from_schema: https://example.com/nmdc_submission_schema
aliases:
- Known / Suspected Organisms
rank: 31
alias: isolate_known_contaminants
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
recommended: true
collection_date:
name: collection_date
description: The date of sampling
title: collection date
notes:
- JGI requires all three date components, so the full YYYY-MM-DD is mandatory
on the isolate interface. The export layer splits it into Collection Year, Collection
Month, and Collection Day columns.
comments:
- Use the date the sample was collected from its natural habitat, not the date
of purchase or culturing. Full date (YYYY-MM-DD) is required.
examples:
- value: '2017-01-20'
- value: '2013-03-25'
from_schema: https://example.com/nmdc_submission_schema
aliases:
- Collection Year
- Collection Month
- Collection Day
rank: 3
keywords:
- date
slot_uri: MIXS:0000011
alias: collection_date
owner: IsolateInterface
domain_of:
- AirInterface
- BiofilmInterface
- BuiltEnvInterface
- HcrCoresInterface
- HcrFluidsSwabsInterface
- HostAssociatedInterface
- MiscEnvsInterface
- PlantAssociatedInterface
- SedimentInterface
- SoilInterface
- WastewaterSludgeInterface
- WaterInterface
- IsolateInterface
slot_group: mixs_modified_section
range: string
required: true
pattern: ^[12]\d{3}-(?:0[1-9]|1[0-2])-(?:0[1-9]|[12]\d|3[01])$
source_mat_id:
name: source_mat_id
annotations:
Expected_value:
tag: Expected_value
value: 'for cultures of microorganisms: identifiers for two culture collections;
for other material a unique arbitrary identifer'
description: A globally unique identifier assigned to the parent sample or sample
that is the source of this sample.
title: source material identifier
todos:
- Currently, the comments say to use UUIDs. However, if we implement assigning
NMDC identifiers with the minter we dont need to require a GUID. It can be an
optional field to fill out only if they already have a resolvable ID.
comments:
- Identifiers must be prefixed. Possible FAIR prefixes are IGSNs (http://www.geosamples.org/getigsn),
NCBI biosample accession numbers, ARK identifiers (https://arks.org/). These
IDs enable linking to derived analytes and subsamples. If you have not assigned
FAIR identifiers to your samples, you can generate UUIDs (https://www.uuidgenerator.net/).
- 'Prefixed FAIR identifiers are preferred where available: `igsn` for International
Generic Sample Numbers (http://www.geosamples.org/getigsn), `biosample` for
NCBI biosample accession IDs, `gold` for GOLD identifiers. The format is not
constrained, so culture collection identifiers such as "DSM 6724" or "ATCC 5680"
are also accepted.'
examples:
- value: igsn:AU1243
- value: biosample:SAMEA2397676
from_schema: https://example.com/nmdc_submission_schema
rank: 2
keywords:
- identifier
- material
- source
slot_uri: MIXS:0000026
alias: source_mat_id
owner: IsolateInterface
domain_of:
- DhMultiviewCommonColumnsMixin
- IsolateInterface
slot_group: sample_id_section
range: string
multivalued: false
organism_genus:
name: organism_genus
description: Genus of the organism.
comments:
- Free-text submitter-provided genus name. For an ontology-grounded classification,
use `classified_as` with a NcbiTaxon instance on the parent Organism class.
examples:
- value: Shewanella
description: GOLD organism_v2 Go0000189 (Shewanella loihica PV-4, queried 2026-04-21)
- value: Ruegeria
description: GOLD organism_v2 Go0000514 (Ruegeria pomeroyi DSS-3, queried 2026-04-21)
- value: Campylobacter
description: GOLD organism_v2 (Go0000058, queried 2026-04-14)
in_subset:
- jgi_isolate
from_schema: https://example.com/nmdc_submission_schema
structured_aliases:
Genus:
literal_form: Genus
predicate: EXACT_SYNONYM
notes:
- Exact JGI form template is access-restricted; source is the public submission
overview.
source: https://jgi.doe.gov/user-programs/pmo-overview/project-materials-submission-overview/
rank: 106
alias: organism_genus
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
required: true
organism_species:
name: organism_species
description: Species of the organism.
comments:
- Free-text submitter-provided species name. For an ontology-grounded classification,
use `classified_as` with a NcbiTaxon instance on the parent Organism class.
examples:
- value: loihica
description: GOLD organism_v2 Go0000189 (Shewanella loihica PV-4, queried 2026-04-21)
- value: pomeroyi
description: GOLD organism_v2 Go0000514 (Ruegeria pomeroyi DSS-3, queried 2026-04-21)
- value: sp.
description: GOLD organism_v2.species (n=37 records, queried 2026-04-30) — used
when the isolate has not yet been assigned a species name
in_subset:
- jgi_isolate
from_schema: https://example.com/nmdc_submission_schema
structured_aliases:
Species:
literal_form: Species
predicate: EXACT_SYNONYM
notes:
- Exact JGI form template is access-restricted; source is the public submission
overview.
source: https://jgi.doe.gov/user-programs/pmo-overview/project-materials-submission-overview/
rank: 107
alias: organism_species
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
required: true
strain_name:
name: strain_name
description: Strain or cultivar name of the organism.
comments:
- 'Microbial strain identifiers and plant cultivar names (governed by the International
Code of Nomenclature for Cultivated Plants, ICNCP) are nomenclaturally distinct,
but this slot accepts both for now to match the JGI Isolate (NA) v19 form''s
combined "Strain or cultivar" field. A separate `cultivar_name` slot may be
added if a plant-specific use case emerges; see #3056.'
- MIxS `subspecf_gen_lin` (MIXS:0000020) covers this concept along with cultivar,
serovar, biotype, ecotype, and other sub-species lineage types in a single slot
using a rank-prefix encoding (e.g. "strain:PV-4"). NMDC splits the concept into
separate slots; this slot covers the strain rank specifically.
examples:
- value: PV-4
description: GOLD organism_v2 Go0000189 (Shewanella loihica PV-4, queried 2026-04-21)
- value: DSS-3
description: GOLD organism_v2 Go0000514 (Ruegeria pomeroyi DSS-3, queried 2026-04-21)
- value: DSM 6724
description: GOLD organism_v2 Dictyoglomus turgidum (Go0000002, queried 2026-04-14)
in_subset:
- jgi_isolate
from_schema: https://example.com/nmdc_submission_schema
structured_aliases:
Strain or cultivar:
literal_form: Strain or cultivar
predicate: EXACT_SYNONYM
notes:
- Exact JGI form template is access-restricted; source is the public submission
overview.
source: https://jgi.doe.gov/user-programs/pmo-overview/project-materials-submission-overview/
related_mappings:
- MIXS:0000020
rank: 108
alias: strain_name
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
required: true
isolate_name:
name: isolate_name
description: Isolate or mutant name.
comments:
- MIxS `subspecf_gen_lin` (MIXS:0000020) covers this concept along with strain,
cultivar, serovar, biotype, ecotype, and other sub-species lineage types in
a single slot using a rank-prefix encoding. NMDC uses a separate slot for the
isolate rank specifically.
examples:
- value: Bd21-3
description: GOLD dw_sample_taxonomy_info.isolate (n=260 records, queried 2026-04-30)
— Brachypodium distachyon Bd21-3 reference accession
- value: MR164
description: GOLD dw_sample_taxonomy_info.isolate (n=555 records, queried 2026-04-30)
- value: Isolate
description: GOLD dw_sample_taxonomy_info.isolate (n=918 records, queried 2026-04-30)
— generic placeholder used when no specific mutant/isolate name is recorded
in_subset:
- jgi_isolate
from_schema: https://example.com/nmdc_submission_schema
structured_aliases:
Isolate:
literal_form: Isolate
predicate: EXACT_SYNONYM
notes:
- Exact JGI form template is access-restricted; source is the public submission
overview.
source: https://jgi.doe.gov/user-programs/pmo-overview/project-materials-submission-overview/
related_mappings:
- MIXS:0000020
rank: 109
alias: isolate_name
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
classified_as:
name: classified_as
description: 'Taxonomic classification of this organism. Narrowed from the global
OntologyClass range (defined on the slot itself) to NcbiTaxon, since organism
identity at NMDC is anchored to NCBI Taxonomy. Per #3016 — the broader pattern
is to narrow `classified_as` to `NcbiTaxon` on all organism-oriented classes
via slot_usage.'
comments:
- 'Taxonomy-oriented uses (e.g. on Organism) should point to NcbiTaxon instances.
OrganismSample reaches taxonomy indirectly via expected_organism.classified_as.
The global range stays OntologyClass; narrowing to NcbiTaxon via slot_usage
is tracked in #3016.'
examples:
- value: NCBITaxon:562
- value: NCBITaxon:1392
from_schema: https://example.com/nmdc_submission_schema
see_also:
- https://github.com/microbiomedata/nmdc-schema/issues/2959
narrow_mappings:
- biolink:in_taxon
rank: 111
alias: classified_as
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: string
required: true
multivalued: false
pattern: NCBITaxon:\d+
structured_pattern:
syntax: NCBITaxon:\d+
interpolated: false
gc_content:
name: gc_content
description: Estimated GC content as a percentage.
in_subset:
- jgi_isolate
from_schema: https://example.com/nmdc_submission_schema
structured_aliases:
GC Content %:
literal_form: GC Content %
predicate: EXACT_SYNONYM
notes:
- Exact JGI form template is access-restricted; source is the public submission
overview.
source: https://jgi.doe.gov/user-programs/pmo-overview/project-materials-submission-overview/
rank: 113
alias: gc_content
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: float
minimum_value: 0
maximum_value: 100
unit:
ucum_code: '%'
ploidy:
name: ploidy
description: The ploidy level of the genome (e.g. allopolyploid, haploid, diploid,
triploid, tetraploid). It has implications for the downstream study of duplicated
gene and regions of the genomes (and perhaps for difficulties in assembly).
For terms, please select terms listed under class ploidy (PATO:001374) of Phenotypic
Quality Ontology (PATO), and for a browser of PATO (v 2018-03-27) please refer
to http://purl.bioontology.org/ontology/PATO
title: ploidy
from_schema: https://example.com/nmdc_submission_schema
rank: 114
slot_uri: MIXS:0000021
alias: ploidy
owner: IsolateInterface
domain_of:
- IsolateInterface
slot_group: organism_section
range: PloidyEnum
analysis_type:
name: analysis_type
description: Select all the data types associated or available for this biosample
title: analysis/data type
comments:
- MIxS:investigation_type was included as a `see_also` but that term doesn't resolve
any more
examples:
- value: metagenomics; metabolomics; metaproteomics
from_schema: https://example.com/nmdc_submission_schema
rank: 3
alias: analysis_type
owner: IsolateInterface
domain_of:
- DhMultiviewCommonColumnsMixin
slot_group: sample_id_section
range: AnalysisTypeEnum
required: true
multivalued: true
samp_name:
name: samp_name
annotations:
Preferred_unit:
tag: Preferred_unit
value: ''
description: A local identifier or name that for the material sample collected.
Refers to the original material collected or to any derived sub-samples.
title: sample name
comments:
- It can have any format, but we suggest that you make it concise, unique and
consistent within your lab, and as informative as possible.
- If multiple sample environments (soil, water, etc) are used, these values must
be unique across all environments.
examples:
- value: Rock core CB1178(5-6) from NSW
from_schema: https://example.com/nmdc_submission_schema
rank: 1
keywords:
- sample
slot_uri: MIXS:0001107
identifier: true
alias: samp_name
owner: IsolateInterface
domain_of:
- DhMultiviewCommonColumnsMixin
slot_group: sample_id_section
range: string
required: true