Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding schemas for image-based spatial transcriptomics #48

Open
wants to merge 6 commits into
base: dev
Choose a base branch
from

Conversation

rightbower
Copy link

Will attach to issue for review

@rightbower rightbower added documentation Improvements or additions to documentation enhancement New feature or request metadata schema New or change in metadata schema labels Jul 3, 2024
@rightbower rightbower linked an issue Jul 3, 2024 that may be closed by this pull request
1 task

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi, I am new to the BICAN group, from a SSPsyGene consortium which will also have spatial transciptomics. I was curious about the "light treatment time" field, this isn't one that I've seen in other groups (HuBMAP/HTAN), and also was not considering for our SSPsyGene schema. I am guessing that light treatment came up when gathering input from your generating labs? I see that this field is "if applied" and so it looks great for BICAN. I am more asking in case SSPsyGene should consider this field, or how it was decided it add it.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One other question, for these fields are there some that are strictly 'required'? -where it appears the "light treatment time" field is optional -only used when applied?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For "gene name" I was curious what the thoughts were about HGNC identifiers as well? Our group has decided on HGNC identifiers (with the plan to also connect those to Ensembl IDs in the background).

Copy link
Contributor

@djarecka djarecka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rightbower - I left comments for the specific elements of the tables.

I will have to talk to you also about the class description (and relations between them), some value sets that are not included as a separate file, e.g. BICAN-Species/synonym, and if the fields have some exact mapping that we should use, e.g. from NIMP.

donor age unit,bil:Specimen/Donor/Age unit,,Units for donor age,FALSE,bil_donor_age_units,ValueSet,Days,,,,categorical,,
donor sex,bil:Specimen/Donor/Sex,,Sex of donor,FALSE,bil_donor_sex,ValueSet,Male,,,,categorical,,
donor genotype,bil:Specimen/Donor/Genotype,,"The genotype or transgenic line information (e.g. StrainName, CreDriverLine, ReporterName, etc.)",TRUE,,text,wt/wt,,,,categorical,,
organ local name,bil:Specimen/Donor/Organ local ID,,Local (lab) reference ID assigned to the organ/brain.,TRUE,,text,,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I understand that all fields above should belong to Donor class, but what about the fields about "organ"?

species,bil:Specimen/Donor/Species,,"Common name for the donor species (e.g. Mouse, Human)",FALSE,[BICAN-Species/synonym],ValueSet,mouse,,,,categorical,,
species NCBI taxonomy,bil:Specimen/Donor/NCBITaxonomy,,NCBI Taxonomy ID for species; format ID with prefix 'NCBI:txid',FALSE,[NCBI Taxon],ValueSet,NCBI:txid10090,,,,categorical,,
donor age,bil:Specimen/Donor/Age,,Age of donor (if known),FALSE,,float,12,,,,continuous,,
donor age unit,bil:Specimen/Donor/Age unit,,Units for donor age,FALSE,bil_donor_age_units,ValueSet,Days,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure if "nullable" should be False, since the description suggest otherwise

@@ -0,0 +1,21 @@
Proposed BICAN Field Name,Aliases,LinkML Class,Definition,Nullable,Permissible Values,Data Type,Data Examples,Min Value,Max Value,Unit,Statistical Variable Type,Subsets,Notes
histological stain name,bil:Image/Channel/Fluorophore,Anatomical Stain,Name of histological stain applied to tissue ,TRUE,histological_stain_name,ValueSet,GFAP,,,,categorical,,
nuclear stain name,bil:Image/Channel/Fluorophore,Nuclear Stain,Name of nuclear stain applied to tissue,FALSE,nuclear_stain_name,ValueSet,DAPI,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it has the same alias as "histological stain", is that ok?

histological stain name,bil:Image/Channel/Fluorophore,Anatomical Stain,Name of histological stain applied to tissue ,TRUE,histological_stain_name,ValueSet,GFAP,,,,categorical,,
nuclear stain name,bil:Image/Channel/Fluorophore,Nuclear Stain,Name of nuclear stain applied to tissue,FALSE,nuclear_stain_name,ValueSet,DAPI,,,,categorical,,
probe set DOI,bil:Dataset/Methods,Bench Chemistry,"List of genes for which probes were made in imaging-based methods; protocols.io doi to a text file that contains the name of all target genes, the ensembl or NBCI id for the targeted id, and the barcode assigned to each targeted RNA if applicable.",FALSE,,text,,,,,continuous,Analysis,
probe sequences DOI,bil:Dataset/Methods,Bench Chemistry,List of probe sequences used to target genes for imaging-based methods,TRUE,,text,,,,,continuous,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it has the same alias as "probe set DOI", is that ok?

platform name,bil:Instrument/MicroscopeManufacturerAndModel,Probe Imaging,Name of platform that was used to acquire spatial transcriptomics,FALSE,platform_name,ValueSet,MERSCOPE,,,,categorical,Analysis,
machine name,,Probe Imaging,Name/ID of machine used to acquire data,TRUE,,text,Flounder10,,,,categorical,,
machine software version,,Probe Imaging,Version of software used for imaging (e.g. SoftwareVersion from experiment.json in MERSCOPE),TRUE,,text,232.221125.301,,,,categorical,,
number of z-sections,bil:Image/Channel/Slices,Probe Imaging,Slice number in z-axis,FALSE,,integer,,1,,,continuous,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this a total number of slices? or identifier for the specific slice?

brain subdivision,brain_slab/brain_subdivision,Tissue Sample,The subdivision of the brain the tissue sample belongs to,TRUE,[brain_subdivision],ValueSet,cerebrum,,,,categorical,,
brain hemisphere,brain_slab/slab_hemisphere,Tissue Sample,The hemisphere of the brain the slab belongs to,TRUE,[hemisphere],ValueSet,right,,,,categorical,,
tissue sample type,,Tissue Sample,"Block, slab, etc.",FALSE,[tissue_sample_type],ValueSet,slab,,,,categorical,Tracking/Identification,
parent tissue sample name,,Tissue Sample,Upstream tissue sample from which tissue was derived (if addt�l specimen processsing after tissue received from portal),TRUE,,text,,,,,categorical,Tracking/Identification,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

something wrong with the description, the same for the next line

brain section local name,bil:Specimen/Sample/local ID,Tissue Cryosection,Local unique ID that represents the tissue cryosection to be processed,FALSE,,text,,,,,categorical,Identification,
brain section NIMP ID,,Tissue Cryosection,NIMP ID for the tissue cryosection,FALSE,,text,,,,,categorical,,
reference brain atlas,bil:Specimen/Sample/Atlas,Tissue Cryosection,Reference atlas used to pin locations,TRUE,"[DHBA, DMBA, HBA, MBA]",ValueSet,MBA,,,,categorical,,
atlas location,bil:Specimen/Sample/Location,Tissue Cryosection,"The (sub) location(s) or region where the tissue is found; structures from DHBA, MBA ontologies",TRUE,"[DHBA, DMBA, HBA, MBA]",ValueSet,MBA:315,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should this be "ValueSet"? seems like it has additional integer to specify the location

@@ -0,0 +1,5 @@
Proposed BICAN Field Name,Aliases,LinkML Class,Definition,Nullable,Permissible Values,Data Type,Data Examples,Min Value,Max Value,Unit,Statistical Variable Type,Subsets,Notes
gene name,,Bench Chemistry,The word or phrase that uniquely identifies the gene,FALSE,"[ENSEMBL, NCBI]",ValueSet,calbindin 2,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

examples in this file don't match the permissible values

tissue sample name,tissue_sample_local_name,Tissue Sample,Block of tissue that was used for further dissection and/or sectioning,FALSE,,text,,,,,categorical,Tracking/Identification,
brain subdivision,brain_slab/brain_subdivision,Tissue Sample,The subdivision of the brain the tissue sample belongs to,TRUE,[brain_subdivision],ValueSet,cerebrum,,,,categorical,,
brain hemisphere,brain_slab/slab_hemisphere,Tissue Sample,The hemisphere of the brain the slab belongs to,TRUE,[hemisphere],ValueSet,right,,,,categorical,,
tissue sample type,,Tissue Sample,"Block, slab, etc.",FALSE,[tissue_sample_type],ValueSet,slab,,,,categorical,Tracking/Identification,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i don't see "tissue_sample_type" in the `ValueSets"

@@ -0,0 +1,13 @@
Proposed BICAN Field Name,Aliases,LinkML Class,Definition,Nullable,Permissible Values,Data Type,Data Examples,Min Value,Max Value,Unit,Statistical Variable Type,Subsets,Notes
tissue sample name,tissue_sample_local_name,Tissue Sample,Block of tissue that was used for further dissection and/or sectioning,FALSE,,text,,,,,categorical,Tracking/Identification,
brain subdivision,brain_slab/brain_subdivision,Tissue Sample,The subdivision of the brain the tissue sample belongs to,TRUE,[brain_subdivision],ValueSet,cerebrum,,,,categorical,,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

for consistency it should be probably "nimp_brain_subdivision", since that's the name of the file.
the same for the other elements in this file

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request metadata schema New or change in metadata schema
Projects
None yet
Development

Successfully merging this pull request may close these issues.

New Metadata Schema Request: Image-based Spatial Transcriptomics
4 participants