Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Differential expression analysis error #15

Open
link01153113 opened this issue Mar 18, 2024 · 0 comments
Open

Differential expression analysis error #15

link01153113 opened this issue Mar 18, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@link01153113
Copy link
Collaborator

the issue for the differential expression error is the first column naming. For example, there are a few study (experiments)'s experiment_short_title having invalid sign like below:

{
        "_index": "study",
        "_id": "PsOHhI0B-tReATpm7Ohe",
        "_score": 1,
        "_source": {
          "contact_name": "Stefan Ruhl",
          "experiment_title": "Proteome analysis of glandular parotid and submandibular-sublingual saliva in comparison to whole human saliva by two-dimensional gel electrophoresis",
          "sequence_mapped_swiss_count": "0",
          "Taxononomy_Species": "Homo sapiens",
          "condition_type": "Disease Free",
          "contact_information": "Department of Oral Biology, School of Dental Medicine, University at Buffalo",
          "experiment_peptide_count": "0",
          "sample_type": "Sublingual gland",
          "experiment_id_key": 4213,
          "institution": "University at Buffalo",
          "bto_term_list": [
            "Sublingual gland",
            "Submandibular gland"
          ],
          "experiment_protein_count": "0",
          "Study_name": "Proteome analysis of glandular parotid and submandibular-sublingual saliva in comparison to whole human saliva by two-dimensional gel electrophoresis.",
          "reference_line": "4213",
          "bto_ac": [
            "BTO:0001315",
            "BTO:0001316"
          ],
          "sample_description_comment": "SMSL saliva from a healthy control was collected while the orifice of the parotid duct was blocked by Lashley cups.",
          "experiment_created_date": "2019-10-30 12:12:22",
          "experiment_group_id": 4213,
          "PubMed_ID": "16402355",
          "experiment_short_title": "Walz et al. ‎2006",
          "Taxononomy_ID": "9606",
          "search_engine": "",
          "sample_name": "Healthy Control"
        }
      },

The "#" sign:

{
        "_index": "study",
        "_id": "t8OHhI0B-tReATpm4uZ7",
        "_score": 1,
        "_source": {
          "contact_name": "Yanbao Yu",
          "experiment_title": "Self-Assembled STrap for Global Proteomics and Salivary Biomarker Discovery",
          "sequence_mapped_swiss_count": "0",
          "Taxononomy_Species": "Homo sapiens (Human)",
          "condition_type": "Disease Free",
          "contact_information": "[email protected]",
          "experiment_peptide_count": "13688",
          "sample_type": "Saliva",
          "experiment_id_key": 1009,
          "institution": "J. Craig Venter Institute",
          "bto_term_list": [
            "Saliva"
          ],
          "experiment_protein_count": "483",
          "Study_name": "Self-assembled STrap for global proteomics and salivary biomarker discovery.",
          "reference_line": "1009",
          "bto_ac": [
            "BTO:0001202"
          ],
          "sample_description_comment": "",
          "experiment_created_date": "2019-09-25 17:29:11",
          "experiment_group_id": 1009,
          "PubMed_ID": "30848925",
          "experiment_short_title": "_1DLC092118QE_Saliva_Strap_#7_run1",
          "Taxononomy_ID": "9606",
          "search_engine": "Proteome Discoverer version 2.3 / Sequest",
          "sample_name": "1DLC092118QE_Saliva_Strap_#7_run1"
        }
      },

And in the inputdata, it's using experiment_short_title as the experiment identifiers. R script does not allow those character.
In addition, there are 3 records having the same experiment_short_title: Walz et al. ‎2006

Might need to make sure experiments title and short title are unique based on study and experiments
There are lots of duplicated experiment_title in the index:
There are 206 records with the same experiment_title of "Association of salivary proteins with age, sex, body mass index, smoking and education"
There are 100 records with the same experiment_title of "Persistent Dysfunction of Innate Immune and Clotting Factors Detected in Saliva and Blood Plasma after COVID-19"
There are 36 records with the same experiment_title of "Mapping differences in salivary gland secretions using dried saliva spots"
There are 30 records with the same experiment_title of "Self-Assembled STrap for Global Proteomics and Salivary Biomarker Discovery"
There are 3 records with the same experiment_title of "Proteome analysis of glandular parotid and submandibular-sublingual saliva in comparison to whole human saliva by two-dimensional gel electrophoresis"

For experiment_short_title, there are 3 records having the same experiment_short_title. This one is easy to fix.

@link01153113 link01153113 added the bug Something isn't working label Mar 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant