You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+12Lines changed: 12 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -1,5 +1,17 @@
1
1
# Qiita changelog
2
2
3
+
Version 2022.11
4
+
---------------
5
+
6
+
* Adding new Woltka v0.1.4 and WoLr2 - WoLr1 and RS210 (updated version of rep200) are available but not default.
7
+
* Update qp-woltka so it runs faster, contribution with @wasade. Add newest improvements for woltka processing so it runs faster.
8
+
* Fix qp-qiime2, "filter features" bug that expects a "feature metadata" value from the dropdown.
9
+
* Avoid having [multiple git version](https://github.com/qiita-spots/qiita/issues/3216) commands running in the system - this in theory should fix some of the slow downs.
10
+
* Fix sample_name in per preparation sample information files.
11
+
* Now the sequencing artifacts (like per-sample-FASTQ, FASTA, etc) accept log files.
12
+
* SortMeRNA via qp-meta will generate processing logs and will be added to their artifacts - note that this will only be available in newer runs AKA you can delete the outputs of past runs and rerun it.
13
+
* Added new "Admin Sample Validation" to validate a list of sample names against what's available in a study, contributed by @sarayupai.
Copy file name to clipboardExpand all lines: qiita_pet/support_files/doc/source/processingdata/processing-recommendations.rst
+26-13Lines changed: 26 additions & 13 deletions
Original file line number
Diff line number
Diff line change
@@ -45,7 +45,7 @@ Shotgun sequencing
45
45
------------------
46
46
47
47
Qiita currently has one active shotgun metagenomics data analysis pipeline: a per sample
48
-
bowtie2 alignment step with Woltka classification using either the WoLr1or Rep200 databases.
48
+
bowtie2 alignment step with Woltka classification using either the WoLr1, WoLr2 (default) or RS210 databases.
49
49
Below you will find more information about each of these options.
50
50
51
51
.. note::
@@ -67,16 +67,16 @@ subsequent meta-analyses. Currently, the `fastp` command is set to autodetect ad
67
67
wetlab processing and we provide the following host references for your convenience:
68
68
69
69
- auto-detect adapters and artifacts + phix filtering: This is a `deblur artifacts <https://github.com/biocore/deblur/blob/master/deblur/support_files/artifacts.fa>`_ reference, mainly for debugging and testing
70
-
- auto-detect adapters and cheetah + phix filtering
71
-
- auto-detect adapters and cow + phix filtering
72
-
- auto-detect adapters and hamster + phix filtering
73
-
- auto-detect adapters and horse + phix filtering
70
+
- auto-detect adapters and `cheetah<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/003/709/585/GCF_003709585.1_Aci_jub_2/GCF_003709585.1_Aci_jub_2_genomic.fna.gz>`_ + phix filtering
71
+
- auto-detect adapters and `cow<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/003/205/GCA_000003205.6_Btau_5.0.1/GCA_000003205.6_Btau_5.0.1_genomic.fna.gz>`_ + phix filtering
72
+
- auto-detect adapters and `hamster<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/017/639/785/GCF_017639785.1_BCM_Maur_2.0/GCF_017639785.1_BCM_Maur_2.0_genomic.fna.gz>`_ + phix filtering
73
+
- auto-detect adapters and `horse<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/002/305/GCA_000002305.1_EquCab2.0/GCA_000002305.1_EquCab2.0_genomic.fna.gz>`_ + phix filtering
74
74
- auto-detect adapters and merge_genomes + phix filtering : is the combined genomes of a cheetah, cow, hamster, horse, human, mouse, pig, rabbit, and rat
75
-
- auto-detect adapters and mouse + phix filtering
76
-
- auto-detect adapters and pig + phix filtering
77
-
- auto-detect adapters and rabbit + phix filtering
78
-
- auto-detect adapters and rat + phix filtering
79
-
- auto-detect adapters only filtering [not recommended]
75
+
- auto-detect adapters and `mouse<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/635/GCF_000001635.27_GRCm39/GCF_000001635.27_GRCm39_genomic.fna.gz>`_ + phix filtering
76
+
- auto-detect adapters and `pig<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/025/GCF_000003025.6_Sscrofa11.1/GCF_000003025.6_Sscrofa11.1_genomic.fna.gz>`_ + phix filtering
77
+
- auto-detect adapters and `rabbit<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/625/GCF_000003625.3_OryCun2.0/GCF_000003625.3_OryCun2.0_genomic.fna.gz>`_ + phix filtering
78
+
- auto-detect adapters and `rat<https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/895/GCA_000001895.4_Rnor_6.0/GCA_000001895.4_Rnor_6.0_genomic.fna.gz>`_ + phix filtering
79
+
- auto-detect adapters only filtering
80
80
81
81
Note that the command produces up to 6 output artifacts based on the aligner and database selected:
82
82
@@ -89,7 +89,7 @@ Note that the command produces up to 6 output artifacts based on the aligner and
89
89
90
90
.. note::
91
91
Woltka provides easy transformations for the "per gene Prediction table" to generate functional
92
-
profiles, `more information <https://github.com/qiyunzhu/woltka/blob/master/doc/wol.md#comparison>`
92
+
profiles, `more information <https://github.com/qiyunzhu/woltka/blob/master/doc/wol.md#comparison>`_.
93
93
94
94
Aligners
95
95
^^^^^^^^
@@ -127,8 +127,6 @@ Reference databases
127
127
128
128
Note that some of these are legacy option but not available for new processing.
129
129
130
-
131
-
132
130
#. WoLr2 (“Web of Life” release 2): A significant upgrade from WoLr1. The genome
133
131
pool is an even representation of microbial diversity, sampled from
134
132
non-redundant bacterial and archaeal genomes from NCBI (RefSeq and GenBank,
@@ -154,6 +152,21 @@ Note that some of these are legacy option but not available for new processing.
154
152
- Genera: 6,811
155
153
- Species: 12,258
156
154
155
+
#. RS210: Collection of reference microbial genomes sampled from the NCBI RefSeq
156
+
genome database, as of 2022-01-01. This time point corresponds to RefSeq
157
+
release 210.
158
+
159
+
- Genomes: 29,648
160
+
- Nucleotides: 926,894
161
+
- Basepairs: 111,767,286,504 (includes linkers)
162
+
- Numbers of taxonomic units:
163
+
164
+
- Archaea: 606
165
+
- Bacteria: 21,047
166
+
- Fungi: 409
167
+
- Protozoa: 93
168
+
- Viral: 7,493
169
+
157
170
#. WoLr1 ("Web of Life" release 1): An even representation of microbial diversity, selected using an prototype
158
171
selection algorithm based on the MinHash distance matrix among all non-redundant bacterial and archaeal genomes
159
172
from NCBI (RefSeq and GenBank, complete and draft), plus several genome quality control criteria. A
0 commit comments