Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
104 commits
Select commit Hold shift + click to select a range
462c77e
add metabuli
xonq Feb 7, 2026
6bfeb34
major version update
xonq Feb 10, 2026
b90a85f
taxon id optional for read extraction
xonq Feb 10, 2026
98fb69a
develop standalone wf through to testing
xonq Feb 10, 2026
26cf153
fix miniwdl check discrepancies
xonq Feb 10, 2026
bf38c3c
update read1 select first
xonq Feb 10, 2026
6cf3b28
change inputs for trimming
xonq Feb 10, 2026
f21bbab
merge fastp_pe and fastp_se
xonq Feb 10, 2026
04b4762
call fastp correct
xonq Feb 10, 2026
e3f8e7c
select_first optional output
xonq Feb 10, 2026
7e4c410
Update fastp task parameters for congruence with trimmomatic, disable…
xonq Feb 10, 2026
e1fc4d8
trim adapters by default
xonq Feb 10, 2026
468a9a5
Use select_first for read2 input in fastp call
xonq Feb 10, 2026
93f4e7c
add testing data
xonq Feb 11, 2026
d649f15
send it v1.1
xonq Feb 11, 2026
7bff258
1.0
xonq Feb 11, 2026
a5614d1
read 2 optional input circumvention
xonq Feb 11, 2026
98fb3f0
setup for SE
xonq Feb 11, 2026
1cd6354
fix output harvesting for SE
xonq Feb 11, 2026
d1c3a73
bash syntax correction
xonq Feb 11, 2026
2773061
new input vals
xonq Feb 11, 2026
1418943
add seqmode input
xonq Feb 11, 2026
7acc48a
change input naming
xonq Feb 11, 2026
6bc7e76
fix fastp trim adapters
xonq Feb 11, 2026
4a5e528
call_trim default on, fix implicit propagation
xonq Feb 11, 2026
b6058d9
add metabuli and update versions
xonq Feb 11, 2026
74dd730
init metabuli wf
xonq Feb 11, 2026
77752c7
restrict to ONT input
xonq Feb 11, 2026
45548cd
update with exposed options for wf
xonq Feb 11, 2026
a8429b5
update with illumina explanation
xonq Feb 11, 2026
425fec8
update default taxdump
xonq Feb 11, 2026
7c21284
update implicit illumina
xonq Feb 11, 2026
1a94b3c
fix select_first
xonq Feb 11, 2026
c9864e3
update fastp discussion, set adapter trimming to on by default
xonq Feb 11, 2026
78db6d1
update scoring explanation
xonq Feb 11, 2026
778bbcc
add workflow
xonq Feb 11, 2026
036d61c
get it straight
xonq Feb 11, 2026
d4924a5
update formatting
xonq Feb 11, 2026
ebeed3e
update tables
xonq Feb 11, 2026
e47032c
use concordant fastp output naming
xonq Feb 11, 2026
c6c6bac
update outputs
xonq Feb 11, 2026
64f2fc5
update database name
xonq Feb 11, 2026
c7f017e
add optional taxon input
xonq Feb 11, 2026
b3eb8f2
add ete4
xonq Feb 11, 2026
517e221
add taxon standardization
xonq Feb 11, 2026
46eeef3
add taxon
xonq Feb 11, 2026
0cfa25f
integrate metabuli status
xonq Feb 11, 2026
b7c3a9d
fix metabuli status quotes
xonq Feb 11, 2026
c0481f9
no reads recovered, error
xonq Feb 11, 2026
e75b901
attempt to fix terra grep query
xonq Feb 11, 2026
fc92dab
update table
xonq Feb 11, 2026
92b69eb
update disk size for gtdb
xonq Feb 12, 2026
c6a3742
update to bbp compatible name
xonq Feb 12, 2026
c629bb4
update i/o docs
xonq Feb 12, 2026
eff6183
update descriptions for metabuli workflow variables in input and outp…
xonq Feb 12, 2026
7a37fd4
rm md5s
xonq Feb 12, 2026
33bafc0
burning md5s
xonq Feb 12, 2026
cf6a19e
add example metabuli report
xonq Feb 12, 2026
4cc50ce
revert to 100 GB until discussed
xonq Feb 12, 2026
656e029
use metabuli status correctly
xonq Feb 12, 2026
5ead490
fix read1 propagation based on metabuli
xonq Feb 12, 2026
11d797c
one b
xonq Feb 12, 2026
aa35947
query report, not classifications
xonq Feb 12, 2026
5e3c826
Push Metabuli diagram
cimendes Feb 13, 2026
6228f09
fix workflow type for metabuli
cimendes Feb 16, 2026
fcb0e30
add warning for database size and disk space
cimendes Feb 16, 2026
62bba46
fiz disk size input
cimendes Feb 16, 2026
1b356e7
reorder things for readability
cimendes Feb 16, 2026
a56dddc
update disk size to accomodate gtdb
xonq Feb 18, 2026
db2527f
add metabuli_db input requirement
xonq Feb 18, 2026
a694160
update testing files
xonq Feb 18, 2026
689ae32
update i/o
xonq Feb 18, 2026
0f36a08
update db path and add decompressed size
xonq Feb 18, 2026
303a483
update default and ram/disk size
xonq Feb 18, 2026
449aa0e
init fastplong offload porechop
xonq Feb 19, 2026
6056b26
comment inputs
xonq Feb 19, 2026
70fdd40
init
xonq Feb 19, 2026
fbc6998
update input names
xonq Feb 19, 2026
22c7ebb
add fastplong
xonq Feb 19, 2026
629e34c
correct input
xonq Feb 19, 2026
2365d2e
update i/o docs
xonq Feb 19, 2026
f6fe068
update id
xonq Feb 19, 2026
751fabb
add ont bacterial
xonq Feb 19, 2026
fbb2a10
update with fastplong
xonq Mar 2, 2026
580f23c
add info on database size
xonq Mar 2, 2026
01803ea
update max_ram usage
xonq Mar 4, 2026
2426976
allow taxon name input
xonq Mar 4, 2026
66e1378
use taxon name instead of ID for common hits
xonq Mar 4, 2026
c73eda7
explicitly declare name
xonq Mar 4, 2026
6e7d09b
add gtdb taxdump
xonq Mar 4, 2026
5ec65e1
float taxdump to front
xonq Mar 4, 2026
5e20542
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 4, 2026
372e1a0
change inputs.tsv
xonq Mar 4, 2026
6507d45
taxdump explanation
xonq Mar 4, 2026
4d44164
rm default for optional input
xonq Mar 5, 2026
493366c
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 5, 2026
7fd14a3
remove sc2
xonq Mar 13, 2026
54e0558
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 20, 2026
51f49f7
rm extra
xonq Mar 20, 2026
add9043
rm line
xonq Mar 20, 2026
29bf5b6
update io
xonq Mar 20, 2026
223aa4f
simplify percent_human and target_lineage acquisitions
xonq Mar 23, 2026
eaa3c1f
change to fastplong
xonq Mar 23, 2026
b8953a0
harden regex
xonq Mar 23, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .dockstore.yml
Original file line number Diff line number Diff line change
Expand Up @@ -240,6 +240,11 @@ workflows:
primaryDescriptorPath: /workflows/phylogenetics/wf_phylocompare.wdl
testParameterFiles:
- /tests/inputs/empty.json
- name: Metabuli_PHB
subclass: WDL
primaryDescriptorPath: /workflows/standalone_modules/wf_metabuli.wdl
testParameterFiles:
- /tests/inputs/empty.json
- name: Nextclade_Batch_PHB
subclass: WDL
primaryDescriptorPath: /workflows/phylogenetics/wf_nextclade_batch.wdl
Expand Down
Binary file added docs/assets/figures/Metabuli.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
89 changes: 59 additions & 30 deletions docs/assets/tables/all_inputs.tsv

Large diffs are not rendered by default.

48 changes: 30 additions & 18 deletions docs/assets/tables/all_outputs.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -172,8 +172,8 @@ biosample_metadata File Metadata used for BioSample submission in proper BioSamp
biosample_report_xmls Array[File] One or more XML files that contain the response from NCBI regarding your BioSample submission. These can be pretty cryptic, but often contain information to determine if anything went wrong Terra_2_NCBI
biosample_status String String showing whether BioSample submission was successful Terra_2_NCBI
biosample_submission_xml File XML file used to submit your BioSamples to NCBI Terra_2_NCBI
bracken_report String Refined kraken2 report generated by Bracken Freyja_FASTQ, TheiaCoV_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE
bracken_report File Refined kraken2 report generated by Bracken Kraken_PE, Kraken_SE, TheiaCoV_Illumina_SE
bracken_report String Refined kraken2 report generated by Bracken Freyja_FASTQ, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE
bracken_report File Refined kraken2 report generated by Bracken Kraken_PE, Kraken_SE
bracken_report_clean String Cleaned refined kraken2 report generated by Bracken TheiaMeta_Illumina_PE
bracken_report_dehosted String Dehosted refined kraken2 report generated by Bracken Freyja_FASTQ, TheiaCoV_Illumina_PE
bracken_report_dehosted File Dehosted refined kraken2 report generated by Bracken TheiaCoV_Illumina_SE
Expand Down Expand Up @@ -322,21 +322,28 @@ est_coverage_clean Float Estimated coverage calculated from clean reads and geno
est_coverage_raw Float Estimated coverage calculated from raw reads and genome length TheiaCoV_ONT, TheiaEuk_Illumina_PE, TheiaEuk_ONT, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT
est_genome_length Int Estimated genome length TheiaEuk_ONT
est_percent_gene_coverage_tsv File Percent coverage for each gene in the organism being analyzed (depending on the organism input) Freyja_FASTQ, TheiaCoV_ClearLabs, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaCoV_ONT
ete4_docker String Docker image used for ETE4 taxonomy parsing TheiaViral_Illumina_PE, TheiaViral_ONT
ete4_docker String Docker image used for ETE4 taxonomy parsing Metabuli, TheiaViral_Illumina_PE, TheiaViral_ONT
ete4_status String Status of standardized taxonomy acquisition TheiaViral_Illumina_PE, TheiaViral_ONT
ete4_version String The version of ETE4 used TheiaViral_Illumina_PE, TheiaViral_ONT
ete4_version String The version of ETE4 used Metabuli, TheiaViral_Illumina_PE, TheiaViral_ONT
excluded_samples File A file that contains the names and reasons why a sample was excluded from submission. **For SARS-CoV-2**, there are two sections: First, a section for any samples that failed to meet pre-determined quality thresholds (`number_N` and `vadr_num_alert`). Second, a section that includes a table that describes any missing required metadata for each sample. This table has the sample name for rows and any columns that have missing metadata as headers. If a sample is missing a piece of required metadata, the corresponding cell will be blank. However, if a different sample does have metadata for that column, the associated value will appear in the corresponding cell. **For flu and mpox**, only the second section described above exists. _Please see the example below for more details_. Mercury_Prep_N_Batch
excluded_samples File Text file listing samples that were excluded from BioSample submission for missing required metadata Terra_2_NCBI
failed_uploads File The metadata for any failed uploads Terra_2_GISAID
fasta_utilities_fai File FASTA index file TheiaViral_ONT
fasta_utilities_samtools_docker String Docker image used for samtools in fasta utilities TheiaViral_ONT
fasta_utilities_samtools_version String Version of samtools used in fasta utilities TheiaViral_ONT
fastp_docker String Docker image used for fastp Freyja_FASTQ, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report File The HTML report made with fastp TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report String The HTML report made with fastp Freyja_FASTQ
fastp_json_report File The JSON report made with fastp TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_json_report String The JSON report made with fastp Freyja_FASTQ
fastp_version String The version of fastp used Freyja_FASTQ, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_docker String Docker image used for fastp Freyja_FASTQ, Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report File The HTML report conveying fastp results Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report String The HTML report conveying fastp results Freyja_FASTQ
fastp_json_report File The JSON report conveying fastp results Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_json_report String The JSON report conveying fastp results Freyja_FASTQ
fastp_read1_trimmed File `read1` input trimmed by fastp Metabuli
fastp_read2_trimmed File `read2` input trimmed by fastp Metabuli
fastp_version String The version of fastp used Freyja_FASTQ, Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastplong_docker String Docker image used for fastplong Metabuli
fastplong_html_report File The HTML report conveying fastplong results Metabuli
fastplong_json_report File The JSON report conveying fastplong results Metabuli
fastplong_read1_trimmed File `read1` input trimmed by fastplong Metabuli
fastplong_version String The version of fastplong used Metabuli
fastq_dl_date String The date of the read data download SRA_Fetch
fastq_dl_docker String The docker used SRA_Fetch
fastq_dl_fastq_metadata File File containing metadata of the provided accession such as submission_accession, library_selection, instrument_platform, among others SRA_Fetch
Expand Down Expand Up @@ -673,21 +680,26 @@ mercury_prep_n_batch_version String Version of the PHB repository that hosts thi
mercury_script_version String Version of the Mercury tool that was used in this workflow Mercury_Prep_N_Batch
merlin_tag String Merlin tag for the assembly TheiaEuk_ONT
metabuli_classified File Classified reads from Metabuli TheiaViral_ONT
metabuli_classified_read1 File FASTQ of `read1` input classified by Metabuli Metabuli
metabuli_classified_read2 File FASTQ of `read2` input classified by Metabuli Metabuli
metabuli_classified_report File Classification report from Metabuli Metabuli
metabuli_database String Database used for Metabuli TheiaProk_ONT, TheiaViral_ONT
metabuli_docker String Docker image used for Metabuli TheiaProk_ONT, TheiaViral_ONT
metabuli_docker String Docker image used for Metabuli Metabuli, TheiaProk_ONT, TheiaViral_ONT
metabuli_human Float Percent of human reads detected in raw reads Freyja_FASTQ, TheiaCoV_ONT
metabuli_human_dehosted Float Percent of human reads detected after removing human reads Freyja_FASTQ, TheiaCoV_ONT
metabuli_krona_report File Krona visualization report from Metabuli TheiaViral_ONT
metabuli_krona_report File Krona visualization report from Metabuli Metabuli, TheiaViral_ONT
metabuli_percent_target_lineage String Percent of reads classified to the target lineage TheiaViral_ONT
metabuli_read1_extract File Extracted reads from Metabuli TheiaViral_ONT
metabuli_report String Classification report from Metabuli Freyja_FASTQ, TheiaCoV_ONT, TheiaProk_ONT
metabuli_report File Classification report from Metabuli TheiaViral_ONT
metabuli_report File Classification report from Metabuli Metabuli, TheiaViral_ONT
metabuli_report_dehosted String Classification report from Metabuli after removing human reads Freyja_FASTQ, TheiaCoV_ONT
metabuli_status String Status of Metabuli analysis TheiaViral_ONT
metabuli_status String Status of Metabuli analysis Metabuli, TheiaViral_ONT
metabuli_target_organism String Percent of reads classified to target organism for Metabuli analysis TheiaCoV_ONT
metabuli_target_organism_dehosted String Percent of reads classified to target organism for Metabuli analysis after removing human reads TheiaCoV_ONT
metabuli_target_organism_name String Name of the target organism for Metabuli analysis TheiaCoV_ONT
metabuli_version String Version of Metabuli used Freyja_FASTQ, TheiaCoV_ONT, TheiaProk_ONT, TheiaViral_ONT
metabuli_version String Version of Metabuli used Freyja_FASTQ, Metabuli, TheiaCoV_ONT, TheiaProk_ONT, TheiaViral_ONT
metabuli_wf_analysis_date String Analysis date for Metabuli workflow Metabuli
metabuli_wf_version String Version of Metabuli workflow Metabuli
metaspades_docker String The Docker image of metaspades TheiaMeta_Illumina_PE
metaspades_version String The version of metaspades TheiaMeta_Illumina_PE
metaviralspades_docker String Docker image used for MetaviralSPAdes TheiaViral_Illumina_PE
Expand Down Expand Up @@ -730,7 +742,7 @@ nanoplot_tsv_raw File A TSV report describing the raw reads Freyja_FASTQ, TheiaC
nanoplot_version String Version of nanoplot used for analysis TheiaCoV_ONT, TheiaEuk_ONT, TheiaProk_ONT
nanoq_filtered_read1 File Filtered reads from NanoQ TheiaViral_ONT
nanoq_version String Version of nanoq used in analysis Freyja_FASTQ, TheiaEuk_ONT, TheiaProk_ONT, TheiaViral_ONT
ncbi_read_extraction_rank String Read extraction rank used TheiaViral_Illumina_PE, TheiaViral_ONT
ncbi_read_extraction_rank String Read extraction rank used Metabuli, TheiaViral_Illumina_PE, TheiaViral_ONT
ncbi_scrub_docker String The Docker image for NCBI's HRRT (human read removal tool) NCBI_Scrub_PE, NCBI_Scrub_SE, TheiaMeta_Illumina_PE, TheiaViral_Illumina_PE, TheiaViral_ONT, TheiaViral_Panel
ncbi_scrub_human_spots_removed Int Number of spots removed (or masked) NCBI_Scrub_PE, NCBI_Scrub_SE, TheiaViral_Illumina_PE, TheiaViral_ONT, TheiaViral_Panel
ncbi_scrub_pe_analysis_date String Date of analysis NCBI_Scrub_PE
Expand All @@ -739,8 +751,8 @@ ncbi_scrub_read1_dehosted File Dehosted reads after NCBI scrub TheiaViral_Illumi
ncbi_scrub_read2_dehosted File Dehosted reads after NCBI scrub TheiaViral_Illumina_PE, TheiaViral_Panel
ncbi_scrub_se_analysis_date String Date of analysis NCBI_Scrub_SE
ncbi_scrub_se_version String Version of HRRT software used NCBI_Scrub_SE
ncbi_taxon_id String NCBI taxonomy ID of inputted organism following rank extraction TheiaViral_Illumina_PE, TheiaViral_ONT
ncbi_taxon_name String NCBI taxonomy name of inputted taxon following rank extraction TheiaViral_Illumina_PE, TheiaViral_ONT
ncbi_taxon_id String NCBI taxonomy ID of inputted organism following rank extraction Metabuli, TheiaViral_Illumina_PE, TheiaViral_ONT
ncbi_taxon_name String NCBI taxonomy name of inputted taxon following rank extraction Metabuli, TheiaViral_Illumina_PE, TheiaViral_ONT
nextclade_aa_dels String Amino-acid deletions as detected by NextClade. Will be blank for Flu TheiaCoV_ClearLabs, TheiaCoV_FASTA, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaCoV_ONT, TheiaViral_Illumina_PE, TheiaViral_ONT
nextclade_aa_dels_flu_h5n1 String Amino-acid deletions as detected by NextClade. Specific to flu; it includes deletions for H5N1 whole genome TheiaCoV_FASTA, TheiaCoV_Illumina_PE, TheiaCoV_ONT
nextclade_aa_dels_flu_ha String Amino-acid deletions as detected by NextClade. Specific to flu; it includes deletions for HA segment TheiaCoV_FASTA, TheiaCoV_Illumina_PE, TheiaCoV_ONT, TheiaViral_Illumina_PE, TheiaViral_ONT
Expand Down
Loading
Loading