Skip to content
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
104 commits
Select commit Hold shift + click to select a range
462c77e
add metabuli
xonq Feb 7, 2026
6bfeb34
major version update
xonq Feb 10, 2026
b90a85f
taxon id optional for read extraction
xonq Feb 10, 2026
98fb69a
develop standalone wf through to testing
xonq Feb 10, 2026
26cf153
fix miniwdl check discrepancies
xonq Feb 10, 2026
bf38c3c
update read1 select first
xonq Feb 10, 2026
6cf3b28
change inputs for trimming
xonq Feb 10, 2026
f21bbab
merge fastp_pe and fastp_se
xonq Feb 10, 2026
04b4762
call fastp correct
xonq Feb 10, 2026
e3f8e7c
select_first optional output
xonq Feb 10, 2026
7e4c410
Update fastp task parameters for congruence with trimmomatic, disable…
xonq Feb 10, 2026
e1fc4d8
trim adapters by default
xonq Feb 10, 2026
468a9a5
Use select_first for read2 input in fastp call
xonq Feb 10, 2026
93f4e7c
add testing data
xonq Feb 11, 2026
d649f15
send it v1.1
xonq Feb 11, 2026
7bff258
1.0
xonq Feb 11, 2026
a5614d1
read 2 optional input circumvention
xonq Feb 11, 2026
98fb3f0
setup for SE
xonq Feb 11, 2026
1cd6354
fix output harvesting for SE
xonq Feb 11, 2026
d1c3a73
bash syntax correction
xonq Feb 11, 2026
2773061
new input vals
xonq Feb 11, 2026
1418943
add seqmode input
xonq Feb 11, 2026
7acc48a
change input naming
xonq Feb 11, 2026
6bc7e76
fix fastp trim adapters
xonq Feb 11, 2026
4a5e528
call_trim default on, fix implicit propagation
xonq Feb 11, 2026
b6058d9
add metabuli and update versions
xonq Feb 11, 2026
74dd730
init metabuli wf
xonq Feb 11, 2026
77752c7
restrict to ONT input
xonq Feb 11, 2026
45548cd
update with exposed options for wf
xonq Feb 11, 2026
a8429b5
update with illumina explanation
xonq Feb 11, 2026
425fec8
update default taxdump
xonq Feb 11, 2026
7c21284
update implicit illumina
xonq Feb 11, 2026
1a94b3c
fix select_first
xonq Feb 11, 2026
c9864e3
update fastp discussion, set adapter trimming to on by default
xonq Feb 11, 2026
78db6d1
update scoring explanation
xonq Feb 11, 2026
778bbcc
add workflow
xonq Feb 11, 2026
036d61c
get it straight
xonq Feb 11, 2026
d4924a5
update formatting
xonq Feb 11, 2026
ebeed3e
update tables
xonq Feb 11, 2026
e47032c
use concordant fastp output naming
xonq Feb 11, 2026
c6c6bac
update outputs
xonq Feb 11, 2026
64f2fc5
update database name
xonq Feb 11, 2026
c7f017e
add optional taxon input
xonq Feb 11, 2026
b3eb8f2
add ete4
xonq Feb 11, 2026
517e221
add taxon standardization
xonq Feb 11, 2026
46eeef3
add taxon
xonq Feb 11, 2026
0cfa25f
integrate metabuli status
xonq Feb 11, 2026
b7c3a9d
fix metabuli status quotes
xonq Feb 11, 2026
c0481f9
no reads recovered, error
xonq Feb 11, 2026
e75b901
attempt to fix terra grep query
xonq Feb 11, 2026
fc92dab
update table
xonq Feb 11, 2026
92b69eb
update disk size for gtdb
xonq Feb 12, 2026
c6a3742
update to bbp compatible name
xonq Feb 12, 2026
c629bb4
update i/o docs
xonq Feb 12, 2026
eff6183
update descriptions for metabuli workflow variables in input and outp…
xonq Feb 12, 2026
7a37fd4
rm md5s
xonq Feb 12, 2026
33bafc0
burning md5s
xonq Feb 12, 2026
cf6a19e
add example metabuli report
xonq Feb 12, 2026
4cc50ce
revert to 100 GB until discussed
xonq Feb 12, 2026
656e029
use metabuli status correctly
xonq Feb 12, 2026
5ead490
fix read1 propagation based on metabuli
xonq Feb 12, 2026
11d797c
one b
xonq Feb 12, 2026
aa35947
query report, not classifications
xonq Feb 12, 2026
5e3c826
Push Metabuli diagram
cimendes Feb 13, 2026
6228f09
fix workflow type for metabuli
cimendes Feb 16, 2026
fcb0e30
add warning for database size and disk space
cimendes Feb 16, 2026
62bba46
fiz disk size input
cimendes Feb 16, 2026
1b356e7
reorder things for readability
cimendes Feb 16, 2026
a56dddc
update disk size to accomodate gtdb
xonq Feb 18, 2026
db2527f
add metabuli_db input requirement
xonq Feb 18, 2026
a694160
update testing files
xonq Feb 18, 2026
689ae32
update i/o
xonq Feb 18, 2026
0f36a08
update db path and add decompressed size
xonq Feb 18, 2026
303a483
update default and ram/disk size
xonq Feb 18, 2026
449aa0e
init fastplong offload porechop
xonq Feb 19, 2026
6056b26
comment inputs
xonq Feb 19, 2026
70fdd40
init
xonq Feb 19, 2026
fbc6998
update input names
xonq Feb 19, 2026
22c7ebb
add fastplong
xonq Feb 19, 2026
629e34c
correct input
xonq Feb 19, 2026
2365d2e
update i/o docs
xonq Feb 19, 2026
f6fe068
update id
xonq Feb 19, 2026
751fabb
add ont bacterial
xonq Feb 19, 2026
fbb2a10
update with fastplong
xonq Mar 2, 2026
580f23c
add info on database size
xonq Mar 2, 2026
01803ea
update max_ram usage
xonq Mar 4, 2026
2426976
allow taxon name input
xonq Mar 4, 2026
66e1378
use taxon name instead of ID for common hits
xonq Mar 4, 2026
c73eda7
explicitly declare name
xonq Mar 4, 2026
6e7d09b
add gtdb taxdump
xonq Mar 4, 2026
5ec65e1
float taxdump to front
xonq Mar 4, 2026
5e20542
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 4, 2026
372e1a0
change inputs.tsv
xonq Mar 4, 2026
6507d45
taxdump explanation
xonq Mar 4, 2026
4d44164
rm default for optional input
xonq Mar 5, 2026
493366c
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 5, 2026
7fd14a3
remove sc2
xonq Mar 13, 2026
54e0558
Merge remote-tracking branch 'origin/main' into kzm-metabuli-dev
xonq Mar 20, 2026
51f49f7
rm extra
xonq Mar 20, 2026
add9043
rm line
xonq Mar 20, 2026
29bf5b6
update io
xonq Mar 20, 2026
223aa4f
simplify percent_human and target_lineage acquisitions
xonq Mar 23, 2026
eaa3c1f
change to fastplong
xonq Mar 23, 2026
b8953a0
harden regex
xonq Mar 23, 2026
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
update i/o docs
  • Loading branch information
xonq committed Feb 19, 2026
commit 2365d2e5b4b6d30b2c6938cdc1b920b87735a706
32 changes: 24 additions & 8 deletions docs/assets/tables/all_inputs.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,7 @@ mercury_prep_n_batch sample_names Array[String] The samples you want to submit
mercury_prep_n_batch terra_project_name String The name of your Terra project. You can find this information in the URL of the webpage of your Terra dashboard. For example, if your URL contains #workspaces/example/my_workspace/ then your project name is example Required Mercury_Prep_N_Batch general
mercury_prep_n_batch terra_table_name String The name of the Terra table where your samples can be found. Do not include the entity: prefix, the _id suffix, or the _set_id suffix, just the name of the sample-level data table as listed in the sidebar on lefthand side of the Terra Data tab. Required Mercury_Prep_N_Batch general
mercury_prep_n_batch terra_workspace_name String The name of your Terra workspace where your samples can be found. For example, if your URL contains #workspaces/example/my_workspace/ then your project name is my_workspace Required Mercury_Prep_N_Batch general
metabuli_wf metabuli_db File Metabuli database file Required Metabuli general
metabuli_wf metabuli_db File Required Metabuli general
metabuli_wf read1 File FASTQ file containing read1 sequences Required Metabuli general
metabuli_wf samplename String The name of the sample being analyzed Required Metabuli general
nextclade_batch assembly_fastas Array[File] The assembly files for your samples in FASTA format Required Nextclade_Batch general
Expand Down Expand Up @@ -747,6 +747,20 @@ fastp fastp_quality_trim_score Int Minimum mean base quality relative to window
fastp fastp_trim_adapters Boolean Enable adapter trimming via Fastp True Optional TheiaViral_Illumina_PE, TheiaViral_Panel general
fastp fastp_window_size Int Sliding window size for surveying read quality 4 Optional Metabuli, TheiaViral_Illumina_PE, TheiaViral_Panel general
fastp memory Int Amount of memory/RAM (in GB) to allocate to the task 8 Optional Metabuli, TheiaViral_Illumina_PE, TheiaViral_Panel runtime
fastplong cpu Int Number of CPUs to allocate to the task 4 Optional Metabuli runtime
fastplong cut_front Boolean Apply trimming options from 5' to 3' False Optional Metabuli general
fastplong cut_tail Boolean Apply trimming options from 3' to 5' False Optional Metabuli general
fastplong disk_size Int Amount of storage (in GB) to allocate to the task 100 Optional Metabuli runtime
fastplong docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/staphb/fastplong:0.4.1 Optional Metabuli docker
fastplong fastplong_adapter_fasta File FASTA of adapter sequences to trim Optional Metabuli general
fastplong fastplong_args String Additional arguments to use with fastplong Optional Metabuli general
fastplong fastplong_end_adapter String Adapter sequence to trim from the 3' end Optional Metabuli general
fastplong fastplong_min_length Int Minimum read length 15 Optional Metabuli general
fastplong fastplong_quality_trim_score Int Minimum mean base quality relative to window size 20 Optional Metabuli general
fastplong fastplong_start_adapter String Adapter sequence to trim from the 5' end Optional Metabuli general
fastplong fastplong_trim_adapters Boolean Enable adapter trimming via fastplong True Optional Metabuli general
fastplong fastplong_window_size Int Sliding window size for surveying read quality 4 Optional Metabuli general
fastplong memory Int Amount of memory/RAM (in GB) to allocate to the task 8 Optional Metabuli runtime
fastq_scan_clean cpu Int Number of CPUs to allocate to the task 1 Optional TBProfiler_tNGS, TheiaViral_Illumina_PE, TheiaViral_Panel runtime
fastq_scan_clean disk_size Int Amount of storage (in GB) to allocate to the task 50 Optional TBProfiler_tNGS, TheiaViral_Illumina_PE, TheiaViral_Panel runtime
fastq_scan_clean docker String Docker image to use for the task us-docker.pkg.dev/general-theiagen/biocontainers/fastq-scan:1.0.1--h4ac6f70_3 Optional TBProfiler_tNGS, TheiaViral_Illumina_PE, TheiaViral_Panel docker
Expand Down Expand Up @@ -1597,10 +1611,10 @@ merlin_magic virulencefinder_min_percent_coverage Float The threshold for minimu
merlin_magic virulencefinder_min_percent_identity Float Internal component, do not modify Optional TheiaEuk_Illumina_PE, TheiaEuk_ONT general
merlin_magic virulencefinder_min_percent_identity Float The threshold for minimum blast identity Optional TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT general
metabuli cpu Int Number of CPUs to allocate to the task 4 Optional Metabuli, TheiaViral_ONT runtime
metabuli disk_size Int Amount of storage (in GB) to allocate to the task 250 Optional Metabuli, TheiaViral_ONT runtime
metabuli disk_size Int Amount of storage (in GB) to allocate to the task 250 Optional TheiaViral_ONT runtime
metabuli docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/theiagen/metabuli:1.1.1 Optional Metabuli, TheiaViral_ONT docker
metabuli extract_unclassified Boolean True/False that determines if unclassified reads should be extracted and combined with the taxon specific extracted reads False Optional Metabuli general
metabuli memory Int Amount of memory/RAM (in GB) to allocate to the task 16 Optional Metabuli, TheiaViral_ONT runtime
metabuli memory Int Amount of memory/RAM (in GB) to allocate to the task 16 Optional TheiaViral_ONT runtime
metabuli metabuli_db File Metabuli database file gs://theiagen-public-resources-rp/reference_data/databases/metabuli/refseq_virus-v223.tar.gz Optional TheiaViral_ONT database
metabuli min_percent_coverage Float Minimum query coverage threshold (0.0 - 1.0) 0 Optional Metabuli, TheiaViral_ONT general
metabuli min_score Float Minimum sequenece similarity score (0.0 - 1.0) 0 Optional Metabuli, TheiaViral_ONT general
Expand All @@ -1609,6 +1623,8 @@ metabuli read2 File Internal component, do not modify Optional TheiaViral_ONT g
metabuli taxdump_path File Path to compressed taxonomy dump gs://theiagen-public-resources-rp/reference_data/databases/metabuli/ncbi_taxdump_20260211.tar.gz Optional Metabuli, TheiaViral_ONT general
metabuli_wf call_trim Boolean Call adapter and read trimming via Fastp (Illumina) or Porechop (ONT) True Optional Metabuli general
metabuli_wf illumina Boolean Input reads are Illumina - automatically inferred if read2 is populated, ONT is assumed otherwise Optional Metabuli general
metabuli_wf metabuli_disk_size Int Amount of storage (in GB) to allocate to the task 250 Optional Metabuli runtime
metabuli_wf metabuli_mem Int Amount of memory/RAM (in GB) to allocate to the task 32 Optional Metabuli runtime
metabuli_wf read2 File FASTQ file containing read2 sequences Optional Metabuli general
metabuli_wf taxon String NCBI taxonomy compatible taxon name/ID to enable read extraction Optional Metabuli general
metaspades_pe cpu Int Number of CPUs to allocate to the task 4 Optional TheiaMeta_Illumina_PE runtime
Expand Down Expand Up @@ -1881,11 +1897,11 @@ plasmidfinder memory Int Amount of memory/RAM (in GB) to allocate to the task 8
plasmidfinder method_path String Path to files for a user-specified method to use (blast or kma) Optional TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT general
plasmidfinder min_percent_coverage Float Threshold for minimum coverage, default threshold from PlasmidFinder CLI tool is used (0.60) 0.6 Optional TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT general
plasmidfinder min_percent_identity Float Threshold for mininum blast identity, default threshold from PlasmidFinder CLI tool is used (0.90). This default differs from the default of the PlasmidFinder webtool (0.95) 0.9 Optional TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT general
porechop cpu Int Number of CPUs to allocate to the task 4 Optional Metabuli, TheiaViral_ONT runtime
porechop disk_size Int Amount of storage (in GB) to allocate to the task 100 Optional Metabuli, TheiaViral_ONT runtime
porechop docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/staphb/porechop:0.2.4 Optional Metabuli, TheiaViral_ONT docker
porechop memory Int Amount of memory/RAM (in GB) to allocate to the task 16 Optional Metabuli, TheiaViral_ONT runtime
porechop trimopts String Additional trimming options for Porechop Optional Metabuli, TheiaViral_ONT general
porechop cpu Int Number of CPUs to allocate to the task 4 Optional TheiaViral_ONT runtime
porechop disk_size Int Amount of storage (in GB) to allocate to the task 100 Optional TheiaViral_ONT runtime
porechop docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/staphb/porechop:0.2.4 Optional TheiaViral_ONT docker
porechop memory Int Amount of memory/RAM (in GB) to allocate to the task 16 Optional TheiaViral_ONT runtime
porechop trimopts String Additional trimming options for Porechop Optional TheiaViral_ONT general
prep_augur_metadata cpu Int Number of CPUs to allocate to the task 1 Optional Augur_Prep runtime
prep_augur_metadata disk_size Int Amount of storage (in GB) to allocate to the task 10 Optional Augur_Prep runtime
prep_augur_metadata docker String The Docker container to use for the task us-docker.pkg.dev/general-theiagen/theiagen/utility:1.1 Optional Augur_Prep docker
Expand Down
20 changes: 12 additions & 8 deletions docs/assets/tables/all_outputs.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -322,13 +322,18 @@ fasta_utilities_fai File FASTA index file TheiaViral_ONT
fasta_utilities_samtools_docker String Docker image used for samtools in fasta utilities TheiaViral_ONT
fasta_utilities_samtools_version String Version of samtools used in fasta utilities TheiaViral_ONT
fastp_docker String Docker image used for fastp Freyja_FASTQ, Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report File The HTML report made with fastp Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report String The HTML report made with fastp Freyja_FASTQ
fastp_json_report File The JSON report made with fastp Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_json_report String The JSON report made with fastp Freyja_FASTQ
fastp_read1_trimmed File `read1` input trimmed by Fastp Metabuli
fastp_read2_trimmed File `read2` input trimmed by Fastp Metabuli
fastp_html_report File The HTML report conveying fastp results Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_html_report String The HTML report conveying fastp results Freyja_FASTQ
fastp_json_report File The JSON report conveying fastp results Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastp_json_report String The JSON report conveying fastp results Freyja_FASTQ
fastp_read1_trimmed File `read1` input trimmed by fastp Metabuli
fastp_read2_trimmed File `read2` input trimmed by fastp Metabuli
fastp_version String The version of fastp used Freyja_FASTQ, Metabuli, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaEuk_Illumina_PE, TheiaMeta_Illumina_PE, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaViral_Illumina_PE, TheiaViral_Panel
fastplong_docker String Docker image used for fastplong Metabuli
fastplong_html_report File The HTML report conveying fastplong results Metabuli
fastplong_json_report File The JSON report conveying fastplong results Metabuli
fastplong_read1_trimmed File `read1` input trimmed by fastplong Metabuli
fastplong_version String The version of fastplong used Metabuli
fastq_dl_date String The date of the read data download SRA_Fetch
fastq_dl_docker String The docker used SRA_Fetch
fastq_dl_fastq_metadata File File containing metadata of the provided accession such as submission_accession, library_selection, instrument_platform, among others SRA_Fetch
Expand Down Expand Up @@ -854,9 +859,8 @@ poppunk_gps_cluster String GPS cluster predicted by PopPUNK TheiaProk_FASTA, The
poppunk_GPS_db_version String Version of GPSC database used TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT
poppunk_gps_external_cluster_csv File GPSC v6 scheme designations TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT
poppunk_version String Version of PopPUNK used TheiaProk_FASTA, TheiaProk_Illumina_PE, TheiaProk_Illumina_SE, TheiaProk_ONT
porechop_read1_trimmed File FASTQ of `read1` input trimmed by Porechop Metabuli
porechop_trimmed_read1 File Trimmed reads from Porechop TheiaViral_ONT
porechop_version String Version of Porechop used Metabuli, TheiaEuk_ONT, TheiaProk_ONT, TheiaViral_ONT
porechop_version String Version of Porechop used TheiaEuk_ONT, TheiaProk_ONT, TheiaViral_ONT
prepped_ena_data File Prepared data formatted for ENA submission Terra_2_ENA
primer_bed_name String Name of the primer bed files used for primer trimming Freyja_FASTQ, TheiaCoV_ClearLabs, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE, TheiaCoV_ONT
primer_trimmed_read_percent Float Percentage of read data with primers trimmed as determined by iVar trim Freyja_FASTQ, TheiaCoV_Illumina_PE, TheiaCoV_Illumina_SE
Expand Down
Loading