Sooner or later I would also like to move from Ram's format where uninformative fastq lines are removed before putting the records on one line to my more straight-forward "tabbed fastq" format where nothing is removed and records are jammed together like this (for paired-end):
samtools fastq $b | paste - - - - - - - - | hadoop fs -put - ${b%.bam}.tfq
(or ... | mc4 | hadoop fs ... ....tfq.mc4 with mc4 compression.) Which is much less hassle to handle through command line tools.
While readtools support for this is not so urgent, maybe it would not be so difficult to already include an option to produce this format?
Sooner or later I would also like to move from Ram's format where uninformative fastq lines are removed before putting the records on one line to my more straight-forward "tabbed fastq" format where nothing is removed and records are jammed together like this (for paired-end):
samtools fastq$b | paste - - - - - - - - | hadoop fs -put - $ {b%.bam}.tfq
(or ... | mc4 | hadoop fs ... ....tfq.mc4 with mc4 compression.) Which is much less hassle to handle through command line tools.
While readtools support for this is not so urgent, maybe it would not be so difficult to already include an option to produce this format?