-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fqtools type
incorrectly typing fastq
#14
Comments
Development on |
Yes it has stopped for now, as I’ve got to do my main job :-)
What do you mean by typing fastq?
Alastair
…On 19 Jul 2018, 21:03 +0100, Nick Youngblut ***@***.***>, wrote:
Development on fqtools seems to have stopped. Anyone know of a good alternative tool for typing fastq files?
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Yeah, main jobs do tend to get in the way :)
|
I’ve had a brief look, but the detection algorithm is doing what I told it to. This indicates that my logic is wrong.
I’ll try to sort this when I’m back at the office; mid August.
Thanks,
Alastair
…On 19 Jul 2018, 21:38 +0100, Nick Youngblut ***@***.***>, wrote:
Yeah, main jobs do tend to get in the way :)
fqtools type doesn't always type correctly, at least as needed for correct conversion of fastq formats with Bio.SeqIO. I'm probably just going to add a function to my fastq format conversion script that will first type the input format as designated by fqtools type, and then if that doesn't work, just try all fastq input formats until one works.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Great! Here's an example of different classifications for read1 vs read2:
|
It appears that the current algorithm for
fqtools type
can get the fastq quality format wrong. Here's a reproducible example:If
Bio.SeqIO
is then used to read these fastq files with the "type" specified byfqtools type
, then the following error occurs:Maybe using the min & max of qual values (the full range) for all sequences in the fastq file would help prevent these mis-calls?
The text was updated successfully, but these errors were encountered: