New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Couldn't automatically detect the sequence identifier field in the fastq id string. #701
Comments
What is the output of What command did you use to convert from sra format to fastq? i.e. the |
Aha! I downloaded the files using package SRAdb the output of Do I need to download the files again as SRA then convert to fastq? |
I would at least try that on one file to see if that fixes this issue. |
Hi, I wanted to re-open this because I am having a similar issue. I'm using paired-end sequence data sequenced on Illumina MiSeq and also downloaded from the NCBI SRA database. I downloaded the files originally as SRA files and then converted them to zipped fastq files (fastq.gz) using fastq-dump with a flag to make sure each sample had separate files for the forward and reverse reads. I'm getting the same error when I run DADA2 on these files: The head of my one of my fastq files I'm reading into DADA2 looks like this: Do you know what might be going on and how I could fix this issue? |
This error is because the original fastq id lines have been replaced by these SRA id lines, which Do you need to use the |
Thank you for the quick reply. It looks like that solved the issue! |
I'm having a similar problem with the SRA id lines, except i do require the matchIDs = TRUE flag. What then? |
@d-callan Unfortunately I'm not sure if there is a solutions in that case. The original IDs are required to match the paired reads together if they are now in different orders. |
thanks anyhow. I'm not convinced they are truly ordered differently. but im finding there are definitely differing number of read counts for forward and reverse. perhaps i can put together a script quickly to remove those reads which dont have a partner before passing to dada2 and see where that gets me. was mostly just hoping i might not have to.. |
Hi apologies for resurrecting an old thread, I was just wondering if you managed to find a solution to this? as I've found myself in the same situation |
I am also meeting a similar issue with the head of fastq files. They were obtained by Illumina MiSeq, not downloaded from the NCBI SRA database. Is there something wrong with the head that can't be detected? |
Hi,
I'm looking at sequence data downloaded from the NCBI SRA database. When running filterAndTrim I get he following error:
After looking at the source code I tried inserting a dummy identifier, so instead of the identifier reading @SRR9876543.1 1/1, it would read @M012345:SRR9876543.1 1/1, but this didn't work.
Could you give me a suggestion how I can get around this?
Many thanks.
The text was updated successfully, but these errors were encountered: