fastp not removing all Illumina universal adapter sequences as indicated by FastQC #558

luckyvivi · 2024-04-15T14:37:47Z

Hi, I recently ran fastp on an Illumina dataset with the following command:
fastp -i SRR18278237.fastq.gz -o SRR18278237.fastp.gz -z 9 -l 15 -w 16 --dedup --dup_calc_accuracy 6 -x -3 --cut_mean_quality 20 -j SRR18278237.fastp.json -h SRR18278237.fastp.html

I expected that this command would remove the Illumina universal adapter sequences from the reads. However, after running FastQC on the output files, I'm still seeing a significant adapter content in the FastQC report, specifically towards the end of the reads (please see attached screenshot).

Could you please help me understand the following:

Is there a possibility that fastp might not remove some of the adapter sequences under certain conditions?
Do I need to specify the adapter sequences explicitly using the -a option, even though these are standard Illumina universal adapters?
Is there anything in my fastp command that might have prevented the adapter sequences from being adequately detected and trimmed?

I have attached the JSON and HTML reports from fastp for your reference. I would greatly appreciate any insights or suggestions you might have to resolve this issue.

Thank you for your assistance and for developing such a useful tool.

Best regards,
Xiaowen
Uploading SRR18278237 (1).fastp.zip…

The text was updated successfully, but these errors were encountered:

luckyvivi · 2024-04-15T14:39:01Z

SRR18278237 (1).fastp.zip

nreid · 2024-05-14T14:20:08Z

I have a similar issue, but with Nextera adapters. fastp says no contamination, FastQC says nextera, up to 10% by the read end. Even when I supply the Nextera fasta file (the one provided by trimmomatic) virtually no trimming happens.

Trimmomatic with ILLUMINACLIP:"${ADAPTERS}":2:30:10 SLIDINGWINDOW:4:25 MINLEN:45 and drops 7.25% of all reads.

This isn't a perfect comparison, I think fastp default min window Q is 20, not 25, but still. Something seems off here. I'm using v0.23.2.

realzhang · 2024-05-30T02:05:23Z

Same problem. Any suggestion is welcome. Thanks!

nreid · 2024-05-30T11:05:37Z

I switched back to fastqc/trimmomatic/fastqc. I'm removing fastp from my workflows.

There are also a couple concerning GitHub issues about reproducibility. I like the tool but I can't use it if these things aren't resolved.

hp399 · 2024-09-18T08:48:39Z

Hi, there~I met a similar problem and I figured out an explanation myself which at least works for mine.

The possible reason that Fastp does not recoginze and remove the adapter while FastQC detects is that R1 reads are shorter than 150bp, which means the adapter in R1.fastq.gz detected by FastQC is actually the reversed and complementary adapter of R2. So, in this situation, if you want to remove the adapter in R1 via Fastp, specify the adapter sequence in Fastp command with "-a reversed_and_complementary_adapter_sequence_of_Read2". And if you want to remove the adapter in R2, use the sequence of reversed and complementary adapter of R1.

When you have a library shorter than 150bp, Sequencer will keep reading bases after finishing your inserts and continue to read the bases according to the adapter of the opposite strand. My guess is that FastQC can detect those widely-used adapters both reversed or not while Fastp can't, which means Fastp can only auto-detect those widely-used adapters literally based on the sequences given.

I would suggest to play with Fastp with the sequence of the other strand adapter. Or you can simply extract some reads sequence and analyze it manually, to find where the adapter is and what actual it is.

Please feel free to let me know if I didn't make it clear or if it works for you. Thanks!

Liping-L mentioned this issue Oct 31, 2024

Adapter Detection Issue with Paired-End Data in fastp Leading to Incomplete Trimming #583

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fastp not removing all Illumina universal adapter sequences as indicated by FastQC #558

fastp not removing all Illumina universal adapter sequences as indicated by FastQC #558

luckyvivi commented Apr 15, 2024

luckyvivi commented Apr 15, 2024

nreid commented May 14, 2024

realzhang commented May 30, 2024

nreid commented May 30, 2024 •

edited

Loading

hp399 commented Sep 18, 2024

fastp not removing all Illumina universal adapter sequences as indicated by FastQC #558

fastp not removing all Illumina universal adapter sequences as indicated by FastQC #558

Comments

luckyvivi commented Apr 15, 2024

luckyvivi commented Apr 15, 2024

nreid commented May 14, 2024

realzhang commented May 30, 2024

nreid commented May 30, 2024 • edited Loading

hp399 commented Sep 18, 2024

nreid commented May 30, 2024 •

edited

Loading