1. Per Base Sequence Quality
Warning
A warning will be issued if the lower quartile for any base is less than 10, or if the median
for any base is less than 25.
Failure
This module will raise a failure if the lower quartile for any base is less than 5 or if the
median for any base is less than 20.
2. Per Sequence Quality Scores
Warning
A warning is raised if the most frequently observed mean quality is below 27 - this equates
to a 0.2% error rate.
Failure
An error is raised if the most frequently observed mean quality is below 20 - this equates
to a 1% error rate.
3. Per Base Sequence Content
Warning
This module issues a warning if the difference between A and T, or G and C is greater
than 10% in any position.
Failure
This module will fail if the difference between A and T, or G and C is greater than 20% in
any position.
4. Per Base GC Content
Warning
This module issues a warning it the GC content of any base strays more than 5% from the
mean GC content.
Failure
This module will fail if the GC content of any base strays more than 10% from the mean
GC content.
5. Per Sequence GC Content
Warning
A warning is raised if the sum of the deviations from the normal distribution represents
more than 15% of the reads.
Failure
This module will indicate a failure if the sum of the deviations from the normal distribution
represents more than 30% of the reads.
6. Per Base N Content
Warning
This module raises a warning if any position shows an N content of >5%.
Failure
This module will raise an error if any position shows an N content of >20%.
7. Sequence Length Distribution
Warning
This module will raise a warning if all sequences are not the same length.
Failure
This module will raise an error if any of the sequences have zero length.
8. Duplicate Sequences
Warning
This module will issue a warning if non-unique sequences make up more than 20% of the
total.
Failure
This module will issue a error if non-unique sequences make up more than 50% of the
total.
9. Overrepresented Sequences
Warning
This module will issue a warning if any sequence is found to represent more than 0.1% of
the total.
Failure
This module will issue an error if any sequence is found to represent more than 1% of the
total.
10. Overrepresented Kmers
Warning
This module will issue a warning if any k-mer is enriched more than 3 fold overall, or more
than 5 fold at any individual position.
Failure
This module will issue a error if any k-mer is enriched more than 10 fold at any individual
base position.
reference:
Quality control: Assessing FASTQC results
Introduction to bulk RNA-seq
hbctraining.github.io
'Bioinformatics' 카테고리의 다른 글
Check list before using bioinformatics tools (0) | 2023.05.08 |
---|---|
CNV classification tools (0) | 2023.05.02 |
Bedtools v2.17.0 vs v2.30.0 비교 [coverage] (0) | 2023.02.23 |
NA12878 fastq download (0) | 2023.02.16 |
bioinformatics tools installation (2023 최신버전) (0) | 2023.02.07 |