Comments (11)
It is normal that some reads are not able to be assigned to a region due to ambiguous pairing. Please refer to the Genomic Alignments manual, subsection Ambiguous pairing for further information: https://bioconductor.org/packages/devel/bioc/manuals/GenomicAlignments/man/GenomicAlignments.pdf
from drop.
It looks like about 10% of my reads are ambiguous pairing. Is that a normal % ?
from drop.
I checked one of my samples and I got 3.2% of ambiguous pairings. We use a poly-A enriched protocol, if you do total RNA you might get more ambiguous pairings. Have a look at the 'Obtained Read Counts' and 'Obtained Read Count Ratio' in the Count Summary html. You can see how those numbers look for the test dataset: https://www.cmm.in.tum.de/public/paper/drop_analysis/webDir/html/AberrantExpression/Counting/v29/Summary_all.html
from drop.
I also have polyA enriched. How do these graphs show the ambiguous pairings %? What does 'Obtained read counts' and 'Obtained read count ratio' mean?
from drop.
Those graphs show the total reads that were counted, and the proportion reads counted / total reads in the BAM file. Reads that were not counted can be due to falling in intronic or intergenic regions, or being ambiguous. Unfortunately, currently, we make no distinction between them.
from drop.
but from your Expressed genes plot, I saw that you have ~13K expressed genes, which goes in line with other count matrices I've analyzed.
from drop.
It doesn't quite make sense that "Obtained read count ratio" is proportion reads counted/total reads, becasue in your above Summary_all.html, the proportions are only about 35-45%. That is much too low to be due to intronic or ambiguous pairings. Is there another explanation?
from drop.
Any update about the above?
from drop.
Another reason is that, by default, we use the mode IntersectionStrict (https://htseq.readthedocs.io/en/release_0.11.1/count.html) which considers reads that fully overlap a region. That parameter can be adjusted in the config file, to for example Union, which is more permissive.
from drop.
Which mode do you recommend?
from drop.
IntersectionStrict. In that way we are sure that a read is assigned to a gene only if it fully overlaps it. Reads that span across junctions can actually mean aberrant splicing (eg exon elongation) and are counted in the aberrant splicing module.
Also, the default parameter of inter_feature is FALSE, which means that reads that fully overlap 2 different regions will be assigned to both.
from drop.
Related Issues (20)
- Running pipeline offline in trusted research environemnt HOT 1
- lymphoblastoid cell lines datasets of gene counts
- Error running aberrantSplicing HOT 2
- Error in AberrantSplicing_pipeline_FRASER_04_fit_hyperparameters_FraseR_R HOT 3
- Error in MAE QC create matrix dna rna cor HOT 3
- CalledProcessError in installRPackages.R HOT 5
- Default running folder is out of space HOT 2
- Chunk options `#+echo` not correctly parsed HOT 1
- Incompatible with `Snakemake>=8` HOT 2
- Writing `rds` files as log can crash Snakemake execution HOT 1
- Error in h(simpleError(msg, call)) HOT 8
- requirementsR.txt referencing HEAD leads to irreproducibility / pipeline breaking HOT 3
- Problem running DROP HOT 2
- Annotation file asks for columns that shouldn't be needed HOT 2
- Error in rule AberrantSplicing_pipeline_Counting_01_1_countRNA_splitReads_samplewise_R HOT 10
- Pipeline fails with no significant results (AberrantSplicing_pipeline_FRASER_08_extract_results_FraseR_R) HOT 1
- Error in rule AberrantSplicing_pipeline_Counting_01_1_countRNA_splitReads_samplewise_R HOT 2
- Pipeline FAILS when specifying subsets of genes to test HOT 1
- useNames = NA is defunct HOT 4
- conda setup using yaml doesn't work HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from drop.