Comments (3)
Dear Sangjin Lee,
I think that is a clever way.
So with the 90x coverage you can find often artifacts that simply accumulate. They will likely be marked as IMPRECISE and often have a genotype of 0/0 indicating a very low support or frequency of reads supporting them. The AF= tag gives you the allele frequency of each event.
Without knowing your sample I assume you will find alot of false insertion and translocation calls.
Translocation can indicate some evidence that some contigs/scaffolds could further be joined.
Insertions accumulate sometimes due to a base calling error/ polymerase stuttering during the sequencing. PacBio is working on it and it should get improved over the next version of the basecaller.
0/1 SVs can happen as these are the heterozygous SVs. As you know most of the assemblers only report one sequence as a representation.
So what I would do is to focus on the 1/1. These should either mark problems from NGMLR+ sniffles or form the assembly. You might want to pinpoint these regions and also look at the short read evidence for these regions.
I hope this is helping you. Otherwise feel free to ping me and we can chat.
Cheers
Fritz
from sniffles.
Hey Fritz,
Thank you for the swift response. This is a really helpful response. I would love to be able to discuss this further with you, but I would to do so in private as the data is unpublished. If there are any additional problems, can I send you an email through your baylor college of medicine email address?
I will concentrate on the homozygous genotypes for now, visualize the long-read bam file and short-read bam file using IGV. I guess I could potentially increase the number of minimum reads required for genotyping to reduce the number of false positive variants.
Best,
Sangjin Lee
from sniffles.
Yes feel free to ping me. The fastest is over my gmail: [email protected] ,but the bcm also works.
I think it would be great to stick the heads together and come up with a small pipeline that can automatically assess this. Just an idea, but I am happy to discuss further options.
Cheers
Fritz
from sniffles.
Related Issues (20)
- Detection of Reciprocal BNDs
- How to turn off joint genotyping in multisample calling
- Phailed SV phasing
- tandem repeat annotation for another human genome resource HOT 1
- Installation issue on apple M2 HOT 10
- Segmentation fault: 11 on modbam files (ONT) HOT 2
- Does sniffles work on haploid genomes
- variant allele frequency adjustment HOT 1
- Does Sniffles need cs tag HOT 1
- Difference between SUPPORT, DV and RNAMES HOT 3
- Invalid END position HOT 2
- Can't run _any_ version > 2.2 without an Exception. HOT 6
- Why the longest insertion is very short HOT 2
- <INS> sequence missing in ALT field and represented as C<INS>, T<INS>, A<INS>, G<INS> HOT 2
- Are calls made from supplementary alignments? HOT 1
- Allow the optional emission of sv length of evidence
- false genotyping large deletion HOT 3
- Fatal error when genotyping VCF with missing QUAL HOT 1
- Accessing dev releases/OSX Parallelism bug HOT 1
- Some "FILTER" lines not defined in raw VCF header
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sniffles.