Code Monkey home page Code Monkey logo

Comments (2)

umahsn avatar umahsn commented on August 25, 2024

It is present in the snp_stats file because it was picked up as a candidate site, but not included in the vcf file because it was determined to be false positive. NanoCaller calculated probability of presence of A base =0.1847 which is too low for a variant call. Are you using Nanopore reads or PacBio? It might help to zoom out on IGV to see the surrounding 1-2000 bp for a better understanding of why this was regarded as false positive.

We are planning to release an update which allows you to get a different snapshot of the bam file than IGV, similar to the one in Fig1 of our biorxiv paper. It will show you only the high alternative allele frequency sites and skips other bases, and this would allow you to see if an allele might be false or not.

from nanocaller.

Mailinnia avatar Mailinnia commented on August 25, 2024

If I filter for only primary read mappings, then it calls the variant fine:
snps.vcf:
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT SAMPLE
Chr_22 42131531 . G A 38.810 PASS . GT:DP:FQ 0/1:45:0.4444

snp_stats:
pos,ref,prob_GT,prob_A,prob_G,prob_T,prob_C,DP,freq
42131531,G,0.9283,0.5908,0.9420,0.0000,0.0000,45,0.4444

igv_snapshot

I'm trying to understand why it calculates the probability of presence of A base to be so low when the supplementary reads are included.
I'm guessing including the supplementary reads introduces too much 'noise' in the surrounding area?

I'm using ONT data that has been corrected with Netcat. I just wanted to compare the variant callings with and without error correction.

from nanocaller.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.