Code Monkey home page Code Monkey logo

Comments (9)

xunchen85 avatar xunchen85 commented on June 25, 2024

yeah, there is a bug in the script, but it will be corrected after you run the validation step.

Xun

from vicaller.

junyanzho avatar junyanzho commented on June 25, 2024

Hi, xunchen,

Maybe I didn’t quite understand what you said. Which columns are right? which wrong and corrected by validation?
I try to run validate function with one integration using below command:
VIcaller.pl validate
-c VIcaller.config -i Sample
-S Sample_18_72481819_72483840_hepatitis_b_virus_21326584
-G 21326584 -V hepatitis_b_virus -t 10

After run it without error, the No._chimeric_reads and No._split_reads still 0, but No._reads_supporting_VI is 61. Seems still not consistently.

For virus integration detection, should validation and allele fraction be required? Can I only do detection analysis?

Thanks and hope relpy
JY

from vicaller.

xunchen85 avatar xunchen85 commented on June 25, 2024

I see, after you run the validation step, it will add a few columns, including columns of validation_chimeric and Validation_split. The sum of these columns should be "61". It should give you the same information. My original idea is that because some of the reads can not be successfully validated, thus the detected chimeric and split reads may not be correct. Indeed, the sum of validated chimeric and split reads should be mainly considered unless you want keep as more candidates as you want.

Regarding your another question, yes, you can run "detect" function to detect viral integration and run "calculate" function to obtain the allele fraction.

Best,
Xun

from vicaller.

xunchen85 avatar xunchen85 commented on June 25, 2024

I will soon correct the two columns in the main VIcaller script which will use the original number.

Xun

from vicaller.

junyanzho avatar junyanzho commented on June 25, 2024

Dear Xun,
I didn't found column name "validation_chimeric" and "Validation_split". For other similar names, sum is not equal to 61.

No._reads_supporting_VI Average_alignment_score Is_cell_line_contamination Is_vector Validation_chimeric_confident Validation_chimeric_weak Validation_chimeric_false Validation_split_confident Validation_split_weak Validation_split_false
61 89.18033 - - 100 0 0 32 0 0

Best!

JY

from vicaller.

xunchen85 avatar xunchen85 commented on June 25, 2024

I'm wondering if you can share more info. I only see the inconsistency for the supporting reads, but it is hard for me to follow and address the potential issues without additional information. I am also not quite sure if you successfully run the validation step.

Validated reads were extracted from the visualization figure. Thus you can first check the corresponding visualization record. You can count how many unique reads, how many chimeric and split reads there, 61 or 132? If you are not sure, you can share me the screenshot, visualization file, fuq file, output file, that i can help debug it.

If you run script step by step, it may also bring some potential issues, especially when you modified the script on your own.

Best,
Xun

from vicaller.

junyanzho avatar junyanzho commented on June 25, 2024

Hi xunchen,

  1. I checked the sample.visualization file, number of lines starts with 'O2' is 132 and sequence from 'seq0' to 'seq132'.
  2. After revised script<Result_visual3-3.pl>, I run detect function as your manual recommend, not step by step, and the run log seems no error.

Regards,
JY

from vicaller.

junyanzho avatar junyanzho commented on June 25, 2024

Hi xunchen,
I found some information:

  1. No._reads_supporting_VI from output is field 39 of file sample.virus_f2.
  2. Validation reads counted from file sample.visualization that is from file sample_f2
  3. sample_f2 -> sample_f22 -> sample.virus_f -> sample.virus_f2 through multiple process.
  4. Total validation reads is 132, while No._reads_supporting_VI is 61.

Regards
JY

from vicaller.

xunchen85 avatar xunchen85 commented on June 25, 2024

From what you described, you may check or validate the wrong visualizationg record.

You can double check the GI number, because if there are multiple candidate GIs detected, we will keep all of them. Meanwhile, you also can check if there is another record show similar or the same number of reads as "61".

if you use your own customized library, that may also be the issue with the viral ref name which may cause the inconsistency.

Xun

from vicaller.

Related Issues (12)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.