Comments (9)
yeah, there is a bug in the script, but it will be corrected after you run the validation step.
Xun
from vicaller.
Hi, xunchen,
Maybe I didn’t quite understand what you said. Which columns are right? which wrong and corrected by validation?
I try to run validate function with one integration using below command:
VIcaller.pl validate
-c VIcaller.config -i Sample
-S Sample_18_72481819_72483840_hepatitis_b_virus_21326584
-G 21326584 -V hepatitis_b_virus -t 10
After run it without error, the No._chimeric_reads and No._split_reads still 0, but No._reads_supporting_VI is 61. Seems still not consistently.
For virus integration detection, should validation and allele fraction be required? Can I only do detection analysis?
Thanks and hope relpy
JY
from vicaller.
I see, after you run the validation step, it will add a few columns, including columns of validation_chimeric and Validation_split. The sum of these columns should be "61". It should give you the same information. My original idea is that because some of the reads can not be successfully validated, thus the detected chimeric and split reads may not be correct. Indeed, the sum of validated chimeric and split reads should be mainly considered unless you want keep as more candidates as you want.
Regarding your another question, yes, you can run "detect" function to detect viral integration and run "calculate" function to obtain the allele fraction.
Best,
Xun
from vicaller.
I will soon correct the two columns in the main VIcaller script which will use the original number.
Xun
from vicaller.
Dear Xun,
I didn't found column name "validation_chimeric" and "Validation_split". For other similar names, sum is not equal to 61.
No._reads_supporting_VI | Average_alignment_score | Is_cell_line_contamination | Is_vector | Validation_chimeric_confident | Validation_chimeric_weak | Validation_chimeric_false | Validation_split_confident | Validation_split_weak | Validation_split_false |
---|---|---|---|---|---|---|---|---|---|
61 | 89.18033 | - | - | 100 | 0 | 0 | 32 | 0 | 0 |
Best!
JY
from vicaller.
I'm wondering if you can share more info. I only see the inconsistency for the supporting reads, but it is hard for me to follow and address the potential issues without additional information. I am also not quite sure if you successfully run the validation step.
Validated reads were extracted from the visualization figure. Thus you can first check the corresponding visualization record. You can count how many unique reads, how many chimeric and split reads there, 61 or 132? If you are not sure, you can share me the screenshot, visualization file, fuq file, output file, that i can help debug it.
If you run script step by step, it may also bring some potential issues, especially when you modified the script on your own.
Best,
Xun
from vicaller.
Hi xunchen,
- I checked the sample.visualization file, number of lines starts with 'O2' is 132 and sequence from 'seq0' to 'seq132'.
- After revised script<Result_visual3-3.pl>, I run detect function as your manual recommend, not step by step, and the run log seems no error.
Regards,
JY
from vicaller.
Hi xunchen,
I found some information:
- No._reads_supporting_VI from output is field 39 of file sample.virus_f2.
- Validation reads counted from file sample.visualization that is from file sample_f2
- sample_f2 -> sample_f22 -> sample.virus_f -> sample.virus_f2 through multiple process.
- Total validation reads is 132, while No._reads_supporting_VI is 61.
Regards
JY
from vicaller.
From what you described, you may check or validate the wrong visualizationg record.
You can double check the GI number, because if there are multiple candidate GIs detected, we will keep all of them. Meanwhile, you also can check if there is another record show similar or the same number of reads as "61".
if you use your own customized library, that may also be the issue with the viral ref name which may cause the inconsistency.
Xun
from vicaller.
Related Issues (12)
- The seq.output of test differ depending on input. HOT 4
- viral database HOT 2
- Download RepBase for RepeatMasker HOT 1
- Vicaller.pl 1164 line is strange. HOT 9
- What is input file of calculate? HOT 1
- Extract spanning reads and junction reads around integration site HOT 3
- How to filter the list from the "detect" result HOT 3
- where to download HOT 2
- About input data HOT 1
- Tophat with --no-coverage-search HOT 4
- Result_visual3-3.pl ERROR HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vicaller.