Comments (7)
You have to run vg pack
on the same graph you mapped to with vg giraffe
. In this case, that woudl appear to be all.giraffe.gbz
.
from vg.
You have to run
vg pack
on the same graph you mapped to withvg giraffe
. In this case, that woudl appear to beall.giraffe.gbz
.
But I want to use vg call --vcf
to do genotyping. As here mentioned,vg call --vcf
only works on the the xg
file, not gbz
file.
And my confusion is : should not vg call
and vg pack
use same graph?
Thanks a lot for your help and time!
from vg.
If you really want the xg, you can make it with vg convert -Hx all.giraffe.gbz > all.giraffe.xg
and that will work with vg call / pack
.
The important thing is to make sure that the xg and gbz are identical (which doesn't necesarily happen if they are created independently).
from vg.
If you really want the xg, you can make it with
vg convert -Hx all.giraffe.gbz > all.giraffe.xg
and that will work withvg call / pack
.The important thing is to make sure that the xg and gbz are identical (which doesn't necesarily happen if they are created independently).
Hi, I tried convert all.giraffe.gbz
to all.giraffe.xg
by using vg convert -Hx all.giraffe.gbz > all.giraffe.xg
. And then I executed:
vg pack --xg all.giraffe.xg --packs-out sample.pack --gam sample.gam
vg call all.giraffe.xg --pack sample.pack --vcf ref.vcf --snarls all.snarls > sample.geno.vcf
But a empty result vcf returned with the log:
[VCFTraversalFinder] Warning: No alt path (prefix=_alt_80bfe3aa8146b33a31c5b8045e279c732771b806_) found in graph for variant
....
What I can guarantee is that the vcf used in the call and the vcf used when generating gbz are the same.
And the snarls
index file which I pre-computed is generated from all.giraffe.xg
using: vg snarls all.giraffe.xg > all.snarls
.
So I am confused that the variation cannot be found in the graph. How can I solve this problem?
Thanks for your help!
from vg.
Hmm, this issue seems to be coming up a lot for some reason. I think we need to improve the documentation. If you want to use -v
with vg call
you cannot use vg autoindex
. You must follow this particular recipe exactly
from vg.
First of all, thank you very much for grand help, which has eased my anxiety :)
Well, combining the above information, I have two roads to choose from:
-
Follow this issue's recipe #3974 (comment) to regenerate the
gbz/dist/min
index file and re-perform thegiraffe
for hundreds of samples. This has already taken 2 months for me, so I don't think it is a wise choice. -
Take
gbz
graph to performvg calls
and use the-a
parameter for multiple samples. For this method, my confusion is, how different are the coordinates of such a result and the result of usingvg call -v ref.vcf
? And how to balance the difference? If there is not much difference between the two, is it more recommended to usegbz
, which contains known haplotype information, instead of vcf?
Do you have any suggestions?
Some proposals about wiki/docs/Q&A
As for the upgrade of documentation and wiki, I think it is very necessary.
The ability of giraffe
is quite excellent, so I used the autoindex
method highly recommended on the wiki for this, but for subsequent pack
and call
, only gbz/dist/min
index file is not enough (if vg call -v ref.vcf
is used). These documents confuse users, some pages even stagnate in 2017, and most importantly, vg is a team/project that updates quickly, so it is crucial to keep the documents updated, and for some popular bioinformatics analysis, such as population NGS genotype identification, I hope there are some best practice tutorials that will greatly help users.
During the period of using vg, I also put forward some issues for vg to help improve, such as : errors in building distance index
: #3884 .and distance Index file permission problem
: #3865 . In this process, I am glad to see that the efficient vg team is constantly building better software, which has given me great help and support.
I have noticed that many similar issues have been opened, which will confuse users. Is it possible to provide a dedicated online community channel(like Discord) to allows users to get help from the vg team and other users?
Best wishes.
from vg.
For your question: I think you are best to just call with vg call -az
. This will use the haplotypes in the GBZ. Your vcf records won't be identical to the input VCF but they will be equivalent, and it will probably do a better job in complex sites.
Because the vg tools are under development and changing all the time, it has been challenging to maintain the documentation. But I am in complete agreement with you that they need some work now. But we try to support people the best we can here on github.
from vg.
Related Issues (20)
- Giraffe alignment is very slow and produces warning[vg::Watchdog] messages unless rescue is disabled HOT 13
- Giraffe needs to better deal with systems where multiple processes opening the distance index causes slowdowns
- Any option like BWA MEM '-C'? HOT 1
- vg convert failes while converting gbz to xg with signal 6 error HOT 8
- Release vg v1.60.0
- `vg convert --vg-algorithm` loses start coordinates of paths in its W lines HOT 3
- The Amazing Uninjectable SAM HOT 1
- Throw an error "error: trying to access a snarl tree node of the wrong type" in `haplotypes` HOT 11
- Get input file error HOT 5
- vg pack error
- Negative or "*" end-position for named-coordinates gaf with giraffe HOT 1
- vg autoindex vg giraffe pggb gfa HOT 1
- Hi, I have a question about vg augment. HOT 5
- sdsl-lite cmake issue HOT 3
- Error when trying to map reads to a spliced-pangenome HOT 8
- Different BAM alignments when using an Ubuntu 20.04 base image versus a 22.04 base image
- Issue Converting PGGB GFA Files for Panacus HOT 4
- confused about the number of genome and gff files being provided to vg autoindex while mapping RNA data HOT 3
- Hello, I have a question about vg map. HOT 1
- Update vg rna to project reasonably onto haplotypes containing CNVs
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vg.