Comments (5)
Although vg sim
can run with long read input, it's really designed for short reads. If you use it to generate long reads, you won't get very realistic errors or a realistic read length distribution. In our own testing and development, we've used pbsim
to simulate long reads. You would probably want to generate the reads from FASTAs of sample haplotypes, rather than directly from the GBZ file.
from vg.
Thank you!
And can I use vg sim
and the .gbz
file to generate short reads using vg sim -x graph.xg **-g graph.gbz** -m SAMPLE -n 1000 -l 150 -a > SAMPLE.gam
?
Now I have the .gbz
file of all chromosomes pangenome
graph. And I want to generate short reads only for chr21
. Do I need to withdraw the .gbz file of chr21? I don't find Related command.
from vg.
Although
vg sim
can run with long read input, it's really designed for short reads. If you use it to generate long reads, you won't get very realistic errors or a realistic read length distribution. In our own testing and development, we've usedpbsim
to simulate long reads. You would probably want to generate the reads from FASTAs of sample haplotypes, rather than directly from the GBZ file.
I'm simulating long reads using pbsim3
and the output is .maf
file. If I want to do some simulation experiment such as calling SV based on the simulation reads, can I use the maf
file as the truth set
? Or use some public truth set?
Do you have any suggestions?
from vg.
Looking through our script, it seems that we used the maf2sam
subcommand of bioconvert
.
from vg.
@tanger-code If you want to simulate from just one named path in the graph, you can use the -P
option to vg sim
.
But that simulates from just that path; it won't include variants in the graph that leave the embedded path.
I don't think we have a way to simulate from the connected component of the graph that contains a path, other than using vg chunk --components -p name-of-path
to pull out that subgraph and then simulating from it.
from vg.
Related Issues (20)
- Throw an error "error: trying to access a snarl tree node of the wrong type" in `haplotypes` HOT 11
- Get input file error HOT 5
- vg pack error
- Negative or "*" end-position for named-coordinates gaf with giraffe HOT 1
- vg autoindex vg giraffe pggb gfa HOT 1
- Hi, I have a question about vg augment. HOT 5
- sdsl-lite cmake issue HOT 3
- Error when trying to map reads to a spliced-pangenome HOT 8
- Different BAM alignments when using an Ubuntu 20.04 base image versus a 22.04 base image
- Issue Converting PGGB GFA Files for Panacus HOT 4
- confused about the number of genome and gff files being provided to vg autoindex while mapping RNA data HOT 3
- Hello, I have a question about vg map. HOT 1
- Update vg rna to project reasonably onto haplotypes containing CNVs
- Error When Running "vg gbwt --gbz-format -g hprc-v1.0-mc-grch38.gfa.gz -G hprc-v1.0-mc-grch38.gfa" HOT 1
- Memory limit exceeded during vg autoindex for GCSA/LCP indexing
- Memory limit exceeded during vg autoindex for GCSA/LCP indexing HOT 12
- Read coverage of haplotype sampling HOT 2
- Unrecoverable error in GCSA2 indexing HOT 1
- Release vg v1.61.0
- vg giraffe invalid alignment: Edit erroneously claims mismatch on node HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vg.