Comments (6)
Hi Simone,
-
It makes sense that BCALM outputs way more unitigs than reference contigs, because each unitig is broken as soon as there is a repeated k-mer in the reference with a different base after and/or before.
-
Regarding the small difference in kmer counts, this is odd. Could you please run a third kmer counter, e.g. KMC3 https://github.com/refresh-bio/KMC to see if it agrees with one method or the other?
Rayan
from bcalm.
Hi Rayan,
- Yes absolutely, but I found weird that most unitigs are of length ~k, and that for k=25 I got almost 3 times the size of the reference. Still, this is of course possible.
- I will run it and post the results. UPDATE: The k-mer count given by KMC agrees with jellyfish's
Thank you for your quick answer!
Simone
from bcalm.
Hi Simone,
Regarding 2. wow yes, indeed. Thanks for reporting it, I am investigating this discrepancy in DSK counts.
from bcalm.
No problem, let me know if you need more infos to reproduce the error
from bcalm.
Hi Simone,
Apologies for the delay in addressing this bug. It has stumped me for a while because that test case was quite large. Finally I found the issue: the reference FASTA file that you provided contains IUPAC nucleotides:
...caaTGGGCTTTCCATATATGGCCTTAATTATGTTGAGGTAGTTTCCTTCTGTTCCTAGTTTGGTGRATGTTTTTTATCAT.....
and BCALM doesn't handle them so well. It replaced the 'R' with a 'C', and shouldn't have done so.
from bcalm.
thanks again for the detailed bug report!
from bcalm.
Related Issues (20)
- Running bcalm raises error: `cannot create a union-find data structure, too many elements.` HOT 6
- error: 'predecessors' following the 'template' keyword does not refer to a template HOT 8
- Error: libc++abi.dylib: terminating with uncaught exception of type std::out_of_range: basic_string HOT 5
- Compiling bcalm causes trouble when no .git repository is found HOT 1
- Output of bcalm2 similar to minia 3.2.1 unitigs? HOT 2
- Memory error with many reference genomes as input HOT 16
- High frequency of nodes with degree of two with length = 2k-1 HOT 2
- .fa files contain lowercase `km` instead of the documented uppercase `KM` HOT 3
- bcalm 2.2.1 on conda for Mac OS X produces incorrect cDBG (the zombie bug returns!!) HOT 12
- Add better input parsing HOT 1
- Improve error message for bad fasta input HOT 1
- Apparently bcalm crashes with very large k's (4096) HOT 1
- bcalm -version return code 1 to shell
- Glue is too strong, because glue files are not deleted.
- insert unitig disregarding the reverse complement HOT 1
- Option `-out-dir` is not used ? HOT 1
- bug report on a metagenomics dataset HOT 3
- Add header description with `-all-abundance-counts` HOT 2
- Is the output file reproducible? HOT 2
- Set the minimum value of -kmer-size HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bcalm.