Comments (6)
Hi, Chris,
Could you please tell me the number of samples of your dataset? It seems that there are no quality bins generated using the coverage information only, which makes it fail to finish the second-stage ensemble process. As stated in the manuscript (https://www.biorxiv.org/content/10.1101/2021.07.25.453671v1.full.pdf), some component binning results for integration are generated using coverage information alone as features, and we recommend applying MetaBinner to multi-sample datasets.
MetaWRAP is a good choice for small-scale datasets or datasets with few samples. Or maybe the ensemble results generated by the first-stage ensemble process of MetaBinner using the combined features is fine. You can find them in "../metabinner_res/ensemble_res/X_t_logtrans_2postprocess/greedy_cont_weight_3_mincomp_50.0_maxcont_15.0_bins".
Thanks for your report, and we will improve the error reminder. Please feel free to let us know if you have any further questions.
Best wishes,
Ziye
from metabinner.
Hi Ziye,
Thanks for the quick reply! It was only one little mock community I was messing around with, so that will explain it then.
Thanks again for the help!
Chris
from metabinner.
Hey, I'm running into the same problem, but was wondering: is there a minimum sample size you recommend? Or a lower limit beyond which you recommend switching to a program/pipeline?
from metabinner.
Hi. On my side MetaBinner performs very poorly on low diversity metagenomes.
Additionally to previous comments, how do you create multi-samples dataset? Are we supposed to merge bamfiles ? This is very painful to do.
from metabinner.
Hey, I'm running into the same problem, but was wondering: is there a minimum sample size you recommend? Or a lower limit beyond which you recommend switching to a program/pipeline?
Hi,
I recommend running MetaBinner on the datasets with no less than ten samples, but there isn't an apparent minimum sample size. The performance may be influenced by the sample size, the datasets' complexity, the assembly's quality (e.g. assembly contiguity), and so on. Sorry for not replying in time.
Best,
Ziye
from metabinner.
Hi. On my side MetaBinner performs very poorly on low diversity metagenomes.
Additionally to previous comments, how do you create multi-samples dataset? Are we supposed to merge bamfiles ? This is very painful to do.
Hi,
The binner was not developed to handle the low diversity metagenomes; understandably, it didn't perform well on the low diversity metagenomes. But we hope that MetaBinner will also perform well on such datasets. If the low diversity metagenomes you used are publicly available, could you please send us the link to the metagenomes and let us figure it out. Thanks very much.
We align the reads from each sequencing sample against the assembly file to generate multiple bam files, calculate depth (coverage) for each bam file and merge the outputs. The process is similar to that of most binners.
Best,
Ziye
from metabinner.
Related Issues (20)
- something wrong in the script - Filter_tooshort.py HOT 2
- Two bugs to fix for preparing data HOT 2
- A detail description of the final output HOT 3
- error in component_binning.py HOT 3
- Final bin directory? HOT 2
- Something went wrong with running split_hhbins.py. HOT 3
- error in subscript of component_binning.py HOT 4
- Work in eukaryotic organisms? HOT 1
- make gen_coverage_file accept compressed files HOT 2
- SafetyError - Incorrect size HOT 1
- custom temporary path is needed HOT 2
- Input file does not exists HOT 4
- Input contains NaN, infinity or a value too large for dtype('float64') in Component Binning Step HOT 1
- no pplacer available for mac os HOT 1
- How to set the tmp dir of metabinner? HOT 2
- Differences between SolidBin and MetaBinner HOT 2
- A bug in gen_kmer.py HOT 3
- run_FragGeneScan.pl error - Hmmsearch failed HOT 2
- how to bin the contigs combined from different samples HOT 1
- markerCMD failed HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from metabinner.