Code Monkey home page Code Monkey logo

Comments (6)

ziyewang avatar ziyewang commented on June 28, 2024 1

Hi, Chris,

Could you please tell me the number of samples of your dataset? It seems that there are no quality bins generated using the coverage information only, which makes it fail to finish the second-stage ensemble process. As stated in the manuscript (https://www.biorxiv.org/content/10.1101/2021.07.25.453671v1.full.pdf), some component binning results for integration are generated using coverage information alone as features, and we recommend applying MetaBinner to multi-sample datasets.
MetaWRAP is a good choice for small-scale datasets or datasets with few samples. Or maybe the ensemble results generated by the first-stage ensemble process of MetaBinner using the combined features is fine. You can find them in "../metabinner_res/ensemble_res/X_t_logtrans_2postprocess/greedy_cont_weight_3_mincomp_50.0_maxcont_15.0_bins".

Thanks for your report, and we will improve the error reminder. Please feel free to let us know if you have any further questions.

Best wishes,
Ziye

from metabinner.

Chrisjrt avatar Chrisjrt commented on June 28, 2024

Hi Ziye,

Thanks for the quick reply! It was only one little mock community I was messing around with, so that will explain it then.

Thanks again for the help!

Chris

from metabinner.

ThijsSt avatar ThijsSt commented on June 28, 2024

Hey, I'm running into the same problem, but was wondering: is there a minimum sample size you recommend? Or a lower limit beyond which you recommend switching to a program/pipeline?

from metabinner.

jsgounot avatar jsgounot commented on June 28, 2024

Hi. On my side MetaBinner performs very poorly on low diversity metagenomes.
Additionally to previous comments, how do you create multi-samples dataset? Are we supposed to merge bamfiles ? This is very painful to do.

from metabinner.

ziyewang avatar ziyewang commented on June 28, 2024

Hey, I'm running into the same problem, but was wondering: is there a minimum sample size you recommend? Or a lower limit beyond which you recommend switching to a program/pipeline?

Hi,

I recommend running MetaBinner on the datasets with no less than ten samples, but there isn't an apparent minimum sample size. The performance may be influenced by the sample size, the datasets' complexity, the assembly's quality (e.g. assembly contiguity), and so on. Sorry for not replying in time.

Best,
Ziye

from metabinner.

ziyewang avatar ziyewang commented on June 28, 2024

Hi. On my side MetaBinner performs very poorly on low diversity metagenomes.
Additionally to previous comments, how do you create multi-samples dataset? Are we supposed to merge bamfiles ? This is very painful to do.

Hi,

The binner was not developed to handle the low diversity metagenomes; understandably, it didn't perform well on the low diversity metagenomes. But we hope that MetaBinner will also perform well on such datasets. If the low diversity metagenomes you used are publicly available, could you please send us the link to the metagenomes and let us figure it out. Thanks very much.

We align the reads from each sequencing sample against the assembly file to generate multiple bam files, calculate depth (coverage) for each bam file and merge the outputs. The process is similar to that of most binners.

Best,
Ziye

from metabinner.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.