niu-lab / gclust Goto Github PK
View Code? Open in Web Editor NEWgenome sized sequences clustering
License: Apache License 2.0
genome sized sequences clustering
License: Apache License 2.0
Thanks for your wonderful tool !
My problem is
if there are the parameters related with the alignment coverage.
For example,
Just like the picture shows, the query genome and target genome share 99.46% identity but only 84% coverage. When I set the "-memiden" to 99, they will be assigned to the same cluster....
So, if there are some parameters about the "coverage" threashold filtering?
In my experiment ,there are 2 highly similar genome, their identity and coverage is displayed as below picture:
However, when I set the "-memiden" to 99, they are assigned to different clusters, that really makes me confused...I am not sure what's going on...
(All the alignment in the picture is done by the online megablast alignment tool.)
Dear developers,
Thank you for the useful good tool. I have followed your instructions manual, however, by running gclust exactly as you did, no output file with the actual clusters nucleotide sequence is produced. Only a list of the clusters with the genomes/contigs in each.
It would be a much better and easy to use tool, if will produce a similar output like cd-hit does, with a representative clusters fasta file.
Thank you and best regards
Vadim
Heya great program really convenient. Just finished using it. Just wanted to note that you might want to update the line in the README on how to find representatives as I think it points to an old file. Should be:
make -f gclust/script/makefile_createreps
instead of
make -f gclust/script/makefile
Thank you for writing this!
Hi.
The examples show how the input file is one fasta file. I have several files I need to cluster, is there a way to do this w/o changing the source code?
thanks.
-ricardo
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.