dizak / mothulity Goto Github PK
View Code? Open in Web Editor NEWEasy-to-use tool facilitating work with Mothur.
Home Page: https://dizak.github.io/mothulity
License: BSD 3-Clause "New" or "Revised" License
Easy-to-use tool facilitating work with Mothur.
Home Page: https://dizak.github.io/mothulity
License: BSD 3-Clause "New" or "Revised" License
mothulity_fc
does not fire up since argparse
import is missing.mothulity_fc
does not fire up since names_sanitizer
is not properly referenced with mut
mothulity
v1.6.0 with pip
.Should run the analysis
Throws the import error.
Software supports only paired-end reads analysis.
There's an ugly repetition of the menu at the project's page. It should be done once, using jekyll's _layouts
mothulity still tries to find beta-diversity-specific HTML files despite there's only one sample.
Run mothulity.py /path/to/files/ --render-html
to reproduce.
The threshold for defining what is a junk group and what is not should be made upon SD rather than plain mean division by an arbitrary number.
mothulity
know each step of analysis from each other. This might be achieved with few steps:
mothur
commands, instead of keeping them in a plaing text in the template. The question is: to what extent the logic from templates should be removed?mothulity
, it should know how the files would be named after each step. If it's done - resolving #9 is straightforward.mothulity
already creates the hidden logfile. It can be used as the cache.It would be convenient to enable email notification using mothulity.py arguments and not to bother with headnode_notifier itself.
The output example is now reference at the end of the brief tutorial. It should be also accessible from the top menu
Venn diagrams for species overlap between samples feature
mothulity should be added to bioconda. The deployment should be managed by travis-CI
There is a terrible mess introduced into INSTALL.sh
when travis CI was attached to the repo.
PathTests must include if mothur
and blast
executables can be found after installation.
Software fails to deliver proper final results/visualization of samples with high biodiversity. During converting samples taxonomy (tax.summary obtained from mothur) to xml (needed for Kroma) part of the data is lost.
Software performs analysis and produces a full report including proper krona charts for taxonomical representation of the observed OTUs.
Software fails to convert tax.summary to xml, providing a wrong taxonomical representation of the observed OTUs (many taxa are missing, part of the pie chart is described as "other bacteria").
mpi exec when --nodes opt > 1 and mem-per-cpu and processors and partition
Software does not support checkpointing and does not allow user to continue analysis from a last/given step.
--dry-run
arg.missing apostrophe at the end, unwanted slash before make.contigs command, messed up slurm flags newlines, pairwise.seqs command typo
If there no fastq files or for other reason the input directory is obviously wrong, everything tries and fails. It should be detected before mothur is called.
ab2a2de on branch UPD#64
doctests do not work after py
extension is removed from the python scripts. Also, these are the basic level math functions that would be suitable for that kind of tests.
python -m doctest
on any of the mothulity
scriptsPassing doctests
Failing doctest mostly due to undefined variables since the tested functions self-reference fails without py
extension.
Hey,
Would you like to provide users example data for testing your software possibilities? It could be also used to validate installation & usage processes.
Cheers.
Some of the mothur's commands are unable to read path with a dash. Mothulity should warn the user.
mothulity.py
on it.This is one of the commands unable to read the path with the dash.
filter.seqs(fasta=current, vertical=T, trump=.)
Using /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/mothur.job.trim.contigs.good.unique.good.fasta as input file for the fasta parameter.
Unable to open /home/dizak/Pulpit/MiSeq_SOP/temp/temp. Trying default /home/dizak/anaconda2/envs/mothulity/bin/temp
Unable to open /home/dizak/anaconda2/envs/mothulity/bin/temp. Trying output directory /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/temp
Unable to open /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/temp. It will be disregarded.
Unable to open temp/mothur.job.trim.contigs.good.unique.good.fasta. Trying default /home/dizak/anaconda2/envs/mothulity/bin/mothur.job.trim.contigs.good.unique.good.fasta
Unable to open /home/dizak/anaconda2/envs/mothulity/bin/mothur.job.trim.contigs.good.unique.good.fasta. Trying output directory /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/mothur.job.trim.contigs.good.unique.good.fasta
The analysis should have run normally with full output.
Mothulity_dbaser calls function download (for silva_102) with wrong argument names (download_path instead of download_directory).
Downloads silva_102 database.
Execution fails with nasty python error.
mothulity.py
should not require the positional argument in some cases, for instance, when setting the default database path.
There is no need for database path search when --analysis-only is invoked. It should be omitted.
The descriptions provided should be a little bit updated since adding mothulity to pypi
Adding path autocompletion for guided installation.
Create bash wraper for main mothulity python script, remove shebang edition from installation.
Would be really nice if readme could:
Databases (provided by mothulity_dbaser and used by mothulity in analysis) could be by default stored in mothulity directory.
Due to the limitations of acg & dcg methods with default parameters (in complex analysis many bacteria are assigned to "other bacteria" category i.e. proteobacteria), one must consider tuning the setup if the acg will remain the main clustering method.
The issue needs to be further discussed.
Evaluate whether CLI arguments are not impossible or out of logical range, eg max-length < min-length or cluster-method out of allowed values.
Software does not download database unite-ITS-02.
Download path: ./Unite_ITS_02.zip
Connecting...
Failed to establish connection. Response code 404
Execute the following command:
$ ./mothulity_dbaser.py --unite-ITS-02
Metadata should be handled for complete beta-diversity analysis.
Following sources should be supported:
mothulity.py --set-align-database-path
or mothulity.py --set-taxonomy-database-path
.The path supplied to mothulity.py --set-align-database-path
or mothulity.py --set-taxonomy-database-path
should be expanded to an absolute path.
The path supplied to mothulty.py --set-align-database-path
or mothulity.py --set-taxonomy-database-path
is passed literally to the config file.
When mothulity_dbaser.py fails to download one of the multiple databases, it gives an unwrapped error and quits prematurely without going to the next in the queue.
There is vsearch option available for chimeric reads search.
The dereplicate
param in the chimeric reads search might need investigation whether it is truly needed.
As the OTU-clustering part is the most computing power and time-consuming part of the analysis, there is an urgent need for using some alternative way of computing it.
INSTALL.sh
should provide a clean way of omitting the database download step. Right now it is possible using the -t 6
opt but it is not clear for the user. Also, there should be no break
just semicolons here:
Lines 70 to 75 in b12439f
no path is passed for unpacking so it works only for current directory
If the precision parameter in the OTU-clustering command is changed, the preprocessing part runs just fine, but the analysis part does not.
agc
clustering method.precision
parameter in the cluster command to cluster.split(fasta=current, count=current, taxonomy=current, cutoff=0.03, large=T, method=agc, precision=1000);
Once the preprocessing step is done, mothulity should run the analysis steps and output the final report in the HTML file.
The final report HTML is not generated. The shared (both full and subsampled) and tax.summary files are in the analysis/OTU/ directory. The alpha biodiversity part runs as expected, though the beta directory contains only the mothur's log files.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.