Code Monkey home page Code Monkey logo

mothulity's People

Contributors

dizak avatar setnea avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

mothulity's Issues

tutorial

f210c6c

Proposed change

Brief tutorial should be added to project's page

Dysfunctional mothulity_fc

f210c6c

Description

  1. mothulity_fc does not fire up since argparse import is missing.
  2. mothulity_fc does not fire up since names_sanitizer is not properly referenced with mut

Steps to reproduce

  1. Install mothulity v1.6.0 with pip.
  2. Run analysis.

Expected behaviour

Should run the analysis

Actual behaviour

Throws the import error.

Custom layout

b1a8511

Proposed change

There's an ugly repetition of the menu at the project's page. It should be done once, using jekyll's _layouts

define junk groups with SD

f4d4887

Proposed change

The threshold for defining what is a junk group and what is not should be made upon SD rather than plain mean division by an arbitrary number.

analysis step awarness

f4d4887

Proposed feature

mothulity know each step of analysis from each other. This might be achieved with few steps:

  • Enclosing particular mothur commands, instead of keeping them in a plaing text in the template. The question is: to what extent the logic from templates should be removed?
  • Once the command is passed to mothulity, it should know how the files would be named after each step. If it's done - resolving #9 is straightforward.
  • mothulity already creates the hidden logfile. It can be used as the cache.

python3

b12439f

Proposed change

Mothulity should be written in python3

pip

4328477

Proposed change

mothulity should be added to pypi. The deployment should be managed by travis-ci.

output example in page menu

ee44239

Proposed change

The output example is now reference at the end of the brief tutorial. It should be also accessible from the top menu

bioconda

4328477

Proposed change

mothulity should be added to bioconda. The deployment should be managed by travis-CI

Wrong results/final output #major

Description

Software fails to deliver proper final results/visualization of samples with high biodiversity. During converting samples taxonomy (tax.summary obtained from mothur) to xml (needed for Kroma) part of the data is lost.

Steps to reproduce

  1. Run analysis of samples with high biodiversity.

Expected behaviour

Software performs analysis and produces a full report including proper krona charts for taxonomical representation of the observed OTUs.

Actual behaviour

Software fails to convert tax.summary to xml, providing a wrong taxonomical representation of the observed OTUs (many taxa are missing, part of the pie chart is described as "other bacteria").

mpi

mpi exec when --nodes opt > 1 and mem-per-cpu and processors and partition

Checkpointing feature

Software does not support checkpointing and does not allow user to continue analysis from a last/given step.

Verify mothulity action

commit

3bc6bd9

Proposed change

  • doctests and unittests are not enough. mothulity actual actions should be checked during each build.
  • As it not feasible with restricted resources of travis, it should be done at least as a comparison with the script generated with --dry-run arg.
  • Once this is accomplished, run on MiSeq dataset requirement can be removed from the pull request template.

ITS_templ_str typos

missing apostrophe at the end, unwanted slash before make.contigs command, messed up slurm flags newlines, pairwise.seqs command typo

doctest to unittest

ab2a2de on branch UPD#64

Description

doctests do not work after py extension is removed from the python scripts. Also, these are the basic level math functions that would be suitable for that kind of tests.

Steps to reproduce

  1. Run python -m doctest on any of the mothulity scripts

Expected behaviour

Passing doctests

Actual behaviour

Failing doctest mostly due to undefined variables since the tested functions self-reference fails without py extension.

Example data for testing

Hey,
Would you like to provide users example data for testing your software possibilities? It could be also used to validate installation & usage processes.

Cheers.

Clean mothulity help

4328477

Proposed change

mothulity help page needs some review and refurb. Some args' help describe action:

mothulity/mothulity.py

Lines 469 to 486 in 4328477

settings.add_argument("--set-align-database-path",
action="store",
dest="set_align_database_path",
metavar="",
default=None,
help="Set persistent path to align database.")
settings.add_argument("--set-taxonomy-database-path",
action="store",
dest="set_taxonomy_database_path",
metavar="",
default=None,
help="Set persistent path to taxonomy database.")
settings.add_argument("--set-config-path",
action="store",
dest="set_config_path",
metavar="",
default=None,
help="Set temporary path to config file.")

while other describe objects:

mothulity/mothulity.py

Lines 218 to 223 in 4328477

parser.add_argument(action="store",
dest="files_directory",
metavar="path/to/files",
default=".",
help="input directory path. It is used as working\
directory for the job. Default CWD.")

Throw warning if dash in inputdir

b12439f

Description

Some of the mothur's commands are unable to read path with a dash. Mothulity should warn the user.

Steps to reproduce

  1. Create a project with a dash in its input path.
  2. Run mothulity.py on it.

Expected behaviour

This is one of the commands unable to read the path with the dash.

filter.seqs(fasta=current, vertical=T, trump=.)
Using /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/mothur.job.trim.contigs.good.unique.good.fasta as input file for the fasta parameter.
Unable to open /home/dizak/Pulpit/MiSeq_SOP/temp/temp. Trying default /home/dizak/anaconda2/envs/mothulity/bin/temp
Unable to open /home/dizak/anaconda2/envs/mothulity/bin/temp. Trying output directory /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/temp
Unable to open /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/temp. It will be disregarded.
Unable to open temp/mothur.job.trim.contigs.good.unique.good.fasta. Trying default /home/dizak/anaconda2/envs/mothulity/bin/mothur.job.trim.contigs.good.unique.good.fasta
Unable to open /home/dizak/anaconda2/envs/mothulity/bin/mothur.job.trim.contigs.good.unique.good.fasta. Trying output directory /home/dizak/Pulpit/MiSeq_SOP/temp/temp-temp/mothur.job.trim.contigs.good.unique.good.fasta

Actual behaviour

The analysis should have run normally with full output.

Wrong argument names (dbaser) #minor

Description

Mothulity_dbaser calls function download (for silva_102) with wrong argument names (download_path instead of download_directory).

Steps to reproduce

  1. Download silva_102 database i.e. during installation.

Expected behaviour

Downloads silva_102 database.

Actual behaviour

Execution fails with nasty python error.

Extensive documentation is missing

Would be really nice if readme could:

  • briefly introduce mothulity basic execution options,
  • provide example of usage i.e. based on MiSeqSOP dataset mentioned in "example data section".

Databases storing

Databases (provided by mothulity_dbaser and used by mothulity in analysis) could be by default stored in mothulity directory.

Cluster.split (acg & dcg methods default parameters)

Due to the limitations of acg & dcg methods with default parameters (in complex analysis many bacteria are assigned to "other bacteria" category i.e. proteobacteria), one must consider tuning the setup if the acg will remain the main clustering method.

The issue needs to be further discussed.

Evaluate CLI arguments

Evaluate whether CLI arguments are not impossible or out of logical range, eg max-length < min-length or cluster-method out of allowed values.

#Bug database download (unite-ITS-02)

Description

Software does not download database unite-ITS-02.

Download path: ./Unite_ITS_02.zip
Connecting...
Failed to establish connection. Response code 404

Steps to reproduce

Execute the following command:
$ ./mothulity_dbaser.py --unite-ITS-02

Meta-data

Metadata should be handled for complete beta-diversity analysis.

Following sources should be supported:

  • CSV
  • Excel
  • Google Sheets

abspath when setting database path in config

17bb386

Steps to reproduce

  1. Pass relative path to mothulity.py --set-align-database-path or mothulity.py --set-taxonomy-database-path.

Expected behavior

The path supplied to mothulity.py --set-align-database-path or mothulity.py --set-taxonomy-database-path should be expanded to an absolute path.

Actual behavior

The path supplied to mothulty.py --set-align-database-path or mothulity.py --set-taxonomy-database-path is passed literally to the config file.

pypi downloads badge

0dc2ce9

Proposed change

There's a cool website pepy that show the number of downloads from pypi and even gives a badge! It should be added to the project's page.

chimera.vsearch

7441da4

Proposed change

There is vsearch option available for chimeric reads search.

Proposed change

The dereplicate param in the chimeric reads search might need investigation whether it is truly needed.

OTU-clustering without mothur

As the OTU-clustering part is the most computing power and time-consuming part of the analysis, there is an urgent need for using some alternative way of computing it.

db download

no path is passed for unpacking so it works only for current directory

no report with clustering precision modified

f4d4887

Description

If the precision parameter in the OTU-clustering command is changed, the preprocessing part runs just fine, but the analysis part does not.

Steps to reproduce

  1. Dry-run preprocessing part with agc clustering method.
  2. Modify the precision parameter in the cluster command to cluster.split(fasta=current, count=current, taxonomy=current, cutoff=0.03, large=T, method=agc, precision=1000);
  3. Run the modified script.

Expected behaviour

Once the preprocessing step is done, mothulity should run the analysis steps and output the final report in the HTML file.

Actual behaviour

The final report HTML is not generated. The shared (both full and subsampled) and tax.summary files are in the analysis/OTU/ directory. The alpha biodiversity part runs as expected, though the beta directory contains only the mothur's log files.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.