Comments (5)

MrOlm commented on August 16, 2024

Hello,

This problem is a result of the .stb file used in the profile step being different from the one used in the compare step. Specifically, if you look at the genome_info.tsv file of your sample "HeP-1057-10", you will find the genome "22903", but this genome is not in the provided .stb file.
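The check described above can be sketched in a few lines of Python. This is only an illustration, not part of inStrain: the function names are hypothetical, the genome_info.tsv file is assumed to be tab-separated with a header column named "genome", and the .stb file is assumed to have two tab-separated columns (scaffold, then bin).

```python
import csv


def read_stb_bins(stb_path):
    """Return the set of bin names found in the second column of a .stb file."""
    bins = set()
    with open(stb_path, newline="") as handle:
        for row in csv.reader(handle, delimiter="\t"):
            if len(row) >= 2:
                bins.add(row[1].strip())
    return bins


def read_profile_genomes(genome_info_path, column="genome"):
    """Return the genome names listed in a genome_info.tsv file.

    The column name "genome" is an assumption; adjust it if the header differs.
    """
    with open(genome_info_path, newline="") as handle:
        reader = csv.DictReader(handle, delimiter="\t")
        return {row[column].strip() for row in reader}


def missing_genomes(genome_info_path, stb_path):
    """List genomes present in the profile output but absent from the .stb file."""
    profiled = read_profile_genomes(genome_info_path)
    return sorted(profiled - read_stb_bins(stb_path))
```

Calling missing_genomes() with the paths to a sample's genome_info.tsv and to the .stb file returns a sorted list; an empty list would mean every profiled genome has an entry in the .stb file. All names are compared as text here, so this check by itself would not reveal a number/string mismatch inside another program.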

Best,
Matt

from instrain.

Sanrrone commented on August 16, 2024

Indeed, '22903' is in the genome_info file. However, the .stb file (hg.tsv in my case) is the same in both executions (profile and compare).

$ grep -w 22903 /scratch/project_2007362/software/HumGutDB/hg.tsv
kraken:taxid|3020030|HumGut_20030_1	22903

So I do not understand why the error occurs. Is it because I am using numbers instead of strings in the bin column?

Just in case, my parameters are:
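To make the numbers-versus-strings hypothesis concrete, here is a small self-contained illustration in plain Python. It is not inStrain's code: infer_type is a hypothetical stand-in for any table reader that converts numeric-looking text into integers, and whether inStrain does this internally is an assumption.

```python
def infer_type(value):
    """Mimic a table reader that turns numeric-looking text into integers."""
    return int(value) if value.isdigit() else value


# Bin names as they appear on disk in the .stb file (always text).
stb_bins = {"22903"}

# The same name after passing through a reader with type inference.
loaded_genome = infer_type("22903")

print(loaded_genome in stb_bins)           # False: 22903 (int) != "22903" (str)
print(str(loaded_genome) in stb_bins)      # True once both sides are strings
print(infer_type("a22903") in {"a22903"})  # True: a prefixed name stays a string
```

If one code path keeps the bin name as text while another converts it to a number, a lookup like the first one fails even though grep finds the name in the file.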

#profile
inStrain profile --use_full_fasta_header -p $c -c 7 --min_scaffold_reads 7 -s $new/software/HumGutDB/hg.tsv --skip_plot_generation -o ${sname} $bam $new/software/HumGutDB/hg.fasta --database_mode

#compare
inStrain compare -o ${hep}_compare --skip_plot_generation -p $c -s $new/software/HumGutDB/hg.tsv -i $samples --database_mode -d

MrOlm commented on August 16, 2024

Hello,

Urg, I do worry it might be due to the numbers instead of strings for bin names. I thought I fixed this a few years ago, but it's possible that I only fixed it for profile and not compare.

If you could please confirm that you're running the most recent version of inStrain, that would be ideal. If so, this is likely a number / string problem that I need to fix. As a workaround, adding a letter to your bin names (even just an "a" in front of all of them) should fix the issue.
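A minimal sketch of that workaround, assuming the .stb file is a plain two-column, tab-separated text file (scaffold, then bin). The function name prefix_bin_names and the output file name used below are hypothetical.

```python
def prefix_bin_names(stb_in, stb_out, prefix="a"):
    """Copy a .stb file, adding `prefix` to every bin name (second column)."""
    with open(stb_in) as src, open(stb_out, "w") as dst:
        for line in src:
            fields = line.rstrip("\n").split("\t")
            if len(fields) >= 2:
                # Only the bin column changes; scaffold names must stay as-is
                # so they still match the FASTA headers.
                fields[1] = prefix + fields[1]
            dst.write("\t".join(fields) + "\n")
```

For example, prefix_bin_names("hg.tsv", "hg_prefixed.tsv") would turn the bin "22903" into "a22903". Since the first reply points to a mismatch between the .stb files used in profile and compare, the renamed file would presumably need to be passed with -s in both steps.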

Apologies,
Matt

Sanrrone commented on August 16, 2024

I installed it via conda

inStrain -h

                ...::: inStrain v1.8.0 :::...
   
  Matt Olm and Alex Crits-Christoph. MIT License. Banfield Lab, UC Berkeley.

  Choose one of the operations below for more detailed help. See https://instrain.readthedocs.io for documentation.
  Example: inStrain profile -h

  Main operations:
    profile           -> Create an inStrain profile (microdiversity analysis) from a mapping file
    compare            -> Compare multiple inStrain profiles (popANI, coverage_overlap, etc.)

  Auxiliary operations:
    check_deps        -> Print a list of dependencies, versions, and whether they're working
    parse_annotations -> Run a number of outputs based a table of gene annotations 
    quick_profile     -> Quickly calculate coverage and breadth of a mapping using coverM
    filter_reads      -> Commands related to filtering reads from .bam files
    plot              -> Make figures from the results of "profile" or "compare"
    other             -> Other miscellaneous operations

Best,
Sandro

Sanrrone commented on August 16, 2024

Just to close out the issue: it is solved by adding a non-numeric name to the bins, as you suggested.

Thank you!
