Code Monkey home page Code Monkey logo

Comments (9)

niemasd avatar niemasd commented on July 26, 2024

Are you using cat? The files are gzipped, so you need to use zcat (or equivalently gunzip -c if you're on Mac and don't have zcat), not cat

from favites.

niemasd avatar niemasd commented on July 26, 2024

Never mind, I found the directory you're working in (FAVITES/output/set7/output_7_1 within your home folder), and I see that the files are already unzipped. I'll investigate

from favites.

smazrouee avatar smazrouee commented on July 26, 2024

from favites.

niemasd avatar niemasd commented on July 26, 2024

I was able to reproduce the error. It seems like an issue with merging the trees. I'll investigate and roll out an update to FAVITES when it's fixed

from favites.

niemasd avatar niemasd commented on July 26, 2024

Caught the issue: I forgot to update the merge function after gzipping the trees, so it was looking for files with extension .tre instead of .tre.gz. Fixing and will roll out in update later today

from favites.

niemasd avatar niemasd commented on July 26, 2024

Fixed in commit: 12283fb

Building updated Docker image

from favites.

smazrouee avatar smazrouee commented on July 26, 2024

from favites.

niemasd avatar niemasd commented on July 26, 2024

You can either switch to FAVITES version 1.1.3 (which now has the merging bug fixed), or you can use the method I described in my recent email to you to manually merge the trees. Switching versions will be easier, but it will take more compute time.

from favites.

niemasd avatar niemasd commented on July 26, 2024

Reproducing my "manually merge trees" solution here so I can close this ticket:

While I'm fixing the bug in merging trees, here's a way you can manually merge them instead of having to rerun. First, the relevant files:

  • The seed tree can be found in OUTPUT/seed_sequences/seed.txt
  • For each leaf in the seed tree, the corresponding sequence can be found in OUTPUT/seed_sequences/seqgen.out
  • For each leaf sequence, the file OUTPUT/seed_sequences/seed_sequences.tsv.gz has the corresponding N# root of the individual trees
  • The corresponding N# SeqGen files (which include the individual trees) can be found in OUTPUT/SeqGen_output

To do the merge, you should be able to do the following:

  1. Load the seed tree (OUTPUT/seed_sequences/seed.txt)
  2. Map each leaf in the seed tree to an N# label (seed tree -> sequence from OUTPUT/seed_sequences/seqgen.out -> N# label from OUTPUT/seed_sequences/seed_sequences.tsv.gz)
  3. For each N# label, load the corresponding tree (OUTPUT/SeqGen_output) and set the corresponding leaf in the seed tree to the root of the N# tree

from favites.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.