Code Monkey home page Code Monkey logo

arundinariasnp's People

Contributors

rajewski avatar

Stargazers

 avatar

Watchers

 avatar

arundinariasnp's Issues

Update paths

Problem

Paths in scripts are relative to the UC Riverside HPC directories. This will break anywhere else

Solution

re-write to be relative to the repo

Organize Directories

Problem

The current directory structure is mostly a historical hodge-lodge for when I was writing this up during my PhD. The locations of inputs, outputs, scripts, references, etc is just whatever was convenient at the time, but it has no objective structure to an outside

Solution

  • Create an organized set of directories including:
    • ./References to capture Picard and boa refs along with tables of samples names, etc
    • ./FASTQ, but we should pull directly from NCBI instead
    • ./Logs to replace ./history, and also not include these in git
    • ./Results
    • ./Results/Phylo to capture phylogenetic results (.nex, .phy.) from MrBayes, RAxML, jModelTest, SplitsTree, etc
    • ./Results/SNP to capture results from GATK, WhatsHap, SpeedSeq, bcftools, etc
    • ./Figures to contain both intermediate plots and manuscript figures
    • others?
  • Change paths, see #2

Pull from SRA

Problem

The FASTQ files in this repo simply take up space since they are backed uploaded to SRA.

Solution

Add a script to pull these files to a local directory. The files will need to either be renamed to match the old names from Purdue, or (better) the scripts changed so that they reference standard file names

Update README

Follow the #5 PR a lot of the points in the README have changed. I had added some small news updates to the top, but it needs a re-write

Docker

Problem

All of the scripts here depend on the UC Riverside HPCC infrastructure in terms of install software like MrBayes, GATK, etc. In addition there are a number of rather off-the-map R packages.

From a reproducibility standpoint, it would be nice to wrap all of this into a single docker container with all dependencies pre-installed.

Solution

Create a Dockerfile to include in the repo that specifies how to build a container to house the analysis. Subsequently, this container should be built and made publicly available in Dockerhub.

Additional considerations

Inside the scripts, file paths will need to be changed. Submission scripts will need to either be removed or reworked to call Docker instead. Removal is probably better since most HPC environments would require Singularity over docker and the submission itself is likely to be very environment-specific (Slurm vs PBS, etc)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.