jsa-aerial / aerobio Goto Github PK
View Code? Open in Web Editor NEWExtensible full DAG streaming computation server with services and jobs for RNA-Seq, Tn-Seq, WG-Seq and Term-Seq.
License: MIT License
Extensible full DAG streaming computation server with services and jobs for RNA-Seq, Tn-Seq, WG-Seq and Term-Seq.
License: MIT License
calc_fitness and aggregate both use gbk parsing to determine annotations but this has two problems:
Using GTFs:
Sample names and IDs in SampleSheet [Data] section must not contain underbars _
, as they are the delimiter used by the conversion software (bcl2fastq and bcl-convert) to demarcate sequencer conversion information - NOT sample names/ids. This can result in 'missing input fastqs' during experiment barcode demultiplexing.
Hi, I'm trying to see if I could install the package on my MacBook pro. It looks like the config.clj is missing? I wonder if you have it or it's not necessary for running? Thank you!
TItle pretty much says it all. This is both for performance and maintenance reasons
pgmgraph and friends should be separate library!! Additionally tool and job db, create, delete, update should be part of this and removed from server (which is a crazy place for it)
Per page 9 of cookbook, this needs to be accounted for by including missing expansion factor fields. The validator should check and indicate error if this is not done.
From Stephen:
I am running some more samples for Eddie and one feature I would like to request is that the email sent saying a phase was successful should have the EID with it. I'm running some of the plasmids in parallel and its not clear which sample is which by the email
Looks like the replicate IDs can only be a single character, which limits the number of possible replicates in an experiment. For large (e.g. human population) studies there might be hundreds of samples in one group, so it would be nice to have replicate ID's that can be more than a single character. A workaround for now is to do things in batches of 26 and name replicates a-z.
Make sure that no replicate names repeat in Exp-SampleSheet.csv. This is for RNASeq,TNSeq and TermSeq.
Make sure that SampleName/IDs do not repeat in SampleSheet. This is because bcl2fastq will catch this but it has no good way of relaying the error to Aerobio...
Is it plausible to do a good job of generating the various experiment description sheets. See #4 as supporting information
Would be great to also have the option of providing fasta+gtf OR genbank file (would have to be converted to fasta/gtf first)
While issue #2 speaks to the need of splitting this off into a separate lib which would then be a dependency, this issue is a marker indicating the need to rewrite the (rather ugly) custom graph rewriting code to use Specter. Specter has vastly cleaner general purpose rewrite code for all manner of nested datastructures, including graphs, trees, maps, and all manner of nesting. This is exactly what the flow graph recursive rewriting code needs.
Generally, running with replicates enabled is what is typically needed the most. Combination runs are less likely to be used / desired - though they can still happen. So, change the default behavior to be replicate oriented while allowing explicit replicates
modifier for backward compatibility and adding new combined
modifier to indicate that a combination run is desired.
Simplest way would be to put a strm-take between get-comparison-files and breseq-runs. Then stagger launch the number given to spread alignments out.
Currently, status reports in progress output with a [done: [list of nodes that have finished]]. The 'done' is confusing and would be more accurate to be run progress
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.