Comments (2)
I;ve reorganized the directory structure, to include the folder analysis_pipelines under lib. This replaces the data_type folder. Under this are subfodlers for the genetic _alteration_types listed above. Each holds a python script that executes a pipeline for processing the data. Under data_type there were distinctly named files (eg. mutation_data.py) that contained shared support functions used by each of the pipelines. This was changed to support_functions.py. To implement these changes, the study config file was modified to include a separate analysis column, which specified the genetic_alteration_type. The file still contains a type column which is the datatype. The code contains an object of class CONFIG that was modified to handle this, and it is from this object that the path to the pipeline script is located. I've tested this for MUTATION_EXTENDED, using the Mutect pipeline. All the MUTATION_EXTENDED pipelines were modified, and should be working now. Other Genetic_alteration_types still need to be fixed. I'll set up some additional tests to ensure these are working okay
from cbioportal_tools.
I am setting up test datasets for each pipeline. As modifications to the code are made, I'll test each of the pipelines for successful creation of an import folder. If any failures, I'll open up an issue to address it in the code.
Closing this issue, will open as a separate issue to develop this further as I think some additional code reorganization would benefit being able to extend the tool with new pipelines.
from cbioportal_tools.
Related Issues (20)
- Consolidate test data files?
- Tests for janus.py command line script HOT 1
- Investigate mutation test warnings
- Intermittent CAP mutation test error
- Refactor calls to shell
- Inconsistent _concat.txt suffix in output
- Cross-check sample names in config
- Replace get_tgca.r HOT 1
- Clean up preProcCNA.r HOT 1
- Dependency (if any) between pipeline config files HOT 1
- Fix or suppress numpy warning at runtime
- Move deprecated code into a "legacy" package
- Document & validate study config HOT 4
- Reinstate "colours" for cancer type HOT 1
- Add vcf2maf dependency in janus module
- Get fasta reference from environment module HOT 1
- Environment variables for mutation pipeline HOT 3
- Make mutation tests faster
- Non-CAP mutation scripts do not generate metadata
- Ensure pipeline R scripts and data files are installed by setup.py HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cbioportal_tools.