Comments (7)
The plan is to use PSCBS to perform segmentation of both log2 copy ratios and allele frequencies if a VCF file of SNVs is provided. If the VCF contains a paired normal/control sample in addition to the tumor/test sample, take advantage of that.
Emit a .cns file with extra columns indicating allele-specific values.
from cnvkit.
@chapmanb I saw you added a similar feature in bcbio for BubbleTree, but segmenting on allele frequencies instead of rather than in addition to total copy number. It looks like this would be generally useful to have in CNVkit. Am I reading that right? If so, mind if I borrow the code and expose it through CNVkit's segment
command?
from cnvkit.
Eric;
You're absolutely welcome to any of my code that's of use. However the BubbleTree work is not well tested right now and needs work to get up to date with the latest BubbleTree development version so it might not be the best time to grab it. I'm going to try and reboot the heterogeneity analysis work in bcbio and can give you a heads up when this is better tested and integrated.
from cnvkit.
Loosely relevant to this issue, here's a paper where PyClone was used with CNVkit to identify/examine subclones. I'll have to read this paper more carefully and maybe ask the lead author about the experience.
from cnvkit.
Would be very interested if segment could work with allele frequencies solely. Is there a way to do this?
As implemented--including an extra column of the average b-allele frequency of a copy number segment--is worthless because copy neutral loss of heterozygosity can drive many dramatic changes in baf within a single copy number segment.
from cnvkit.
You could probably manage it through the cnvlib
API -- see the implementation of cnvlib.segment.__init__
where the variants array is used, it's just a few lines to run segmentation on the SNP allele frequencies once you have loaded them via cnvlib.cmdutil.load_het_snps
.
Once you've segmented your read depth ratios and/or SNP b-allele frequencies, use the call
command to infer integer total copy number and allele-specific copy number (the cn
, cn1
and cn2
columns, respectively). Uniparental disomy shows up in the latter columns but not the former.
from cnvkit.
@etal OK thanks. I got the variants loaded through the API, but can you give another hint on how to call which method in segments/init to segment on just baf?
from cnvkit.
Related Issues (20)
- Interpretation of WGS data (epilepsy): Where should I commence? HOT 5
- CNVkit on WGBS data
- difference between batch bams vs individual bam
- scatter plot
- Correct way to get SV BED/VCF HOT 2
- CNV search in exomes
- diagram bars size
- AttributeError: module 'pomegranate' has no attribute 'NormalDistribution' in hmm-germline HOT 2
- Error while indexing BAM files
- Cnvkit call and cnvkit genemetrics_differences + scatter plot
- Convert SEG format and do CNVkit to analysis ?
- Gender Inference in Reference Step
- Update included references for hg38 HOT 1
- What are the specific steps to run for cnvkit.py batch -m wgs?
- How to normalise different sequencing coverage samples in CNVKit?
- For paired-WGS, is it necessary step of markduplicates by PICARD? HOT 1
- `import-rna` not compatible with pandas 2 HOT 1
- Can the results of CNVKIT be applied to clinical clinical?
- UnicodeDecodeError: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte` HOT 1
- negative number in cn column of *call.cns
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cnvkit.