Comments (4)
Hi,
No, this is not something totally unexpected. SyRI tries to compare different combinations in which alignments can be selected to represent rearrangements. So, if a large rearrangement is represented by many small alignments, then the number of combinations increases, increasing the runtime. What affects runtime is not the number of duplications/translocations but the number of alignments describing them. Allowing larger gaps while doing whole-genome alignment, could provide few longer alignments solving this issue.
That being said, from the mummerplot, do these regions look like 'inversions' (regions inverted at their own loci) or 'inverted translocations/duplication' (inverted and relocated to a different loci)? I ask this because, the function for finding inversions was recently optimised, so if you are using the newer version of SyRI and the region is indeed an inversion, then this problem should not be there. However, the function for finding translocations and duplications is still slow for these cases. Would it be possible to share the syri.log file to see which step is taking longer?
from syri.
Thanks for the quick reply! The info is certainly useful. I've attached the log files (thankfully, runB as finished some time overnight). I've attached the log files and, for your reference the mummerplots.
-----Edit-----
The axis between the mummerplots are switched, if it makes a difference.
from syri.
Hi! Thanks for sharing the log files and the mummerplots. SyRI is indeed taking time in identifying inversions. I did not expect that as function for inversion identification should be quite fast now. But, it seems that it would need to be optimised further.
from syri.
No problem. At least I know the software is running properly. If only I worked on yeast or something similar! Best of luck with optimizing the software (if you plan on going that route).
from syri.
Related Issues (20)
- re-arragement between two Eukaryotic genomes HOT 1
- different names of sequences HOT 1
- Merge vcf from different sample HOT 1
- ERROR - Numerical chromosome id HOT 1
- Issue with alignment HOT 2
- Inversion disappears after running chroder HOT 3
- high fraction of inverted alignments HOT 2
- Installation error with conda create HOT 8
- KeyError thrown when running example with v1.6.3 HOT 6
- Import Error HOT 1
- the parameter --reg can not show Translocation HOT 1
- The question with "Unequal number of chromosomes in the genomes" error HOT 1
- Issue of choice of aligner for whole genome alignment of two set of genomes HOT 2
- SAM reader - ERROR - Incorrect CIGAR string found. CIGAR string can only have I/D/H/S/X/=. CIGAR STRING: CM028859.1 HOT 2
- Cython package not detected HOT 3
- Erro in reading BAM file. reference_id -1 out of range HOT 1
- Synteny differs between same set of assemblies HOT 5
- Genome specific sequences HOT 3
- Chromosomes IDs do not match HOT 2
- IndexError: list index out of range (but no error when swapping the reference and query) HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from syri.