Comments (5)
I think pushing the version is the right solution. Somebody just needs to
find time to work on this...
-- jt
On Fri, Jul 29, 2016 at 12:41 PM, Daniel Blankenberg <
[email protected]> wrote:
One example is the hg38 projected chr1 multiz100way from UCSC:
http://hgdownload.soe.ucsc.edu/goldenPath/hg38/multiz100way/maf/chr1.maf.gz
(5.9G gz'd, 66G gunzip'd)maf_build_index.py chr1.maf
Traceback (most recent call last):
File "/path_to/bin/maf_build_index.py", line 83, in
if name == "main": main()
File "/path_to/bin/maf_build_index.py", line 80, in main
indexes.write( out )
File "/path_to/lib/python2.7/site-packages/bx/interval_index_file.py", line 332, in write
write_packed( f, ">I", base )
File "/path_to/lib/python2.7/site-packages/bx/interval_index_file.py", line 463, in write_packed
f.write( pack( pattern, *vals ) )
struct.error: 'I' format requires 0 <= number <= 4294967295One possibility is to up the version number and store unsigned integers as
unsigned long long >Q, which would max out at 18446744073709551615 vs
4294967295. Would double the packed size though.Another potential workaround could be to break the MAF up into multiple
files, but I haven't tested this.xref: https://biostar.usegalaxy.org/p/18196/
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#8, or mute the thread
https://github.com/notifications/unsubscribe-auth/AAE4ZSWKFE4vERJLatl6XcOwXGIzEhg3ks5qai1QgaJpZM4JYXhp
.
from bx-python.
Thanks for looking into this.
This issue have been bothering us for several weeks. Hope to have a solution to this problem soon.
from bx-python.
Any progress on this?
I just ran into what seems to be the same problem on a large MAF file (48Gb):
(base) /mnt/e/genemod/better_dNdS_models/drosophila/11_6_2020/cactus_work$ python /home/jodyhey/miniconda3/bin/maf_build_index.py drosophila_cactus.maf drosophila_cactus.mafindex
Traceback (most recent call last):
File "/home/jodyhey/miniconda3/bin/maf_build_index.py", line 82, in
main()
File "/home/jodyhey/miniconda3/bin/maf_build_index.py", line 77, in main
indexes.write(out)
File "/home/jodyhey/miniconda3/lib/python3.8/site-packages/bx/interval_index_file.py", line 351, in write
write_packed(f, ">I", base)
File "/home/jodyhey/miniconda3/lib/python3.8/site-packages/bx/interval_index_file.py", line 486, in write_packed
f.write(pack(pattern, *vals))
struct.error: 'I' format requires 0 <= number <= 4294967295
from bx-python.
@jodyhey No one is working on this issue, sorry, but pull requests are welcome!
from bx-python.
Is there any update on this issue? with how .maf files have gotten bigger lately, this might become a more common issue.
here I ran the script on a 30 Gb .lzo file (85 Gb uncompressed)
python3 maf_index.py
Traceback (most recent call last):
File "maf_index.py", line 75, in <module>
main()
File "maf_index.py", line 70, in main
indexes.write(out)
File "/home/pc575/jupyter-env-icelake/lib/python3.7/site-packages/bx/interval_index_file.py", line 351, in write
write_packed(f, ">I", base)
File "/home/pc575/jupyter-env-icelake/lib/python3.7/site-packages/bx/interval_index_file.py", line 486, in write_packed
f.write(pack(pattern, *vals))
struct.error: 'I' format requires 0 <= number <= 4294967295
from bx-python.
Related Issues (20)
- Incompatible types str and byte under python3 HOT 1
- Wheel for Python 3.8 HOT 2
- StopIteration transformed to RuntimeError on python >= 3.7
- No source tarballs for the release 0.8.8 on the PyPI website HOT 1
- error can't found bx.intervals module
- get on bx.align.maf.MAFIndexedAccess results in wrong alignment blocks HOT 4
- Importing ABC directly from collections was deprecated and will be removed in Python 3.10. Use collections.abc
- time.clock has been removed in Python 3.8
- ImportError: undefined symbol: PyUnicodeUCS2_FromStringAndSize HOT 1
- binned_bitsets_from_list errors when chromosome size is larger than set MAX (512M) HOT 1
- Got wrong positions on chomped query maf subset HOT 2
- How to dealing with maf files compressed with bzip2 or lzop ? HOT 4
- Failed to load seekbzip2 module when dealing with bz2 files HOT 1
- undefined symbol: PyUnicodeUCS2_FromStringAndSize HOT 3
- 0.8.11 Doesn't install some built shared libraries HOT 8
- documentation? HOT 3
- Add support to release linux aarch64 wheels HOT 1
- multiple sequence alignment
- upgrading bx-python to newer versions of python HOT 1
- ERROR Missing dependencies: oldest-supported-numpy HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bx-python.