Comments (10)
Hmm, I have tested decompress_ptr() quite extensively before releasing 1.4.3. Could you please post a minimal example reproducing the problem?
from python-blosc.
@FrancescAlted we just received the same issue on our builds: pandas-dev/pandas#14143
this works on macosx, failing on linux though.
from python-blosc.
@FrancescAlted ahh I see @solarix888 reported this from pandas ok, let me see if i can repro
from python-blosc.
hmm, I cannot repro this locally (on a 64-bit linux vm). maybe some conflict with some other libs on linux? (tried 3.5, now trying 2.7)
from python-blosc.
Hmm, my specific use case is that I save a pandas DataFrame to msgpack using the pd.to_msgpack() method, compressed using blosc. Then I read it back later. In this case, the compressed DataFrame was compressed with the previous version python-blosc, and then version update, I wasn't able to decompress.
from python-blosc.
-> return _ext.decompress(bytesobj, as_bytearray)
(Pdb) l
480 ... as_bytearray=True)) is bytearray
481 True
482
483 """
484
485 -> return _ext.decompress(bytesobj, as_bytearray)
486
487
488 def decompress_ptr(bytesobj, address):
489 """decompress_ptr(bytesobj, address)
490
(Pdb) p bytesobj
'\x02\x01\x01\x08\xf0\x00\x00\x00\xf0\x00\x00\x00L\x00\x00\x00\x14\x00\x00\x004\x00\x00\x00!\x00\x00\xe0\xaa\x00\x1f\xf0\x00\x08\x10\x14\x18\x1c "$&(*,.0123456789:;<=\x00?@\xe0\t\x00\x08@@@@@@@@@'
so this appears to be a compress-decompress ONLY on 2.7/linux. In this example we are using latin-1 encoding (but not sure if that matters)
The data is coming from a file that we created
here.
I don't know the exact version of blosc that was used originally, maybe https://github.com/kawochen remembers (this was checked in 7 months ago)
And working on 1.4.1 just fine.
lmk if any other data can provide
from python-blosc.
Yep, I can reproduce that. There is a fix in: Blosc/c-blosc@07d4bb0. I'll try to release new versions of C-Blosc and Python-Blosc as soon as possible,
from python-blosc.
I have released new versions of C-Blosc (1.11.1) and Python-Blosc (1.4.4). Please give them a test.
from python-blosc.
thanks @FrancescAlted looks great!
from python-blosc.
I had a very similar issue with version 1.5.1, getting an exit code -1073741819 while decompressing data generated with pandas msgpack function.
Upgrading to blosc 1.6.1 on the environment doing the decompression fixed the issue. The compression is still performed with blosc 1.5.1.
from python-blosc.
Related Issues (20)
- Issues decompressing bytes from files HOT 1
- Replace obsolete `popen2` HOT 1
- Properly identify vendored `cpuinfo.py` version
- Blosc_ROOT cmake warning: Policy CMP0074 is not set HOT 2
- "RuntimeError: Cannot decompress" for a compressed sequence of more than 7240 zero bytes HOT 1
- Very bad compression on short inputs 1-127 bytes long HOT 5
- “python_requires” should be set with “>=3.6”, as blosc 1.10.6 is not compatible with all Python versions. HOT 2
- wrong setuptools build command
- Concatenate two blosc compressed bytes objects HOT 2
- LICENSES/BLOSC.txt HOT 4
- Rename default branch HOT 1
- Update pypi with latest blosc version HOT 3
- Wheel for Python 3.10 and Python 3.11 HOT 3
- Cannot install blosc 1.11.0 on apple M1 machine HOT 3
- decompress in fore-end HOT 1
- README link to python-blosc2 seems useful HOT 1
- __pack_tensor__ must be made portable and not depend on Python HOT 2
- __pack_tensor__ should be in the beginning of the file to avoid seeking the whole file HOT 2
- Python 3.12 compatibility HOT 6
- Numpy 2 compatibility
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from python-blosc.