Comments (3)
An update: this problem occurs with decimals stored as "BYTE_ARRAY", which is what Apache NiFi generates, not with "FIXED_LEN_BYTE_ARRAY", which is what pyArrow generates.
With standard FIXED_LEN_BYTE_ARRAY there seems to be no problem, but if the size is not set, gibberish characters appear.
On the left is the metadata from the Parquet file generated by NiFi; on the right is the metadata from the one generated by pyArrow. (I was going to share a file with unimportant data, but pyArrow converts the decimals to fixed_len_byte_array, which ParquetViewer displays fine, so the problem seems to be limited to Parquet files with decimals generated by NiFi with the Parquet 1.0 writer.)
from parquetviewer.
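For readers unfamiliar with the two encodings mentioned above: in the Parquet format, a DECIMAL stored as either BYTE_ARRAY or FIXED_LEN_BYTE_ARRAY holds the unscaled value as a big-endian two's complement integer, and the scale comes from the column metadata. The only difference is whether the byte length is fixed per column or varies per value. The sketch below shows that decoding in plain Python. The function name `decode_parquet_decimal` is hypothetical and is not taken from ParquetViewer or parquet-dotnet; it only illustrates why rendering these bytes as text would produce gibberish.

```python
from decimal import Decimal


def decode_parquet_decimal(raw: bytes, scale: int) -> Decimal:
    """Decode a Parquet DECIMAL value from its binary representation.

    `raw` is the value as stored in a BYTE_ARRAY or FIXED_LEN_BYTE_ARRAY
    column: a big-endian two's complement unscaled integer.
    `scale` is the number of digits after the decimal point, taken from
    the column's DECIMAL annotation.
    (Hypothetical helper for illustration only.)
    """
    unscaled = int.from_bytes(raw, byteorder="big", signed=True)
    sign = 1 if unscaled < 0 else 0
    digits = tuple(int(ch) for ch in str(abs(unscaled)))
    # Building the Decimal from a tuple keeps the value exact,
    # regardless of the current decimal context precision.
    return Decimal((sign, digits, -scale))


# Variable-length (BYTE_ARRAY style): only as many bytes as needed.
print(decode_parquet_decimal(b"\x30\x39", scale=2))
# Fixed-length (FIXED_LEN_BYTE_ARRAY style): same value, sign-extended.
print(decode_parquet_decimal(b"\x00\x00\x30\x39", scale=2))
```

Both calls decode the same unscaled integer, so a reader that handles one encoding can in principle handle the other with the same arithmetic once it knows the value's byte length.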
Could you also share a sample file please?
I think these types of decimal are now supported in the parquet-dotnet library: aloneguid/parquet-dotnet#166
Since there is no sample file, I'm not able to validate this, but this ticket has been open for a long time, so I'm closing it out. If anyone can confirm that it works, or can provide a sample file that doesn't work, please feel free to comment or reopen this issue.
Related Issues (20)
- [FEATURE REQUEST] Ability to read parquet files compressed with "zstd" HOT 2
- [FEATURE REQUEST] Display timestamp fields in human-intelligible format HOT 5
- [FEAT] Add metadata viewer HOT 2
- [FEATURE-REQUEST] Ability to sort data by columns HOT 1
- [FEAT] Double-clicking a parquet file should open ParquetViewer with the contents of file HOT 4
- [FEAT] Adjust column size to data/column name HOT 2
- [FEAT] Time Decimals (milliseconds) in CSV export HOT 2
- [BUG] very low/high dates/timestamps (0001-01-01 and 9999-12-31 23:59:59.9999) cause problems HOT 2
- [BUG] Close button is not at the top right HOT 7
- [BUG] Opening ParquetViewer to an empty view HOT 9
- [BUG] App doesn't launch via pre-compiled binaries and any IDE besides Visual Studio HOT 6
- [FEAT] Display Rowgroup info HOT 1
- [BUG] Cannot open Parquet file with 2 similar column names (different case) HOT 5
- [BUG] Handling Files with many columns HOT 8
- [BUG] Cannot open file because of missing column, but column is present. HOT 4
- [FEAT] Search text in multiple Parquet files in one folder HOT 6
- [BUG] Error when opening file containing columns of LIST type HOT 3
- [BUG] sbyte and byte types swapped HOT 1
- [BUG] Unable to open the file which contains a nullable guid column HOT 2