Comments (5)
I haven't been able to contribute to the official repo as much as I wanted to lately. I needed an iceberg implementation that works with datafusion rather quickly. Since this repo already had basic support for datafusion when the official repo was started, it was faster to implement the missing functionality here than to wait for the official repo. Additionally this repo is closer to the arrow ecosystem by using the object_store crate.
I hope I will have more time in the future to contribute to the official repo.
It would be great to have you on board for either repo.
from iceberg-rust.
Noted, thanks! I'm going to study up some more on Iceberg and then figure out what to work on next.
from iceberg-rust.
Actually, can you expand on what you mean by closer to the arrow ecosytem by using object_store crate?
from iceberg-rust.
Well, the official apache iceberg-rust crate uses Opendal to handle the interaction with cloud object stores while I preferred to use the object_store crate which is part of the arrow-rs ecosystem.
But I have to admit that this is a bit of a constructed argument because I think there is a crate that implements the ObjectStore
trait for opendal.
Since I mostly focused on Datafusion, which uses object_store
internally, it was easier for me to use it directly.
from iceberg-rust.
Great, thank you for the information @JanKaul
from iceberg-rust.
Related Issues (20)
- Implement ExprRewriter to filter partition columns
- Status of Writing to Iceberg? HOT 6
- Docker Image, CLI Client and Webserver HOT 1
- Inconsistent unit test HOT 3
- New home of the repository? HOT 1
- Project status HOT 3
- Question: Datafile writer HOT 8
- Change Catalog trait to allow REST catalog
- Adopt changes to Materialized View spec
- Add Rest-catalog
- Use Trino Materialized View Spec
- fix datafile path with partition
- fix catalog interface
- Improve builders
- timestamp with timezone primitive type should be timestamptz HOT 1
- `README` mixes up `order_id` with `product_id` HOT 2
- Does this crate work with the python datafusion package? HOT 1
- Partition Datafusion plan by iceberg partitioning
- Storage partition join
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from iceberg-rust.