Code Monkey home page Code Monkey logo

Comments (12)

ankur4369 avatar ankur4369 commented on August 21, 2024 1

@yingsu00 I have started to understand the code. I am already up to speed with developer docs on Velox. But still wrapping my head around the implementation details.

In terms of time commitment, I should easily be able to spend ~4-5 hours daily going up to 10-12 hours/day if required. However, being new to the code base, I may need some guidance to speed up.

from velox.

wubiaoi avatar wubiaoi commented on August 21, 2024 1

@pedroerp Now I mainly use online use cases to do some performance tests. First, I will convert the format to parquet format. If the performance is good, I will develop orc format support, because online Hive tables are mainly in orc format.

from velox.

ankur4369 avatar ankur4369 commented on August 21, 2024

@yingsu00 I would like to attempt this and get it done. Still pretty new to the library but I hope to be able to contribute with guidance. Let me know if this is okay.

from velox.

yingsu00 avatar yingsu00 commented on August 21, 2024

@ankur4369 Thank you for offering help! ORC reader is crucial and performance critical. Some functionality may already be supported in DWRF reader but many are not. It will take several months to make it work and performant. How's your time commitment on it?

@mbasmanova @pedroerp What's Meta's plan on this? Will XLDB folks work on ORC reader?

from velox.

mbasmanova avatar mbasmanova commented on August 21, 2024

@ankur4369 It would be very nice if you could help add support for the ORC format. ORC is similar to DWRF and we have support for DWRF already. I hope it would be just a handful of tweaks to add support for ORC. We should be able to help by answering questions, brainstorming and reviewing the PRs. I'm CC-ing Zhenyuan and Orri who should be able to provide guidance as well.

I suggest to first figure out how to make ORC work without worrying too much about performance. Perf optimizations can come after basic support is in place.

CC: @zzhao0 @oerling

from velox.

yingsu00 avatar yingsu00 commented on August 21, 2024

@ankur4369 Is there any update on this?

from velox.

stale avatar stale commented on August 21, 2024

Is this still relevant? If so, what is blocking it? Is there anything you can do to help move it forward?

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.

from velox.

wpleonardo avatar wpleonardo commented on August 21, 2024

@mbasmanova @pedroerp Does Velox have any plan to support ORC reader? I saw the above message, seems this feature didn't finish, right?

from velox.

mbasmanova avatar mbasmanova commented on August 21, 2024

@wpleonardo It would be nice to add support for ORC. Would you like to help build that?

from velox.

pedroerp avatar pedroerp commented on August 21, 2024

Re-opening since ORC support is still relevant.

from velox.

wubiaoi avatar wubiaoi commented on August 21, 2024

Hi, what's the status now?

from velox.

pedroerp avatar pedroerp commented on August 21, 2024

I don't know of anyone actively working on this. @wubiaoi, is this something you would be interested in contributing?

from velox.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.