Code Monkey home page Code Monkey logo

Comments (6)

AFgh24 avatar AFgh24 commented on August 26, 2024 1

If your initial request was something like text search across a folder full of different parquet files, like grep search, that's not something I plan on adding any time soon.

Yes
My request was this
I hope you add this feature soon
At the right time

from parquetviewer.

mukunku avatar mukunku commented on August 26, 2024

Could you try the following to see if it will work for you?

  1. Using v2.6.0 go to File -> Open Folder
  2. Select your folder containing your parquet files
  3. Once loaded, increase Record Count to how ever much you need. (Search only runs on records that are loaded into memory)
  4. Perform your search in the query box

from parquetviewer.

mukunku avatar mukunku commented on August 26, 2024

Hey @AFgh24 Any luck with the latest beta release?

from parquetviewer.

AFgh24 avatar AFgh24 commented on August 26, 2024

Thank

I put two sample files in one folder

The first example file opens in the folder (no problem)
https://github.com/Teradata/kylo/tree/master/samples/sample-data/parquet

The second example files are closed without being fully loaded (problematic)
https://huggingface.co/docs/datasets-server/parquet

https://huggingface.co/datasets/duorc/resolve/refs%2Fconvert%2Fparquet/SelfRC/duorc-validation.parquet

https://huggingface.co/datasets/duorc/resolve/refs%2Fconvert%2Fparquet/ParaphraseRC/duorc-validation.parquet

When both sample files are placed in the same folder, the following error is given

https://gcdnb.pbrd.co/images/I2sVBNG44bza.png

from parquetviewer.

mukunku avatar mukunku commented on August 26, 2024

Hey @AFgh24 ,

Thanks for the sample files. This error means the folder you are trying to open contains different parquet files. The utility cannot open multiple parquet files with different schemas. It only works if all the parquet files in the folder have the same schema, meaning they are truly partitioned.
image

If your initial request was something like text search across a folder full of different parquet files, like grep search, that's not something I plan on adding any time soon.

from parquetviewer.

mukunku avatar mukunku commented on August 26, 2024

Going to close this issue out with the won't fix tag for now.

from parquetviewer.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.