Code Monkey home page Code Monkey logo

Comments (8)

jornfranke avatar jornfranke commented on June 13, 2024 1

Target version 1.5.0 and support for Spark 3.1.0. Hopefully the datasource V2 api will then be stable

from spark-hadoopoffice-ds.

jornfranke avatar jornfranke commented on June 13, 2024

target version hadoopoffice 1.4.0

from spark-hadoopoffice-ds.

danijelz avatar danijelz commented on June 13, 2024

@jornfranke, is this issue still considered? When do you plan relising version 1.5.0? Thanks...

from spark-hadoopoffice-ds.

jornfranke avatar jornfranke commented on June 13, 2024

Yes. Normally the current version should work on Spark 3.1, it is just a matter of testing. Release by urgency. E.g. if you find out it does not work on Spark 3.1 then I will prioritize efforts to make it happen.

from spark-hadoopoffice-ds.

danijelz avatar danijelz commented on June 13, 2024

@jornfranke, I tried it and it works. I only found one little issue with io.file.buffer.size property. It seems like its type has changed from String to Integer so it threw java.lang.NumberFormatException: For input string: "64K" at some point I couldn't control. Overriding the value while trying to read dataset sparkSession.read().option("io.file.buffer.size", 4096) didn't help but it helped to upgrade to hadoop version 3.3.0.

from spark-hadoopoffice-ds.

jornfranke avatar jornfranke commented on June 13, 2024

valid point - i will address this and see if it can be tested with Spark 3

from spark-hadoopoffice-ds.

jornfranke avatar jornfranke commented on June 13, 2024

Sorry, this point is not an issue in HadoopOffice as we do not set io.file.buffer.size and we do not compile the Hadoop Version into the library. I assume your spark installation must require it.

from spark-hadoopoffice-ds.

jornfranke avatar jornfranke commented on June 13, 2024

There was via a forum the wish to update the dependency of Spark to 3.x to avoid notification by JFrog Artifactory X-Ray.
I will do this after some cleanups

Note: The Jfrog warning is a false positive - we do not include Spark - we have it as provided, ie HadoopOffice uses the version provided by the runtime of the user.

from spark-hadoopoffice-ds.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.