Comments (8)
Target version 1.5.0 with support for Spark 3.1.0. Hopefully the DataSource V2 API will be stable by then.
from spark-hadoopoffice-ds.
Target version: HadoopOffice 1.4.0
@jornfranke, is this issue still being considered? When do you plan to release version 1.5.0? Thanks...
Yes. Normally the current version should work on Spark 3.1; it is just a matter of testing. Releases are scheduled by urgency, e.g. if you find that it does not work on Spark 3.1, I will prioritize the effort to make it happen.
@jornfranke, I tried it and it works. I found only one small issue, with the io.file.buffer.size property. Its type seems to have changed from String to Integer, so java.lang.NumberFormatException: For input string: "64K" was thrown at a point I couldn't control. Overriding the value when reading the dataset, sparkSession.read().option("io.file.buffer.size", 4096), didn't help, but upgrading to Hadoop version 3.3.0 did.
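To illustrate the error above, here is a minimal, self-contained sketch of why a value such as "64K" fails a plain integer parse. It assumes the property is ultimately read with something equivalent to Integer.parseInt; that is an assumption about the failing code path, not something confirmed in this thread. The class BufferSizeDemo and the helper parseSize are hypothetical names used only for illustration and are not part of Hadoop, Spark, or HadoopOffice.

```java
// Illustration only: BufferSizeDemo and parseSize are hypothetical names.
class BufferSizeDemo {

    // Hypothetical helper: converts "4096" or "64K" into a number of bytes.
    // Only a trailing K/k suffix (meaning * 1024) is handled in this sketch.
    static int parseSize(String raw) {
        String value = raw.trim();
        int multiplier = 1;
        if (value.endsWith("K") || value.endsWith("k")) {
            multiplier = 1024;
            value = value.substring(0, value.length() - 1);
        }
        return Integer.parseInt(value) * multiplier;
    }

    // Returns the message of the NumberFormatException raised by a plain
    // integer parse, or null if the value parses cleanly.
    static String plainParseError(String raw) {
        try {
            Integer.parseInt(raw);
            return null;
        } catch (NumberFormatException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // A plain integer parse rejects the size suffix.
        System.out.println(plainParseError("64K")); // For input string: "64K"
        // A suffix-aware parse accepts both forms.
        System.out.println(parseSize("64K"));       // 65536
        System.out.println(parseSize("4096"));      // 4096
    }
}
```

If the "64K" value comes from the cluster's Hadoop configuration rather than from the read options, one thing that may be worth trying (untested here) is passing a plain integer through Spark's spark.hadoop. property prefix, for example spark.hadoop.io.file.buffer.size=65536, since Spark copies such properties into the Hadoop configuration.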
Valid point. I will address this and see whether it can be tested with Spark 3.
Sorry, this is not an issue in HadoopOffice: we do not set io.file.buffer.size, and we do not compile a Hadoop version into the library. I assume your Spark installation must require it.
A request came in via a forum to update the Spark dependency to 3.x to avoid notifications from JFrog Artifactory Xray.
I will do this after some cleanups.
Note: The JFrog warning is a false positive. We do not include Spark; it is declared as provided, i.e. HadoopOffice uses the version supplied by the user's runtime.