Comments (6)
impala-insert.sql fails with a memory limit exceeded which I am trying to increase but it is still failing. So far at 40G and failing. I also get errors: Failed to close HDFS file: ... and Error(255): Unknown error 255. Also the hint clustered is not used with my version (impalad version 2.5.0-cdh5.7.0) . any ideas of an alternative I could use to clustered?
from impala-tpcds-kit.
With such an old version (almost 3 years old) you will have to load the fact tables by limiting the number of partitions being loaded at one time in a multi-pass format.
from impala-tpcds-kit.
from impala-tpcds-kit.
I can't get impala updated on our work system which is annoying. So in impala-insert.sql I am currently trying to split the table for example, catalog_sales, by splitting the partitioning further. Is that the right way to go about it, and could you recommend the best way to do that split. Many thanks for your cooperation.
from impala-tpcds-kit.
The problem is that it takes a significant amount of memory to insert into every partition from every host, so the hints deal with that. On older versions you will have to limit the number of partitions being inserted into by adding a predicate on the text table. For example, taking 20 values of ss_sold_date_sk
from store_sales
at a time.
from impala-tpcds-kit.
Thanks Greg. I have found another database that I might use if I can't get this working. Thanks for the help.
from impala-tpcds-kit.
Related Issues (20)
- The answer is not consistent HOT 1
- Error when loading sales_store table
- -FILTER error. output not going to stdout HOT 2
- inventory table
- benchmark script
- Impala Scripts no longer work on latest CDH HOT 1
- lack of table HOT 4
- Running ./run-gen-facts.sh produces hdfs.DFSClient: DataStreamer Exception
- Does this repository provide a realistic TPC-DS benchmark?
- Not generating child tables HOT 1
- Exception when generating store_sales table. HOT 1
- load-store-sales index out of range HOT 4
- create reason and ship_mode external table with wrong file location
- query30.sql column c_last_review_date_sk does not exist, c_last_review_date intended? HOT 1
- query39.sql contains 2 queries HOT 1
- query5a.sql AnalysisException HOT 1
- DATE is not supported in CDH 5.16 Impala HOT 3
- Can impala tpcds run in Hadoop single node mode
- Missing copyright NOTICE
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from impala-tpcds-kit.