Comments (1)
From the readme:
3. imp: This is the the set of queries created by Cloudera for testing in Impala.
The queries are available to provide an apples to apples comparison of run times to
compare with Impala. The queries were copied from this public repo:
https://github.com/cloudera/impala-tpcds-kit
The license file for these queries is also included:
impala_queries_license.txt
The Impala queries were changed for syntax and to remove partition cheating. Impala
doesn't support the concatenation with || so they changed the SQL to use their concat()
function. This was changed back. Intervals were changed from "interval 30 days" to
"'30 days'::interval". Query hints were removed. There are 117 removals of explicit
partition pruning in 63 unique queries. Queries 3, 4, 7, 9, 14, 24, 35, 46, and 51 were
heavily modified by Cloudera so reverting to the original TPC-DS version.
from tpc-ds.
Related Issues (20)
- Creating socket failed during dataload HOT 2
- hawq_rm_nvseg_perquery_perseg_limit clarification HOT 1
- Very poor HDFS throughput HOT 2
- Unable to load more than 50GB data in hdfs through tcpds script HOT 8
- Sharing TPC-DS test results of HAWQ & SparkSQL
- Generate data step hangs HOT 14
- relation "pg_filespace_entry" does not exist HOT 7
- Changes in Postgresql.conf causing to Stop Greenplum HOT 5
- Canceling query because of high VMEM usage. HOT 2
- ERROR: could not open file "../log/rollout_gen_data.log" for reading: No such file or directory HOT 10
- Can not execute tpcds.sh in offline environments HOT 2
- Setting RUN_COMPILE_TPCDS="false" does not disable compiling HOT 2
- 请教问题 HOT 5
- what's the difference with score and qphds HOT 2
- Should 02_init/rollout.sh set search path for ADMIN_USER? HOT 3
- ERROR: http response code 404 from gpfdist HOT 19
- Selected scale factor is NOT valid && Connection timed out HOT 7
- Generating data takes long time HOT 4
- Session report not avaialbe HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tpc-ds.