Comments (10)
spark-submit should be on the master in /root/spark if the setup completed successfully.
from spark-ec2.
@shivaram This is good to hear, but I went through the process multiple times and /root/spark only has /conf.
I'll dig in some more to see if I come up with something, thanks! Will follow-up shortly.
Confirmed a couple more times and seemingly no errors on my end. If this isn't an issue for anyone else, any tips for figuring out what is going on here?
oh, this wasn't loud enough in the logs:
Initializing spark
--2017-03-29 19:05:47-- http://s3.amazonaws.com/spark-related-packages/spark-1.6.2-bin-hadoop1.tgz
Resolving s3.amazonaws.com (s3.amazonaws.com)... 52.216.1.75
Connecting to s3.amazonaws.com (s3.amazonaws.com)|52.216.1.75|:80... connected.
HTTP request sent, awaiting response... 404 Not Found
2017-03-29 19:05:47 ERROR 404: Not Found.
ERROR: Unknown Spark version
spark/init.sh: line 137: return: -1: invalid option
return: usage: return [n]
Unpacking Spark
tar (child): spark-*.tgz: Cannot open: No such file or directory
tar (child): Error is not recoverable: exiting now
tar: Child returned status 2
tar: Error is not recoverable: exiting now
rm: cannot remove `spark-*.tgz': No such file or directory
mv: missing destination file operand after `spark'
Read the docs that we could specify the Spark package. Is it required?
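The log above suggests two separate problems: the package URL returned a 404, and the script's `return -1` was rejected by the shell, so the unpack step ran anyway. A minimal sketch of a fail-fast wrapper is shown here; `download_or_abort` is a hypothetical helper for illustration, not the code in spark/init.sh, and `SPARK_PACKAGE_URL` is an assumed variable name.

```shell
# Hypothetical helper (not from spark/init.sh): run a command and
# report failure with a status the shell will accept.
download_or_abort() {
  if ! "$@"; then
    echo "ERROR: command failed: $*" >&2
    return 1  # non-negative status; the log shows `return -1` being rejected
  fi
}

# Intended use inside a setup script (the URL variable is an assumption):
#   download_or_abort wget "$SPARK_PACKAGE_URL" || exit 1
```

With a check like this, a failed download would stop the setup before the tar, rm, and mv steps run against a file that does not exist.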
> Read the docs that we could specify the Spark package. Is it required?
Bumping this. I'm willing to push an update to make this required if the above is expected behavior when no repo URL or version is specified.
I think this is a specific problem with Hadoop version 1 and Spark 1.6.2. Can you try passing the Hadoop version as 2 or yarn and see if it works?
To be clear, I've been getting past this by specifying a commit hash which I prefer anyhow. But yes, I will give this a try to provide some feedback. Thanks!
Adding --hadoop-major-version 2 to launch fixed it.
Is there anything we should do to work around this in code and/or document it? Feel free to close if not.
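For anyone hitting the same 404, a launch invocation with the flag might look roughly like the sketch below. The key pair, identity file, and cluster name are placeholders (assumptions), and the command is only printed here as a dry run rather than executed.

```shell
# Placeholders (assumptions): replace with your own key pair, key file, and cluster name.
KEY_PAIR="my-keypair"
IDENTITY_FILE="$HOME/.ssh/my-keypair.pem"
CLUSTER_NAME="my-cluster"

# Build the command as a string for review (dry run); nothing is launched here.
LAUNCH_CMD="./spark-ec2 --key-pair=$KEY_PAIR --identity-file=$IDENTITY_FILE --spark-version=1.6.2 --hadoop-major-version=2 launch $CLUSTER_NAME"
echo "$LAUNCH_CMD"
```

Per the suggestion earlier in the thread, yarn should also be an accepted value for --hadoop-major-version in place of 2.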
I think it would be great if we could change the default so that it is not the failure case. Can you send a PR changing the default Hadoop version to either 2 or yarn?
You got it. Busy next few days but will follow through.
Will also include some documentation on the use of --hadoop-major-version, which is seemingly missing from the README.
Thanks again.