Comments (3)
If your goal is to investigate OOMs in the Spark driver, sparkMeasure is not the best tool.
Spark metrics instrumentation has metrics for Java memory usage on the driver (and executors), which may be useful for your case; see https://github.com/LucaCanali/Miscellaneous/tree/master/Spark_Dashboard
Alternatively, you can try Java instrumentation such as JFR, or profiling tools such as https://github.com/jvm-profiling-tools/async-profiler
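As a rough sketch of the metrics-instrumentation route, the snippet below builds the Spark settings that turn on the JVM metrics source and send it to a Graphite-compatible sink. The `spark.metrics.conf.*` keys and the two class names come from Spark's metrics system; the helper names, host, port, and reporting period are placeholders chosen for illustration, so adjust them to your own setup.

```python
def jvm_metrics_conf(graphite_host, graphite_port=2003, period_s=10):
    """Return Spark settings enabling the JVM metrics source and a Graphite sink.

    graphite_host / graphite_port / period_s are placeholders, not defaults
    taken from any project.
    """
    prefix = "spark.metrics.conf.*."
    return {
        # JVM source: heap, non-heap and GC metrics for driver and executors
        prefix + "source.jvm.class": "org.apache.spark.metrics.source.JvmSource",
        # Graphite sink: pushes metrics out of the JVM at a fixed period
        prefix + "sink.graphite.class": "org.apache.spark.metrics.sink.GraphiteSink",
        prefix + "sink.graphite.host": graphite_host,
        prefix + "sink.graphite.port": str(graphite_port),
        prefix + "sink.graphite.period": str(period_s),
        prefix + "sink.graphite.unit": "seconds",
    }


def to_submit_args(conf):
    """Render a settings dict as spark-submit style ['--conf', 'k=v', ...] arguments."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args
```

Because the sink pushes metrics periodically, the values reported before an OOM remain available in the external store even after the driver is gone.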
from sparkmeasure.
Hi,
Flight recorder mode with file output currently has a very simple implementation: all metrics are buffered in driver memory and are written out only when the application finishes, which is indeed a problem if the driver crashes in between.
You may want to have a look at Flight recorder mode with the InfluxDB sink and/or the Kafka sink to overcome those issues: https://github.com/LucaCanali/sparkMeasure/blob/master/docs/Flight_recorder_mode_InfluxDBSink.md https://github.com/LucaCanali/sparkMeasure/blob/master/docs/Flight_recorder_mode_KafkaSink.md
You may also want to check the Spark dashboard project: https://github.com/cerndb/spark-dashboard
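For illustration, a minimal sketch of the settings involved in switching from file output to one of the streaming sinks. The listener class names and `spark.sparkmeasure.*` keys are written as I understand them from the linked sink documents and should be checked against the sparkMeasure version in use; the helper function names are hypothetical, and the URL, broker, and topic are whatever your environment provides.

```python
def influxdb_sink_conf(influxdb_url):
    """Settings for Flight recorder mode with the InfluxDB sink (keys assumed from the docs)."""
    return {
        "spark.extraListeners": "ch.cern.sparkmeasure.InfluxDBSink",
        "spark.sparkmeasure.influxdbURL": influxdb_url,
    }


def kafka_sink_conf(broker, topic):
    """Settings for Flight recorder mode with the Kafka sink (keys assumed from the docs)."""
    return {
        "spark.extraListeners": "ch.cern.sparkmeasure.KafkaSink",
        "spark.sparkmeasure.kafkaBroker": broker,
        "spark.sparkmeasure.kafkaTopic": topic,
    }

# Possible use with PySpark (the sparkMeasure jar must also be on the classpath):
#   builder = SparkSession.builder
#   for key, value in kafka_sink_conf(broker, topic).items():
#       builder = builder.config(key, value)
```

With either sink, metrics leave the driver as listener events arrive instead of being held in memory until the application ends.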
Best,
Luca
@LucaCanali
Thanks for the reply. Even if Flight recorder mode uses the InfluxDB sink and/or the Kafka sink, it would not be able to report metrics for the latest / ongoing task/stage/executor at the time of the crash, right? One solution might be to have some sort of monitoring done outside of the Spark driver. Do you know of anything like that, or is there any other workaround I might have missed? Thanks!
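To make the "monitoring outside of the Spark driver" idea concrete, here is one possible sketch, not something sparkMeasure provides: a separate process polls the Spark monitoring REST API served by the driver UI and appends a snapshot of the active stages to a local file, so the last snapshot taken before a crash survives. The `/api/v1/applications/{app_id}/stages` endpoint and the stage fields used are part of Spark's REST API; the function names, output format, and polling interval are assumptions made for the example.

```python
import json
import time
import urllib.request


def fetch_json(url, timeout=5):
    """GET a URL and decode the JSON response."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


def summarize_stages(stages):
    """Keep only ACTIVE stages, with a few task counters per stage."""
    return [
        {
            "stageId": s.get("stageId"),
            "status": s.get("status"),
            "activeTasks": s.get("numActiveTasks", 0),
            "completeTasks": s.get("numCompleteTasks", 0),
        }
        for s in stages
        if s.get("status") == "ACTIVE"
    ]


def poll(base_url, app_id, out_path, interval_s=5):
    """Append one JSON line per poll until the driver UI stops answering."""
    url = f"{base_url}/api/v1/applications/{app_id}/stages"
    while True:
        try:
            snapshot = summarize_stages(fetch_json(url))
        except OSError:
            # Connection errors are OSError subclasses: the driver is unreachable.
            break
        with open(out_path, "a") as f:
            f.write(json.dumps({"ts": time.time(), "stages": snapshot}) + "\n")
        time.sleep(interval_s)
```

The last line of the output file then shows which stages were running shortly before the driver went away, at the granularity of the polling interval.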