Comments (6)
Hi, right now ze_tracer does not support stages highlighting like opencl-intercept-layer. But you can collect Chrome trace that includes both host and device activities with --chrome-call-logging
and --chrome-device-timeline
flags, where each kernel execution has it's unique ID that can be matched with host API calls.
E.g. one may find zeCommandListAppendLaunchKernel
call (append phase), zeCommandQueueExecuteCommandLists
call (submit phase), and kernel execution (execution phase) with the same ID to retrieve similar information.
If you need exactly the same functionality as ChromePerformanceTimingInStages provides - yes, I believe we can do this as well, just let us know. But note that L0 works differently than OpenCL and has not quite the same stages.
from pti-gpu.
I think the ChromePerformanceTimingInStages is very helpful in performance analysis, it is great if your team can support this one.
from pti-gpu.
Got it, we will start work on this capability. Will update you when it will be ready.
from pti-gpu.
great, thank you!!!
from pti-gpu.
Done, --chrome-device-stages
option. Please try it and let me know if there are any issues with this new feature.
from pti-gpu.
@anton-v-gorshkov thank you for the quick response, I will check it. And close this issue now. Thanks.
from pti-gpu.
Related Issues (20)
- [oneprof] - please add per kernel aggregation
- Can't build gpuinfo HOT 2
- Does oneprof support collecting of L1 and L2 cache hit ratio? HOT 1
- windows oneprof atexit issues HOT 1
- Old MD library name on Lin HOT 1
- intel_gpu_abrt: 9: Bad substitution
- [zetracer] Report median time
- Unable to find target metric group: ComputeBasic HOT 1
- Assertion status == ZE_RESULT_SUCCESS failed HOT 1
- oneprof crashes when using mpirun + workload that calls make
- unitrace fails with: Assertion `zeEventQueryStatus(event) == ZE_RESULT_NOT_READY' failed.
- execute sysmon running process unknown HOT 2
- oneprof -q fails with error "ZE_RESULT_SUCCESS' failed" HOT 2
- Assertion `instance->device_sync <= queued' failed HOT 1
- [BUG][onetrace][IMME CmdList] Tool got less kernel calls in report than actual submitted HOT 2
- oneprof fails for LLM workloads
- [PTI-SDK] Buffer event timestamp conversion HOT 4
- [PTI-SDK] Memory copy record does not contain copied size HOT 5
- ze_tracer/onetrace: Assertion `call->command != command' failed with simple SYCL Graph application HOT 5
- Run into Segmentation fault when using unitrace for python script HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pti-gpu.