Comments (7)
Hi,
I did not encounter this issue. In my case, loading ptxinfo and extracting the PTX files both worked.
Did you set CUDA_INSTALL_PATH? GPGPU-Sim may call the ptxas command based on its value.
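A quick way to check the point above is to confirm that CUDA_INSTALL_PATH is set and that a ptxas binary actually sits under it. The sketch below is only an illustration: the helper name find_ptxas is hypothetical, and it assumes the usual CUDA toolkit layout where ptxas lives in the bin directory of the install path. It does not reproduce how GPGPU-Sim itself locates the tool.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def find_ptxas(env: Mapping[str, str] = os.environ) -> Optional[Path]:
    """Return the ptxas path under CUDA_INSTALL_PATH, or None if it is unusable.

    Hypothetical helper: assumes the standard toolkit layout <root>/bin/ptxas.
    """
    root = env.get("CUDA_INSTALL_PATH")
    if not root:
        return None  # variable is unset or empty
    candidate = Path(root) / "bin" / "ptxas"
    return candidate if candidate.is_file() else None


if __name__ == "__main__":
    path = find_ptxas()
    print(path if path else "CUDA_INSTALL_PATH is unset or has no bin/ptxas")
```

Running this in the same shell that launches the simulated application shows whether the variable is visible to that process.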
from gpgpu-sim_distribution.
Thank you. I have fixed the ptxas problem, although I do not know how it got fixed just by switching to a different computer.
Have you tried the MNIST example in https://github.com/gpgpu-sim/gpgpu-sim_simulations? It seems the developers use this app for testing. In the end, though, I got a "PTX not implementation error".
from gpgpu-sim_distribution.
As you suggested, I tried gpgpu-sim/gpgpu-sim_simulations.
When I run mnistCUDNN, PTX extraction and some implicit_convolve_sgemm calls work well. I could not reproduce the "PTX not implementation error".
However, I hit a deadlock and the simulator eventually aborted with the following messages:
GPGPU-Sim uArch: Shader 32 bind to kernel 1 '_ZN5cudnn6detail23implicit_convolve_sgemmIffLi128ELi5ELi5ELi3ELi3ELi3ELi1ELb1ELb0ELb1EEEviiiPKT_iPT0_PS2_18kernel_conv_paramsiffiS6_S6_ii'
<CTA alloc> : sm_idx=32 sid=32 max_cta_per_sm=16
GPGPU-Sim uArch: Shader 34 bind to kernel 1 '_ZN5cudnn6detail23implicit_convolve_sgemmIffLi128ELi5ELi5ELi3ELi3ELi3ELi1ELb1ELb0ELb1EEEviiiPKT_iPT0_PS2_18kernel_conv_paramsiffiS6_S6_ii'
<CTA alloc> : sm_idx=34 sid=34 max_cta_per_sm=16
GPGPU-Sim uArch: ERROR ** deadlock detected: last writeback core 34 @ gpu_sim_cycle 12492 (+ gpu_tot_sim_cycle 4294867296) (87508 cycles ago)
GPGPU-Sim uArch: DEADLOCK shader cores no longer committing instructions [core(# threads)]:
GPGPU-Sim uArch: DEADLOCK 0(64) 1(0) 2(64) 3(0) 4(64) 5(0) 6(64) 7(0) + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ... + others ...
Re-run the simulator in gdb and use debug routines in .gdbinit to debug this
Aborted (core dumped)
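The DEADLOCK line in the log above lists entries in the form core(# threads), so it can be turned into a small table to see which cores still hold threads. The following is a minimal sketch that relies only on the line format visible in this log; the function names are hypothetical and not part of GPGPU-Sim.

```python
import re
from typing import Dict, List

# Matches entries such as "0(64)" or "7(0)": core id, then thread count.
_CORE_RE = re.compile(r"(\d+)\((\d+)\)")
_PREFIX = "GPGPU-Sim uArch: DEADLOCK"


def parse_deadlock_line(line: str) -> Dict[int, int]:
    """Map core id -> remaining thread count for one DEADLOCK log line."""
    if not line.startswith(_PREFIX):
        raise ValueError("not a GPGPU-Sim DEADLOCK line")
    body = line[len(_PREFIX):]
    return {int(core): int(threads) for core, threads in _CORE_RE.findall(body)}


def cores_with_threads(counts: Dict[int, int]) -> List[int]:
    """Return the sorted ids of cores that still have threads resident."""
    return sorted(core for core, n in counts.items() if n > 0)


if __name__ == "__main__":
    sample = ("GPGPU-Sim uArch: DEADLOCK 0(64) 1(0) 2(64) 3(0) "
              "4(64) 5(0) 6(64) 7(0) + others ...")
    counts = parse_deadlock_line(sample)
    print(cores_with_threads(counts))  # prints [0, 2, 4, 6]
```

In this log, only the even-numbered cores among the first eight still hold 64 threads each, which may help narrow down where to look in gdb.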
from gpgpu-sim_distribution.
Which configuration files are you using? When I used TITANX (Maxwell), I also got a deadlock. Could you try the TITANV config instead?
It seems that your log file also contains "Warning: cannot find deviceFun". I guess the problem is that GPGPU-Sim failed to find some device functions (maybe they do not belong to cuDNN? I will check later).
Thank you.
from gpgpu-sim_distribution.
Hello, I've found that mnistCUDNN requires both libcudnn and libcublas, but GPGPU-Sim supports extracting PTX files from only one library (its path is defined in the PYTORCH_BIN variable).
I modified GPGPU-Sim so that it supports extracting PTX files from multiple libraries, and uploaded my pull request (PR #116).
Although mnistCUDNN calls several kernels built for the Fermi or Maxwell architecture (e.g. cudnn7maxwell4gemm) under a Volta architecture config file, it works well anyway.
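To illustrate the idea of pulling PTX from more than one library, here is a standalone sketch that runs cuobjdump with its -ptx option once per library. It is not the code from the pull request and does not reflect how GPGPU-Sim performs extraction internally; the function names and the library file names in the usage line are hypothetical placeholders.

```python
import shutil
import subprocess
from typing import Dict, List


def build_commands(libraries: List[str]) -> List[List[str]]:
    """Build one 'cuobjdump -ptx <library>' command per library path."""
    return [["cuobjdump", "-ptx", lib] for lib in libraries]


def extract_all(libraries: List[str]) -> Dict[str, str]:
    """Run cuobjdump on each library and return its PTX dump keyed by path."""
    if shutil.which("cuobjdump") is None:
        raise RuntimeError("cuobjdump not found on PATH")
    dumps: Dict[str, str] = {}
    for lib, cmd in zip(libraries, build_commands(libraries)):
        result = subprocess.run(cmd, capture_output=True, text=True, check=True)
        dumps[lib] = result.stdout
    return dumps


if __name__ == "__main__":
    # Placeholder paths; substitute the actual libcudnn and libcublas files.
    for cmd in build_commands(["libcudnn.so", "libcublas.so"]):
        print(" ".join(cmd))
```

Keeping command construction separate from execution makes it possible to inspect what would run on a machine without the CUDA toolkit installed.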
from gpgpu-sim_distribution.
Was mnistCUDNN linked statically with libcudnn and libcublas? I believe that was a requirement, to account for GPGPU-Sim only being able to extract PTX files from one file. The Makefile in deval281shah's mnistCUDNN example should be passing flags to statically link those libraries.
from gpgpu-sim_distribution.
@RedCarrottt @bigwater
Hello,
When I test mnist/main.py, the output "GPGPU-Sim PTX: Parsing libcaffe2_gpu.1.sm_61.ptx" has been showing for a long time and has not finished yet. Have you ever encountered this problem when running it? Is it an error, or does parsing the PTX itself take this long?
Thanks.
from gpgpu-sim_distribution.