Comments (7)
I have ported darknet (https://github.com/pjreddie/darknet) for GPGPU-sim (cuBLAS and cuRAND were replaced by my own kernels), and executed VGGnet-16 (inference).
It works fine with my native GPU1080Ti, but it shows buffer overflow while loading weight data from a file (528MB).
GPGPU-sim consumed more than 4GB memory while loading the weight file of VGGnet-16, and it showed buffer overflow which was not shown at native GPU.
from gpgpu-sim_distribution.
What type of application are you trying to run?
from gpgpu-sim_distribution.
from gpgpu-sim_distribution.
I'm using vgg-16.cfg and imagenet1k.data to validate an input image.
Here is my command line:
./darknet classifier predict cfg/imagenet1k.data cfg/vgg-16.cfg vgg-16.weights data/eagle.jpg
I face overflow error before execution of neural network layers.. GPU memory contents were corrupted while loading VGGnet-16 weight file.
As my observation, no GPU kernel was executed before memory corruption.
from gpgpu-sim_distribution.
@cyk0521 sorry for hijacking the thread, but is it possible to share your port of darknet? We are trying to use darknet with gpgpu-sim also facing the same issue regarding cuBLAS and cuRAND. Though we are using tiny yolo without memory overflow issue (for now).
from gpgpu-sim_distribution.
from gpgpu-sim_distribution.
While running resnet50, i faced the same issue.
Due to the overflow of the variable that holds address value, functional simulation fails.
from gpgpu-sim_distribution.
Related Issues (20)
- Data storage in physical memory
- Run ptx directly in GPGPUSim HOT 2
- When running GPGPU-sim with NVIDIA driver, such an error occurs. HOT 7
- how to get the Logic and arithmetic instructions of the PTX-level
- Resolve GCC Warning and Address Potential Bug in Checkpoint Functionality
- When running in gdb, an error occurs
- DVFS in GPGPU-sim
- make[1]: *** [Makefile:76: depend] Error 127 & make: *** [Makefile:207: cuda-sim] Error 2 in RHEL 9 HOT 20
- RuntimeError: cublas runtime error(gpgpu-sim with pytorch) HOT 1
- Deadlock when scaling up problem sizes
- ptx_parse() fuction doesn't return when executing different applications HOT 3
- Does gpgpu-sim support CUDA driver api? HOT 2
- How to complie it in ubuntu 22.04? HOT 2
- [Question] Using `gpgpu_ptx_sim_load_ptx_from_string` without affecting original `gpgpu_context`? HOT 1
- Independent Warp Scheduling in Volta+'s SIMT model HOT 1
- Does not natively support emulation within python runtime: HOT 1
- Segmentation fault (core dumped) when running gpgpusim 4.0/4.2 with cutlass 1.3(maybe due to .loc instruction syntax error in PTX) HOT 1
- gpgpu-sim v4: Increasing DRAM Frequency has no impact on performance
- make: Getting issue with the g++ command in Ubuntu 22.04
- Question about L2 configuration
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpgpu-sim_distribution.