The simplemanycoresim from domiii

A very simple many-core simulator based on ARMulator. This is also the final project for an Advanced Computer Architecture class from 2012.

C++ 84.10% C 4.41% Python 2.44% Shell 7.58% CMake 1.47%

simplemanycoresim's Introduction

This was a final project for Advanced Computer Architecture at NTU in the 2012-1 semester.

The goal of the project was to design, implement, and (partly) test a new many-core CPU model in which each L2 cache is shared between all cores of a block (a block being a rectangular subset of cores).
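To make the block structure concrete, here is a minimal sketch of how a core's grid position could be mapped to the block (and therefore the shared L2 cache) it belongs to. All names (`GridConfig`, `CoreCoord`, `blockIndexOf`) are hypothetical and not taken from the simulator's source. The sketch assumes blocks are numbered row by row and that the grid dimensions are exact multiples of the block dimensions.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical description of the core grid and its block partitioning.
struct GridConfig {
    std::size_t coresX, coresY;  // number of cores along each axis
    std::size_t blockW, blockH;  // block size in cores (assumed to divide the grid evenly)
};

// Position of a core in the grid.
struct CoreCoord {
    std::size_t x, y;
};

// Number of blocks along the x axis.
inline std::size_t blocksPerRow(const GridConfig& g) {
    assert(g.blockW > 0 && g.coresX % g.blockW == 0);
    return g.coresX / g.blockW;
}

// Row-major index of the block whose shared L2 cache serves core c.
inline std::size_t blockIndexOf(const GridConfig& g, CoreCoord c) {
    assert(c.x < g.coresX && c.y < g.coresY);
    assert(g.blockH > 0 && g.coresY % g.blockH == 0);
    return (c.y / g.blockH) * blocksPerRow(g) + (c.x / g.blockW);
}
```

For an 8x8 grid with 4x2 blocks, cores (0,0) and (3,1) fall into block 0 and share an L2 cache, while core (4,0) falls into block 1.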

The long-term goal was to automatically load-balance the L2-cache blocking by modeling the L2 cache as a fluid and then optimizing it using a simplified version of the Navier-Stokes equations, where pressure could be modeled from the cache miss rate, the router congestion, or a combination of both.
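As an illustration of the pressure idea only, the following sketch computes a per-block pressure as a weighted sum of miss rate and router congestion, plus the pressure difference between neighbouring blocks in a 1-D chain. The weighted-sum form, the weights, and all names (`BlockStats`, `pressure`, `pressureDifferences`) are assumptions made for this example. It does not implement Navier-Stokes, and it does not decide how a pressure difference would change the block layout.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-block measurements, both normalized to [0, 1].
struct BlockStats {
    double missRate;    // L2 miss rate of the block
    double congestion;  // router congestion observed for the block
};

// Assumed pressure model: a weighted sum of the two measurements.
inline double pressure(const BlockStats& s, double wMiss, double wCongestion) {
    assert(wMiss >= 0.0 && wCongestion >= 0.0);
    return wMiss * s.missRate + wCongestion * s.congestion;
}

// Pressure difference between each pair of adjacent blocks in a 1-D chain:
// result[i] = pressure(blocks[i + 1]) - pressure(blocks[i]).
inline std::vector<double> pressureDifferences(const std::vector<BlockStats>& blocks,
                                               double wMiss, double wCongestion) {
    std::vector<double> diff;
    for (std::size_t i = 0; i + 1 < blocks.size(); ++i) {
        diff.push_back(pressure(blocks[i + 1], wMiss, wCongestion) -
                       pressure(blocks[i], wMiss, wCongestion));
    }
    return diff;
}
```

A nonzero difference would be the driving term of any rebalancing step. Which quantity then "flows" (cache capacity, block boundaries, or tasks) is a design decision this sketch leaves open.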

Link: Final Report (pdf)

TODO LIST:

Add include guards to header files
Add include files
Modify ARMulator
- Merge MMU modifications
- Merge CPU modifications
Fill in all TODOs and obvious compiler errors
- How to design the Address class? Can we do bitwise packing?
- foreach
- Some more
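On the open question about the Address class: one possible answer is to keep a single 32-bit word and expose its fields through shifts and masks. The sketch below is only an assumption about what such a class might look like. The field layout (5 offset bits, 10 index bits, remaining bits as tag) and the method names are hypothetical and not taken from the simulator.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical packed address: [ tag | set index | line offset ] in one 32-bit word.
class Address {
public:
    static constexpr unsigned kOffsetBits = 5;   // assumed 32-byte cache lines
    static constexpr unsigned kIndexBits  = 10;  // assumed 1024 cache sets

    explicit constexpr Address(std::uint32_t raw) : raw_(raw) {}

    // Builds an address from its three fields; each field must fit its bit width.
    static Address pack(std::uint32_t tag, std::uint32_t index, std::uint32_t offset) {
        assert(offset <= mask(kOffsetBits));
        assert(index <= mask(kIndexBits));
        assert(tag <= mask(32 - kOffsetBits - kIndexBits));
        return Address((tag << (kOffsetBits + kIndexBits)) | (index << kOffsetBits) | offset);
    }

    constexpr std::uint32_t raw() const { return raw_; }
    constexpr std::uint32_t offset() const { return raw_ & mask(kOffsetBits); }
    constexpr std::uint32_t index() const { return (raw_ >> kOffsetBits) & mask(kIndexBits); }
    constexpr std::uint32_t tag() const { return raw_ >> (kOffsetBits + kIndexBits); }
    // Address of the first byte of the cache line containing this address.
    constexpr std::uint32_t lineAddress() const { return raw_ & ~mask(kOffsetBits); }

private:
    // Returns a word with the lowest `bits` bits set (bits must be < 32).
    static constexpr std::uint32_t mask(unsigned bits) {
        return (std::uint32_t{1} << bits) - 1u;
    }

    std::uint32_t raw_;
};
```

Explicit shifts and masks are used here instead of C++ bit-fields because the layout of bit-fields is implementation-defined, which matters if the raw word is ever compared against addresses produced by the emulated CPU.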

Important tasks:
- Create & run benchmarks
    -> Several matmuls with matrices of different sizes
    -> Run benchmarks with all kinds of different settings
- Take stats from sim* variables and average them over tasks (especially: Cache, MMU, Router)
- Print screenshots of simulation stats to file
    -> Visualize results using Matlab
- Do not send a request to memory on L2 miss, if it is already being requested?
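The last task above (not sending a second memory request for a line that is already being fetched) could be handled with a small table of in-flight misses, similar in spirit to the miss status holding registers found in real caches. This is a sketch under assumptions: `PendingMissTable`, `onMiss`, and `onFill` are hypothetical names, and the simulator may track requests differently.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical table of L2 misses that are currently being served by memory.
class PendingMissTable {
public:
    // Records that coreId missed on lineAddr.
    // Returns true if a new memory request must be sent,
    // false if a request for this line is already in flight.
    bool onMiss(std::uint32_t lineAddr, int coreId) {
        std::vector<int>& waiters = pending_[lineAddr];
        const bool firstMiss = waiters.empty();
        waiters.push_back(coreId);
        return firstMiss;
    }

    // Called when memory delivers lineAddr. Returns every core that was
    // waiting for the line and removes the entry from the table.
    std::vector<int> onFill(std::uint32_t lineAddr) {
        auto it = pending_.find(lineAddr);
        assert(it != pending_.end());
        std::vector<int> waiters = std::move(it->second);
        pending_.erase(it);
        return waiters;
    }

    // Number of distinct lines currently being fetched.
    std::size_t inFlight() const { return pending_.size(); }

private:
    std::unordered_map<std::uint32_t, std::vector<int>> pending_;
};
```

The L2 cache would call `onMiss` on every miss and forward the request to memory only when it returns true. On `onFill`, the returned core ids tell the cache whom to answer.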

Less important:

Extras:
- Reduce congestion by batching multiple requests to the same line into a single packet
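The batching idea can be sketched as grouping outstanding requests by cache line before they are handed to the router. The types `Request` and `Packet` and the function `batchByLine` are hypothetical, and the line size is passed in as a number of offset bits because the actual line size is not stated here.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <utility>
#include <vector>

// Hypothetical request issued by a core for a byte address.
struct Request {
    int coreId;
    std::uint32_t address;
};

// Hypothetical router packet covering one cache line and all cores asking for it.
struct Packet {
    std::uint32_t lineAddr;
    std::vector<int> coreIds;
};

// Groups requests that hit the same cache line into one packet each.
// Packets are returned in ascending order of line address.
inline std::vector<Packet> batchByLine(const std::vector<Request>& requests,
                                       unsigned offsetBits) {
    assert(offsetBits < 32);
    const std::uint32_t lineMask = ~((std::uint32_t{1} << offsetBits) - 1u);

    std::map<std::uint32_t, std::vector<int>> byLine;
    for (const Request& r : requests) {
        byLine[r.address & lineMask].push_back(r.coreId);
    }

    std::vector<Packet> packets;
    for (auto& entry : byLine) {
        packets.push_back(Packet{entry.first, std::move(entry.second)});
    }
    return packets;
}
```

With 32-byte lines (`offsetBits = 5`), requests for addresses 0x104 and 0x11C end up in the same packet, so the router carries one packet instead of two.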

Far-away extras:
    - Add associativity to caches & good cache replacement algorithm
    - Broadcasts
    - Cache coherence
    - Allow for dynamic Core Block sizes
        - Scheduler must balance core block sizes against capacity miss rates
    - Produce a compiler that understands the new paradigm annotations and can automatically instrument the code with placeholders

Comparison to proposal:
- No broadcast
- No cache coherence
- No __syncthreads() instruction
- Only simulate load, not store
