2 issues here #277

eric1hello · 2023-04-27T05:45:43Z

I found 2 issues here:
---- shader.cc
I think the pI1 shall be pI2.
if ((pI1->oprnd_type == INT_OP) || (pI1->oprnd_type == UN_OP)) { //these counters get added up in mcPat to compute scheduler power
m_stats->m_num_INTdecoded_insn[m_sid]++;
---- I tried to enable cooperative_groups in bellow cuda, but seem it doesn't work , something issue with PTX, do you know the reasons?

device int atomicAggInc(int *ptr) {
auto g = cg::coalesced_threads();
int prev;

if (g.thread_rank() == 0)
prev = atomicAdd(ptr,g.size());

prev = g.thread_rank() + g.shfl(prev,0);

return prev;

}

global void vectorAdd(float *A, const float *B, float *C , int numElements) {

int i = blockDim.x * blockIdx.x + threadIdx.x;

//if (i < numElements) {
// C[i] = A[i] + B[i];
//}
if ( i%10 == 0){
int rankIdx = atomicAggInc(&count);
printf ("blockIdx = %d, threadIdx = %d, rank = %d \n",blockIdx.x ,threadIdx.x,rankIdx);
}
}

do not truncate 32 MSB bits of the memory address

…tion into HEAD

bug fix was_writeback_sent()

Fix cache hash function and renaming

…tion into HEAD

…bution into sub_core_devel

Purdue Updates Merging

Sims should work with latest CUDA

Throwing an error on updating CUDA is a bit much. Let's warn them

Turning off the format code for now. Currently, set up to run on an internal cluster. Needs testing on docker.

* added support to cuda 12, by predicating texuture cache * format code

Support for CUDA 12.x and Ubuntu 24

Deleting old files used by Jenkins

* fix_cache_string: update cache config help text * Automated Format --------- Co-authored-by: purdue-jenkins <purdue-jenkins@users.noreply.github.com> Co-authored-by: Tim Rogers <timrogers@gmail.com>

… simulation to work with CUDA 12. (#95) * Fixing the formatter to always use a consistent format and running it on the codebase * Update linux-so-version.txt * Update Makefile * A couple of unnecessary files that are lingering around * Support CUDA 12 * Getting the PTX simulations to work with CUDA 12. The issue is that ptxas added more information (number of barriers and compile time). We have to parse these or lexx/yacc fail. * Update ptxinfo.l debug MACRO was ineffective * Update gpgpusim_check.cmake Update to make the CUDA version print a warning, not an error and updating the print to be more reflective of what the actual problem is.

* fix_cache_string: update cache desc in config files and remove typos * fix_cache_string: update gitignore

* Gcc13 support (#87) * Update setup_environment Sims should work with latest CUDA * Update setup_environment Throwing an error on updating CUDA is a bit much. Let's warn them * Update main.yml Turning off the format code for now. Currently, set up to run on an internal cluster. Needs testing on docker. * fix gcc13 unit64 missing header --------- Co-authored-by: Tor Aamodt <aamodt@ece.ubc.ca> Co-authored-by: Tim Rogers <timrogers@gmail.com> * Cuda12 support (#86) * Update setup_environment Sims should work with latest CUDA * Update setup_environment Throwing an error on updating CUDA is a bit much. Let's warn them * Update main.yml Turning off the format code for now. Currently, set up to run on an internal cluster. Needs testing on docker. * added support to cuda 12, by predicating texuture cache * format code --------- Co-authored-by: Tor Aamodt <aamodt@ece.ubc.ca> Co-authored-by: Tim Rogers <timrogers@gmail.com> * Changed to use the new image * merge upstream (#88) * Update setup_environment Sims should work with latest CUDA * Update setup_environment Throwing an error on updating CUDA is a bit much. Let's warn them * Update main.yml Turning off the format code for now. Currently, set up to run on an internal cluster. Needs testing on docker. --------- Co-authored-by: Tor Aamodt <aamodt@ece.ubc.ca> * Updated docker image * Update CMakeLists.txt Support CUDA 12 --------- Co-authored-by: Ahmad Alawneh <Lahmos4@gmail.com> Co-authored-by: Tor Aamodt <aamodt@ece.ubc.ca> Co-authored-by: Tim Rogers <timrogers@gmail.com> Co-authored-by: Ni Kang <kang222@tgrogers-gpu01.ecn.purdue.edu>

* sst-integration-stream: add apis to make SST integration works with stream * Add dev container specs

Signed-off-by: Athishpranav2003 <athishanna@gmail.com> Co-authored-by: Anu <anallat@purdue.edu> Co-authored-by: WilliamMTK <China_Aisa@live.com>

* update_readme: add CMake, devcontainer, and SST * update_readme: fix branch for sst-elements * update_readme: fix typos

GPGPU-Sim needs libGL.so or the link fails.

)

* Update the AccelSim test script to target the repo specified by the user. * Change the warning for a missing ACCELSIM_REPO environment variable to an error and update workflow file to point to a temperory fix for circular dependency issue * fix the repo name * Change back to use dev branch

* performance inprovements * use node_id before incremented * Cleanup iSLIP * run set_dram_power_stats only when power model enabled --------- Co-authored-by: WilliamMTK <China_Aisa@live.com>

* add a100 conf * change l1 write ratio

* i_love_zsh: make setup script universal for other shells * Update setup_environment Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Update setup_environment Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

* update_ci: use test matrix * update_ci: add updated test script and better name * update_ci: use minimal image for accelsim test as well * update_ci: use github image for sst run * update_ci: revert main yaml

tgrogers and others added 30 commits January 27, 2021 11:19

Merge pull request #5 from allencho1222/patch-2

e6b0608

do not truncate 32 MSB bits of the memory address

Merge branch 'dev' of https://github.com/accel-sim/gpgpu-sim_distribu…

5ac0b60

…tion into HEAD

bug fix was_writeback_sent

f3a0077

Merge pull request #7 from JRPan/fix-was_writeback_sent

67f89ab

bug fix was_writeback_sent()

fix hash funciton

51d9925

Merge pull request #9 from JRPan/fix-cache-hash

2f96645

Fix cache hash function and renaming

adding new RTX 3070 config

b430b36

Merge branch 'dev' of https://github.com/accel-sim/gpgpu-sim_distribu…

deb5eb5

…tion into HEAD

change the L1 cache policy to be on-miss based on recent ubench

09f10eb

change the L1 cache policy based on recent ubench

1ee03f0

parition CU allocation, add prints

5533464

minor fixes

645a0ea

useful print statement

46423a2

validated collector unit partitioning based on scheduler

b672880

sub core model dispatches only to assigned exec pipelines

fa76ab4

minor fix accessing du

c905726

fix find_ready reg_id

a72b84e

dont need du id

6ad5bad

remove prints

9219236

need at least 1 cu per sched for sub_core model, fix find_ready() reg_id

52a890c

move reg_id calc to cu object init

2db9120

fix assert

4825a1d

clean up redundant method args

e2b410d

more cleanup

9c0156b

cleanup find_ready

28c3c94

partition issue() in the shader execute stage

28d0565

Merge branch 'sub_core_devel' of github.com:barnes88/gpgpu-sim_distri…

08ad045

…bution into sub_core_devel

minor fixes, pure virtual calls

ec55c68

add prints for ex issue validation

71455d8

issue function needed to be constrained

640674b

aamodt and others added 6 commits February 10, 2025 17:35

Merge pull request gpgpu-sim#313 from accel-sim/dev

0e39753

Purdue Updates Merging

Deleting old files used by Jenkins

1842711

Update setup_environment

47ff869

Sims should work with latest CUDA

Update setup_environment

aa505b8

Throwing an error on updating CUDA is a bit much. Let's warn them

Update main.yml

68e1cd3

Turning off the format code for now. Currently, set up to run on an internal cluster. Needs testing on docker.

Merge branch 'dev' into dev

182cc28

tgrogers force-pushed the dev branch from a34057f to 48af0c9 Compare February 14, 2025 22:40

LAhmos and others added 23 commits February 14, 2025 23:47

Cuda12 (#94)

c3966b6

* added support to cuda 12, by predicating texuture cache * format code

fix gcc13 unit64 missing header (#93)

6658752

Merge pull request gpgpu-sim#316 from accel-sim/dev

8172d40

Support for CUDA 12.x and Ubuntu 24

Merge pull request gpgpu-sim#314 from tgrogers/dev

a4ce3fe

Deleting old files used by Jenkins

fix_cache_string: update cache config help text (#76)

7934dfe

* fix_cache_string: update cache config help text * Automated Format --------- Co-authored-by: purdue-jenkins <purdue-jenkins@users.noreply.github.com> Co-authored-by: Tim Rogers <timrogers@gmail.com>

Fix cache string description in config files (#96)

360d856

* fix_cache_string: update cache desc in config files and remove typos * fix_cache_string: update gitignore

fix_cudaStreamSynchronize: fix the issue (#100)

63e2548

sst-integration-stream: make SST integration works with streams (#103)

6ab2ca4

* sst-integration-stream: add apis to make SST integration works with stream * Add dev container specs

Corrected offset for args in printf (#84)

e36aff2

Signed-off-by: Athishpranav2003 <athishanna@gmail.com> Co-authored-by: Anu <anallat@purdue.edu> Co-authored-by: WilliamMTK <China_Aisa@live.com>

update_readme: add CMake, devcontainer, and SST documentation (#108)

102cd4d

* update_readme: add CMake, devcontainer, and SST * update_readme: fix branch for sst-elements * update_readme: fix typos

Adding a check for OpenGL (#111)

b7dc8ba

GPGPU-Sim needs libGL.so or the link fails.

adding a function to check active status of a thread (#112)

363fe2c

change function name (#113)

8124cbb

fix a bug , we inc number of inst without checking if inst is null (#114

1c6cd9b

)

Performance improvements (#67)

bc268aa

* performance inprovements * use node_id before incremented * Cleanup iSLIP * run set_dram_power_stats only when power model enabled --------- Co-authored-by: WilliamMTK <China_Aisa@live.com>

add a100 conf (#120)

7d3ad23

* add a100 conf * change l1 write ratio

rename conf (#123)

6b22d66

Update ci (#119)

67d8a3b

* update_ci: use test matrix * update_ci: add updated test script and better name * update_ci: use minimal image for accelsim test as well * update_ci: use github image for sst run * update_ci: revert main yaml

add v2 of device proprites (#124)

9d88a1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

2 issues here #277

2 issues here #277

Uh oh!

eric1hello commented Apr 27, 2023

Uh oh!

Uh oh!

2 issues here #277

Are you sure you want to change the base?

2 issues here #277

Uh oh!

Conversation

eric1hello commented Apr 27, 2023

Uh oh!

Uh oh!