1
Fork 0
mirror of https://github.com/Steffo99/unimore-hpc-assignments.git synced 2024-11-22 16:14:24 +00:00
Commit graph

61 commits

Author SHA1 Message Date
a2a070bb3a
Configure makefile to use nvcc 2022-11-28 14:37:37 +01:00
be3a4ec301
Update README 2022-11-28 14:30:41 +01:00
140c40bf6c
Improve README 2022-11-17 20:59:38 +01:00
28479dfb4b
Cleanup comments 2022-11-17 19:53:46 +01:00
44fe50bd4a
Reduce EXTRALARGE_DATASET to 12000 2022-11-17 19:53:39 +01:00
bffa050239
Merge branch 'master' of github.com:Steffo99/unimore-hpc-1 2022-11-17 19:14:17 +01:00
097efddbe3
Remove -g3 CFLAG 2022-11-17 19:13:26 +01:00
Fabio Zanichelli
0ba75336e6
Spostato polybench_start per cronometrare anche le inizializzazioni 2022-11-17 17:33:59 +01:00
a86d078546
Run bench with all dataset sizes 2022-11-17 14:58:32 +01:00
c91361ba88
Add indicator of progress for single runs 2022-11-17 03:02:20 +01:00
7cd8707bb9
Make some optimizations toggleable, so results can be compared easily 2022-11-17 02:59:31 +01:00
60a061991b
main: Remove commented duplicate polybench_start_instruments 2022-11-17 02:07:57 +01:00
20e653ea70
Fix EXTRALARGE_DATASET so it does not overflow anymore 2022-11-17 02:03:18 +01:00
e49e89817d
kernel_atax: Format for loops 2022-11-17 01:59:27 +01:00
1850c42a9f
kernel_atax: Remove nested parallelization
Seems to improve the execution time on my PC

0.0340s → 0.0317s
2022-11-17 01:59:09 +01:00
a16813dc01
main: Remove blank lines 2022-11-17 01:57:25 +01:00
0eb63cb684
init_array: Format second for loop 2022-11-17 01:57:13 +01:00
ac1ec275d7
print_array: Format for loop 2022-11-17 01:56:52 +01:00
c5c79e00c4
print_array: Add (obvious) comment 2022-11-17 01:56:27 +01:00
1e45a5adca
print_array: Remove newline after every 20 elements
The terminal will handle wrapping as necessary.
2022-11-17 01:56:08 +01:00
a23fd895e9
init_array: Parallelize the second loop
The performance hit is gone?

0.0437s → 0.0342s
2022-11-17 01:54:13 +01:00
FABIO ZANICHELLI
9dc24a3367 Aggiunta una reduction (al momento fa poco, magari con acceleratore va meglio), tolto un *4 perche Jetson ha 4 core CPU) 2022-11-16 15:01:58 -05:00
d89c501b59
kernel_atax: Parallelizing the second loop gives a nice speedup 2022-11-16 18:05:12 +01:00
9c153bb89f
Hide insignificant digits in the bench target 2022-11-16 18:03:24 +01:00
c104caa1a6
Use THREAD_COUNT instead of a fixed amount of threads 2022-11-16 17:51:27 +01:00
cc23d73254
Update CONTRIBUTING with the bench target 2022-11-16 17:44:49 +01:00
e23d565fd2
Create bench target for calculating the average of 9 runs 2022-11-16 17:39:09 +01:00
8cb741d24d
Create bench target 2022-11-16 17:09:27 +01:00
f655df0eb7
Make the compilation silent so only the execution time is output 2022-11-16 17:06:16 +01:00
cf79cb0924
Improve makefile flags 2022-11-16 17:04:50 +01:00
7fc2506cc7
kernel_atax: Remove useless tmp array, use a local variable instead
Local improvement: 30ms

Co-authored-by: Caterina Gazzotti <gazzotti.caterina@gmail.com>
2022-11-16 14:21:57 +01:00
aefa038d44
Revert "Aggiunge due for parallelizzati"
This reverts commit 5b8b4f84b2.
2022-11-16 14:15:09 +01:00
Gattopandacorno
5b8b4f84b2 aggiunge due for parallelizzati 2022-11-16 11:09:30 +01:00
d45d94ca91
Remove -DEXTRALARGE_DATASET from CLion's runConfigs as it was moved into the Makefile 2022-11-16 01:45:15 +01:00
0a91ad019a
Add CFLAGS to the Makefile 2022-11-16 01:43:59 +01:00
e891286291
kernel_atax: Add comments to the second iteration 2022-11-16 01:43:49 +01:00
57fe10ad6a
kernel_atax: Try parallelizing the first cycle
Doesn't seem to create a big difference
2022-11-16 01:43:22 +01:00
d5664a9393
print_array: Improve wording 2022-11-16 01:42:53 +01:00
18466da754
init_array: Leave a comment with my experiments on the second iteration 2022-11-16 01:42:37 +01:00
7fcff5d127
init_array: Actually, do optimize the first iteration 2022-11-16 01:41:29 +01:00
00f16cecaf
Configure code formatting which broke again 2022-11-15 21:44:00 +01:00
f48d351890
init_array: Do not optimize, costs are greater than benefits 2022-11-15 19:46:23 +01:00
7ae893d977
Make CLion formatting work 2022-11-15 19:40:17 +01:00
0f4cfd84a4
Rename runConfiguration to atax_acc 2022-11-14 16:22:47 +01:00
f574c15bed
Use a 3-indent in Markdown 2022-11-14 16:22:34 +01:00
0306e6f912
Disable spellchecker 2022-11-14 16:22:22 +01:00
f09170a23e
Create contributors' documentation 2022-11-14 16:22:11 +01:00
975cdc1360
Configure CLion project style for the project 2022-11-14 15:48:44 +01:00
626796e988
Update CLion runConfigurations 2022-11-14 15:46:42 +01:00
1c834ba294
Delete blank line 2022-11-14 15:40:03 +01:00