|
20e653ea70
|
Fix EXTRALARGE_DATASET so it does not overflow anymore
|
2022-11-17 02:03:18 +01:00 |
|
|
e49e89817d
|
kernel_atax : Format for loops
|
2022-11-17 01:59:27 +01:00 |
|
|
1850c42a9f
|
kernel_atax : Remove nested parallelization
Seems to improve the execution time on my PC
0.0340s → 0.0317s
|
2022-11-17 01:59:09 +01:00 |
|
|
a16813dc01
|
main : Remove blank lines
|
2022-11-17 01:57:25 +01:00 |
|
|
0eb63cb684
|
init_array : Format second for loop
|
2022-11-17 01:57:13 +01:00 |
|
|
ac1ec275d7
|
print_array : Format for loop
|
2022-11-17 01:56:52 +01:00 |
|
|
c5c79e00c4
|
print_array : Add (obvious) comment
|
2022-11-17 01:56:27 +01:00 |
|
|
1e45a5adca
|
print_array : Remove newline after every 20 elements
The terminal will handle wrapping as necessary.
|
2022-11-17 01:56:08 +01:00 |
|
|
a23fd895e9
|
init_array : Parallelize the second loop
The performance hit is gone?
0.0437s → 0.0342s
|
2022-11-17 01:54:13 +01:00 |
|
FABIO ZANICHELLI
|
9dc24a3367
|
Aggiunta una reduction (al momento fa poco, magari con acceleratore va meglio), tolto un *4 perche Jetson ha 4 core CPU)
|
2022-11-16 15:01:58 -05:00 |
|
|
d89c501b59
|
kernel_atax : Parallelizing the second loop gives a nice speedup
|
2022-11-16 18:05:12 +01:00 |
|
|
9c153bb89f
|
Hide insignificant digits in the bench target
|
2022-11-16 18:03:24 +01:00 |
|
|
c104caa1a6
|
Use THREAD_COUNT instead of a fixed amount of threads
|
2022-11-16 17:51:27 +01:00 |
|
|
e23d565fd2
|
Create bench target for calculating the average of 9 runs
|
2022-11-16 17:39:09 +01:00 |
|
|
8cb741d24d
|
Create bench target
|
2022-11-16 17:09:27 +01:00 |
|
|
f655df0eb7
|
Make the compilation silent so only the execution time is output
|
2022-11-16 17:06:16 +01:00 |
|
|
cf79cb0924
|
Improve makefile flags
|
2022-11-16 17:04:50 +01:00 |
|
|
7fc2506cc7
|
kernel_atax : Remove useless tmp array, use a local variable instead
Local improvement: 30ms
Co-authored-by: Caterina Gazzotti <gazzotti.caterina@gmail.com>
|
2022-11-16 14:21:57 +01:00 |
|
|
aefa038d44
|
Revert "Aggiunge due for parallelizzati"
This reverts commit 5b8b4f84b2 .
|
2022-11-16 14:15:09 +01:00 |
|
Gattopandacorno
|
5b8b4f84b2
|
aggiunge due for parallelizzati
|
2022-11-16 11:09:30 +01:00 |
|
|
0a91ad019a
|
Add CFLAGS to the Makefile
|
2022-11-16 01:43:59 +01:00 |
|
|
e891286291
|
kernel_atax : Add comments to the second iteration
|
2022-11-16 01:43:49 +01:00 |
|
|
57fe10ad6a
|
kernel_atax : Try parallelizing the first cycle
Doesn't seem to create a big difference
|
2022-11-16 01:43:22 +01:00 |
|
|
d5664a9393
|
print_array : Improve wording
|
2022-11-16 01:42:53 +01:00 |
|
|
18466da754
|
init_array : Leave a comment with my experiments on the second iteration
|
2022-11-16 01:42:37 +01:00 |
|
|
7fcff5d127
|
init_array : Actually, do optimize the first iteration
|
2022-11-16 01:41:29 +01:00 |
|
|
f48d351890
|
init_array : Do not optimize, costs are greater than benefits
|
2022-11-15 19:46:23 +01:00 |
|
|
1c834ba294
|
Delete blank line
|
2022-11-14 15:40:03 +01:00 |
|
|
05bc63875b
|
Mark code in the README as such
|
2022-11-13 18:15:16 +01:00 |
|
|
fe13cbb936
|
Delete obsolete README
|
2022-11-13 18:13:35 +01:00 |
|
|
872c012e75
|
Improve inner README
|
2022-11-13 18:13:23 +01:00 |
|
|
a8d181c508
|
Fix inner README
|
2022-11-13 18:11:53 +01:00 |
|
|
46dedaef45
|
Start commenting some lines
Co-authored-by: Caterina Gazzotti <gazzotti.caterina@gmail.com>
Co-authored-by: Fabio Zanichelli <274956@studenti.unimore.it>
|
2022-11-11 16:47:48 +00:00 |
|
|
5bbe60d482
|
Create workaround for unavailable M_PI
|
2022-11-11 16:42:39 +00:00 |
|
Alessandro Capotondi
|
e11b42a518
|
init commit
|
2022-11-11 13:23:45 +01:00 |
|