1
Fork 0
mirror of https://github.com/Steffo99/unimore-hpc-assignments.git synced 2024-11-22 08:04:25 +00:00
Progetti svolti di laboratorio di High Performance Computing
Find a file
2022-12-12 00:37:57 +01:00
.idea Get everything to work 2022-12-02 00:15:33 +01:00
.vscode cambiato path nvcc e commentate ottimizzazioni di steffo nel makefile per usare jetson nano 2022-12-08 09:56:54 -05:00
atax Merge branch '2-cuda' of https://github.com/Steffo99/unimore-hpc-assignments into 2-cuda 2022-12-11 09:50:22 -05:00
.editorconfig Get everything to work 2022-12-02 00:15:33 +01:00
.gitignore Add elf executale to gitignore 2022-11-29 08:24:23 -05:00
CONTRIBUTING.md Add a note 2022-12-06 09:39:11 -05:00
ex.txt Add the old experiments 2022-12-06 09:37:44 -05:00
README.md Change the results and validation part 2022-12-06 09:37:18 -05:00

 **Stefano Pigozzi** + **Caterina Gazzotti** + **Fabio Zanichelli** | Topic OpenMP | High Performance Computing Laboratory | Unimore 

C code optimization using NVIDIA CUDA

Assignment #2

Every team is called to optimize (parallellize) the execution time of the assigned applications on multi-processor system.

Expected outcomes

  • Repository of the code (github/gitlab is ok, or .zip )
  • Oral presentation (5 min + 5 min Q&A) of your work

Assigned application

Group 3: OpenMP/linear-algebra/kernels/atax

Results

Results can be read in the ex.txt file where we stored all the experiments done.

Validation

  • Compiler used: nvcc
  • Device used: JETSON NANO DEVELOPER KIT
  • Built on: Mon_Mar_11_22:13:24_CDT_2019 Cuda compilation tools, release 10.0, V10.0.326

To reproduce the obtained results:

  1. Clone the repository on @Steffo99's computer:

    $ git clone https://github.com/Steffo99/unimore-hpc-assignments
    
  2. Checkout the exact commit the tests were executed on:

    $ git checkout d13a9b786a53d5195ae17ef7afa776e2600ce8e0
    
  3. Access our group's assigned folder:

    $ cd unimore-hpc-assignments/atax
    
  4. Run the benchmarking script:

    $ make bench