Optimizing vectorized array operations

PierU · March 20, 2025, 9:44pm

The reason in this MRE is actually obvious …

At each iteration of the outer loop on i:

In the fast version, the inner loop on j is executed only N_samples=1000 times, which means that only 1000 elements of pack_tot are updated.
In the slow version all the N_grid_pack=800000 elements of pack_tot are updated (and the whole line_pack is set to zero, although you really need only 1000 elements).

Topic		Replies	Views
Fortran: Array Language (video) Advocacy	20	1036	February 3, 2024
Performance impact of how a large array is accessed Help	49	1912	June 4, 2023
Will using Vectorization speed up the program? Help	22	1884	November 26, 2023
Implied-do array constructor, type specs, and differences between GFortran, Intel, and LFortran Help	49	1301	May 26, 2024
Fortran is dead – Long live Fortran! Tutorials	9	859	June 27, 2025