Simple summation 8x slower than in Julia

mohoree · November 24, 2021, 9:34am

Sorry to bump up this old thread. I recently ran into very slow loops (slower than the equivalent ones in MATLAB) with 500k or so iterations which used cosh, cos/sin in each iteration. Replacing these by the Intel mkl vml routines lead to a 5X speed up. My processor only has avx2 instructions and I think if I try it on a processor with avx2-512 instruction set, this can be slashed even further. I use intel oneAPI under Linux.

Topic		Replies	Views
How to use IFX and offload openMP to GPU?	0	1612	April 2, 2022
Julia: Fast as Fortran, Beautiful as Python	184	11729	November 13, 2022
An interesting video: Python vs Fortran vs GNU Octave/MATLAB --- side by side performance comparison	5	1181	January 6, 2022
Comparing Fortran and Julia's Bessel function performance	69	4827	October 23, 2022
Improving Fortran Results in the Julia Micro-benchmarks Help	44	3994	June 23, 2022

Simple summation 8x slower than in Julia

Related topics