I get a 4X speedup for free just by putting @fastmath in front of g(i,N). This is partly because Julia computes x^4 with Base.power_by_squaring, which is more accurate but much slower than x*x*x*x. You can test this by replacing x^4 with x*x^3: in that case x is multiplied by an optimized x^3, which the Base.literal_pow algorithm expands to x*x*x. As a loyal Fortran devotee, I am still amazed by Julia's performance every day.
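A minimal sketch of the three variants discussed above (g and N from the original benchmark are not defined here; the helper names are mine, and timings will depend on your machine):

```julia
# Three ways to raise a Float64 to the fourth power:
pow4_default(x) = x^4           # literal exponent: the generic, more accurate power path
pow4_split(x)   = x * x^3       # x^3 is expanded to x*x*x by Base.literal_pow
pow4_fast(x)    = @fastmath x^4 # lets the compiler lower this to x*x*x*x

x = 1.2345
# All three agree to within a few ulps, but can differ in the last bits,
# which is why the @fastmath/avx results above end in slightly different digits.
pow4_default(x), pow4_split(x), pow4_fast(x)
```

Timing each variant inside the actual summation loop (e.g. with BenchmarkTools' @btime, as in the results below) is what exposes the 4X gap; a single call is too cheap to measure.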
Of course, I also tested the Intel compiler with the /fast flag for comparison and tried the x*x**3 trick there, but nothing changed the timings of the Fortran version. Here are my benchmark results.
Intel Fortran:
time = 3.8906250000000000
time = 3.9062500000000000
time = 3.9062500000000000
val = 0.42737032509713474
Julia 1.7.0-beta2:
loop 1.140 s (0 allocations: 0 bytes) 0.4273703250971348
fast 1.140 s (0 allocations: 0 bytes) 0.4273703250971348
avx 1.512 s (0 allocations: 0 bytes) 0.42737032509704814
avxt 388.076 ms (0 allocations: 0 bytes) 0.4273703250970827
simd 1.140 s (0 allocations: 0 bytes) 0.4273703250971348
sumiter 3.135 s (0 allocations: 0 bytes) 0.4273703250971348
mapreduce 3.135 s (0 allocations: 0 bytes) 0.4273703250970799
threadsx.mapreduce 412.653 ms (620 allocations: 44.83 KiB) 0.42737032509707623