This promotion was mainly associated with the PC hardware + 8087 registers, where all real calculations were performed in an 80-bit register. I do know that Lahey compilers promoted constants to 80 bit reals, and for 4-byte or 8-byte real calculations, you had the option of retaining the calculation value in the 80-bit register, or you could explicitly define temporary reals as real10 to guarantee precision was maintained. (With F77, it was difficult to explicitly define 80-bit real constants so the compiler helped with this)
There was also a compile option to not retain the accumulated computation in the register, but force it to the precision specified in the Fortran code. However, if you were wanting to target precision but no performance disadvantage, you could either hope the machine instructions maintained the value in the register, or define real10 values for the calculation. I always opted for targeting precision.
This became a problem when SIMD (MMX?) instructions became available and there was then a difficult choice between precision or performance, only exaserbated with SSE then AVX made performance a more dominant issue.
I have produced a simple example that clearly shows the change in error when calculating total time for 1,000,000 time steps of 0.001 seconds.
case 1: uses a real constant of 0.001 and gets an accumulated error of 4.57e-5
case 2 : uses a real constant of 0.001d0 and gets an accumulated error of 1.67e-8
case 3 : uses an 80-bit accumulator and gets an accumulated error of 9.09e-13
case 4 : uses an 80-bit accumulator and constant ( as f77/8087)
Although there are other precision problems with many very small time steps (where first and second differences quickly loose precision), the then errors of unexpected constants and real*8 accumulators was a real issue at the time.
This also appeared when converting from F77 to F90 compilers, where the accuracy of benchmark analyses deteriorated, again questioning the reliability of the new “buggy” F90 compilers that had reduced computational accuracy.
It was a loss of confidence in the new Fortran.
I don’t know why high precision accumulators have since been ignored as a hardware option, but unfortunately, Fortran users are not the main decision makers !
(making many core PC’s with inadequate memory bandwidth is another recent example)
integer, parameter :: dp = kind(1.0d0)
integer :: i, num_step = 1000000
real (dp) :: x, time_step = 0.001, end_time, time_step8 = .001_dp
real*10 :: x10, time_step10
x = 0
do i = 1,num_step
x = x + time_step
end do
end_time = x
write (*,*) num_step, end_time-1000, time_step, ' time_step = 0.001'
x = 0
do i = 1,num_step
x = x + time_step8
end do
end_time = x
write (*,*) num_step, end_time-1000, time_step8, ' time_step = 0.001_dp'
x10 = 0
do i = 1,num_step
x10 = x10 + time_step8
end do
end_time = x10
write (*,*) num_step, end_time-1000, time_step8, ' real*10 time'
x10 = 0
time_step10 = 1.
time_step10 = time_step10 / 1000
do i = 1,num_step
x10 = x10 + time_step10
end do
end_time = x10
write (*,*) num_step, end_time-1000, time_step10, ' time_step = 0.001_10'
end
! 1000000 4.749745130539E-05 1.000000047497E-03 time_step = 0.001
! 1000000 -1.673492988630E-08 1.000000000000E-03 time_step = 0.001_dp
! 1000000 9.094947017729E-13 1.000000000000E-03 real*10 time
! 1000000 7.958078640513E-13 1.00000000000000000E-03 time_step = 0.001_10```