Using Unicode Characters in Fortran

jacobwilliams · February 11, 2022, 4:07am

I’m not sure that some of this is good advice. It seems what you are doing here is akin to stuffing double precision reals into an array of single precision reals… It sort of works under some circumstances, but I wouldn’t recommend it. I think using the selected_char_kind('ISO_10646') is the correct way. See my JSON-Fortran library, which does support unicode. And yes, it isn’t currently supported by ifort (what gives, Intel?), and yes, you have to write multiple versions of routines (but that’s the same way you have to do for different real kinds, so it is not unexpected).

Consider this file (‘unicode.txt’):

And the following code:

program test

use iso_fortran_env

implicit none

integer,parameter :: CK = selected_char_kind('ISO_10646')

character(kind=CK,len=3) :: s
integer :: iunit

open(output_unit,encoding='utf-8')

open(newunit=iunit,file='unicode.txt',status='OLD',encoding='UTF-8')

read(iunit,'(A)') s

write(output_unit,*) s
write(output_unit,*) 'len(s) = ', len(s)
write(output_unit,*) 's(1:1) = ', s(1:1)

end program test

This prints:

😀😎😩
 len(s) =            3
 s(1:1) = 😀

So, notice how the length is 3 and the slicing works correctly.

But, I don’t think Fortran actually supports unicode in source files. For example, when I try to do this:

s = CK_'😀😎😩'

I get the warning “CHARACTER expression will be truncated in assignment (3/12) at (1) [-Wcharacter-truncation]” and s(1:1) will print as gibberish.

Topic		Replies	Views
Culture setting / inoculation against squiggles Help	35	1274	June 2, 2023
How do I file-read French special characters like é etc? Help	46	2414	January 22, 2024
How to use utf-8 in gfortran? Help	36	1156	October 3, 2025
Fortran Monthly Call: July 2020 Announcements	26	1952	July 17, 2020
Using Box drawing characters	7	1643	February 2, 2021

Using Unicode Characters in Fortran

Related topics