A follow-up paper is "Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation".
Overall, Copilot generates codes that are more reliable but less optimized, whereas codes generated by Llama-2 are less reliable but more optimized when correct.
A compact Llama-2 inference implementation was written in C (llama2.c), and there is a Fortran version.