Forum: Too Lazy BBS

Paper: Multi-Language Benchmark Generation via L-Systems

From John R Levine@johnl@taugh.com to comp.compilers on Mon Dec 22 22:56:52 2025

From Newsgroup: comp.compilers

The authors built a system that uses a production grammar to create
largish benchmark programs which they then used to evaluate compiler performance and (occasionally) look for bugs.

Abstract
L-systems are a mathematical formalism proposed by biologist Aristid Lindenmayer with the aim of simulating organic structures such as trees, snowflakes, flowers, and other branching phenomena. They are implemented
as a formal language that defines how patterns can be iteratively
rewritten. This paper describes how such a formalism can be used to create artificial programs written in programming languages such as C, C++, Julia
and Go. These programs, being large and complex, can be used to test the performance of compilers, operating systems, and computer architectures.
This paper demonstrates the usefulness of these benchmarks through
multiple case studies. These case studies include a comparison between
clang and gcc; a comparison between C, C++, Julia and Go; a study of the historical evolution of gcc in terms of code quality; a look into the
effects of profile guided optimizations in gcc; an analysis of the
asymptotic behavior of the different phases of clang's compilation
pipeline; and a comparison between the many data structures available in
the Gnome Library (GLib). These case studies demonstrate the benefits of
the L-System approach to create benchmarks, when compared with fuzzers
such as CSmith, which were designed to uncover bugs in compilers, rather
than evaluating their performance.

https://arxiv.org/abs/2512.17616

Regards,
John Levine, johnl@taugh.com, Taughannock Networks, Trumansburg NY
Please consider the environment before reading this e-mail. https://jl.ly
--- Synchronet 3.21a-Linux NewsLink 1.2

From Fernando Magno Quintao Pereira@pronesto@gmail.com to comp.compilers on Tue Dec 23 13:12:59 2025

From Newsgroup: comp.compilers

Hi John,

Thank you very much for posting about our paper on comp.compilers. We
truly appreciate it.

One of the goals of BenchGen is to support benchmarking for new
programming languages, as well as new compilers or interpreters for well-established languages. If anyone is interested in benchmarking a
new language, we would be very happy to help set it up within
BenchGen. The system currently supports languages with at least one
data structure. When data structures are not available, simple scalars
can be used instead. It also supports three basic forms of control
flow: function calls, if-then-else constructs, and loops.

Below are a few experiments we have conducted using BenchGen:

A comparison between C, C++, Julia, Go, Zig, V, and Odin: https://github.com/lac-dcc/BenchGen/wiki/Adding-a-New-Programming-Language-to-BenchGen

A comparison between gcc and clang: https://github.com/lac-dcc/BenchGen/wiki/Comparing-gcc-and-clang

A comparison between different versions of gcc: https://github.com/lac-dcc/BenchGen/wiki/Comparing-gcc-versions

A comparison between different GLib data structures: https://github.com/lac-dcc/BenchGen/wiki/Comparing-GLib-data-structures

A study of the impact of profile-guided optimizations: https://github.com/lac-dcc/BenchGen/wiki/Experiment-with-Profile-Guided-Optimization-(PGO)

An analysis of the asymptotic behavior of the Clang and GCC front end,
middle end, and back end: https://github.com/lac-dcc/BenchGen/wiki/Asymptotic-Behavior-of-CLANG-and-GCC-Compilers

Notice that BenchGen is still evolving, and we are open to changes in
the methodology to generate programs.

Best regards,

Fernando

On Tue, Dec 23, 2025 at 12:57rC>AM John R Levine <johnl@taugh.com> wrote:

The authors built a system that uses a production grammar to create
largish benchmark programs which they then used to evaluate compiler performance and (occasionally) look for bugs. ...

https://arxiv.org/abs/2512.17616

--- Synchronet 3.21a-Linux NewsLink 1.2

Who's Online

System Info

Sysop:	Amessyroom
Location:	Fayetteville, NC
Users:	59
Nodes:	6 (0 / 6)
Uptime:	00:10:03
Calls:	810
Files:	1,287
Messages:	196,862

Paper: Multi-Language Benchmark Generation via L-Systems

Who's Online

System Info