Uncovering Connections: The Burrows-Wheeler Transform and Beyond

Wednesday 26 March 2025


The Burrows-Wheeler transform (BWT) is a data compression algorithm that has been around for decades, but its combinatorial properties have only recently started to be fully understood. In a new paper, researchers Gabriele Fici and Estéban Gabory delve into the connections between BWT, generalized de Bruijn words, and invertible necklaces.


For those unfamiliar with these terms, let’s start with the basics. The Burrows-Wheeler transform is an algorithm used to compress text data by rearranging it in a specific way that takes advantage of its statistical properties. It’s commonly used in lossless compression algorithms, such as gzip and zip. De Bruijn words are sequences of symbols that have a specific property: each symbol appears exactly once in the sequence.


Invertible necklaces, on the other hand, are circular sequences of symbols where every symbol appears an even number of times. The researchers have discovered that there is a deep connection between these two concepts and the Burrows-Wheeler transform.


The paper shows that the BWT can be used to create generalized de Bruijn words over any alphabet size, not just binary (0s and 1s). This means that the algorithm can be applied to compress data with larger alphabets, such as those used in languages other than English. The researchers also demonstrate a connection between invertible necklaces and the BWT, showing that there is a one-to-one correspondence between the two.


This connection has significant implications for data compression. By leveraging the properties of generalized de Bruijn words and invertible necklaces, the BWT can be used to create more efficient compression algorithms. This could lead to faster compression times and smaller compressed file sizes.


The researchers also explore the connections between these concepts and other areas of mathematics, such as group theory and combinatorics. They show that the sandpile groups of generalized de Bruijn graphs are isomorphic to certain Reutenauer groups, which have applications in computer science.


While this may seem like a niche area of research, it has far-reaching implications for data compression and storage. As our reliance on digital data continues to grow, efficient compression algorithms will become increasingly important. The connections between the BWT, generalized de Bruijn words, and invertible necklaces are just one piece of the puzzle in this ongoing quest for more efficient data compression.


Cite this article: “Uncovering Connections: The Burrows-Wheeler Transform and Beyond”, The Science Archive, 2025.


Data Compression, Burrows-Wheeler Transform, Generalized De Bruijn Words, Invertible Necklaces, Lossless Compression, Group Theory, Combinatorics, Sandpile Groups, Reutenauer Groups, Computer Science


Reference: Gabriele Fici, Estéban Gabory, “Generalized De Bruijn Words, Invertible Necklaces, and the Burrows-Wheeler Transform” (2025).


Leave a Reply