Wavelet-based Expression Manipulation GAN: A Novel Approach to Facial Expression Editing

Sunday 02 February 2025


Facial expression editing has long been a topic of interest in computer vision and machine learning research. Recently, a new approach has emerged that uses wavelet transform techniques to improve facial expression manipulation. This innovative method, known as Wavelet-based Expression Manipulation GAN (WEM-GAN), has shown promising results in preserving personal identity information while editing facial expressions.


The WEM-GAN model is based on the U-Net backbone and incorporates a wavelet transform-based detail information transmission module. This module helps to address the issue of information loss when images are embedded in low-dimensional latent spaces, ensuring that the generated images retain more detail and realism. Additionally, the high-frequency component discriminator is used to further constrain the network model, resulting in even more accurate facial expression editing.


The WEM-GAN model has been tested on a range of datasets, including AffectNet, FFHQ, RAF-DB, and CelebA. The results show that the model can successfully manipulate facial expressions while preserving personal identity information. In fact, the model’s performance is comparable to state-of-the-art methods in terms of facial expression editing and image quality.


One of the key advantages of WEM-GAN is its ability to edit facial expressions in a continuous manner. This is achieved by using anatomically-aware facial action units (AUs) as the conditional vector for the generator. By manipulating these AUs, the model can produce a range of facial expressions from neutral to extreme, while still retaining the subject’s identity.


The WEM-GAN model also includes a residual network block with multiple fused relative AUs, which enhances its ability to edit facial expressions. This block allows the model to learn more complex patterns and relationships between different facial features, resulting in more realistic and detailed generated images.


In addition to its impressive performance, the WEM-GAN model is also efficient and scalable. The wavelet transform-based detail information transmission module allows for faster processing times, making it suitable for real-world applications where speed and efficiency are crucial.


Overall, the Wavelet-based Expression Manipulation GAN (WEM-GAN) represents a significant advancement in facial expression editing research. Its ability to preserve personal identity information while manipulating facial expressions makes it an attractive solution for a range of applications, from entertainment to healthcare. With its impressive performance, efficiency, and scalability, WEM-GAN is poised to revolutionize the field of facial expression manipulation.


Cite this article: “Wavelet-based Expression Manipulation GAN: A Novel Approach to Facial Expression Editing”, The Science Archive, 2025.


Facial Expression Editing, Wavelet Transform, Gan, Computer Vision, Machine Learning, U-Net Backbone, Detail Information Transmission, High-Frequency Component Discriminator, Facial Action Units, Anatomically-Aware Facial Features


Reference: Dongya Sun, Yunfei Hu, Xianzhe Zhang, Yingsong Hu, “WEM-GAN: Wavelet transform based facial expression manipulation” (2024).


Leave a Reply