Breakthrough in Artificial Intelligence: CHiP System Reduces Hallucinations

Saturday 15 March 2025


For years, scientists have struggled to create artificial intelligence that can understand and generate human-like language. One major challenge has been preventing AI models from producing false or misleading information – a phenomenon known as hallucination. Hallucinations occur when an AI model generates text based on its own biases and assumptions rather than actual facts.


Recently, researchers developed a new approach to tackle this issue. They created a system called Cross-Modal Hierarchical Direct Preference Optimization (CHiP), which combines multiple techniques to ensure that AI models generate accurate and relevant text.


At the core of CHiP is a novel preference optimization method. This method allows the model to learn from human preferences, rather than relying solely on its own internal calculations. The model can then use this knowledge to adjust its output and avoid generating hallucinations.


One key component of CHiP is its ability to optimize at multiple levels. Unlike traditional AI models that focus on a single level of detail, CHiP can capture relationships between images and text across varying granularities – from individual objects to entire scenes. This allows the model to generate more accurate and relevant text.


Another important feature of CHiP is its visual preference optimization module. This module enables the model to align semantic meanings between images and text, which helps to reduce hallucinations. By understanding the relationships between images and text, CHiP can produce more accurate and meaningful output.


Researchers tested CHiP on several datasets, including MMHal-Bench, a challenging dataset that requires AI models to generate text based on complex visual inputs. The results were impressive: CHiP significantly reduced hallucinations compared to traditional AI models.


One example of this success is in the generation of descriptions for images. Traditional AI models often produce inaccurate or misleading information when describing complex scenes. However, CHiP was able to capture the nuances of these scenes and generate accurate and relevant text.


The potential applications of CHiP are vast. For instance, it could be used to improve language translation systems, enabling machines to accurately understand and communicate with humans in different languages. It could also be applied to medical diagnosis, where AI models could generate accurate descriptions of patient symptoms and conditions.


While there is still much work to be done to refine CHiP, the results are promising. By addressing the issue of hallucinations, this technology has the potential to revolutionize the field of artificial intelligence and open up new possibilities for human-machine communication.


Cite this article: “Breakthrough in Artificial Intelligence: CHiP System Reduces Hallucinations”, The Science Archive, 2025.


Artificial Intelligence, Language Generation, Hallucination, Chip, Preference Optimization, Image Description, Text Generation, Natural Language Processing, Machine Learning, Computer Vision


Reference: Jinlan Fu, Shenzhen Huangfu, Hao Fei, Xiaoyu Shen, Bryan Hooi, Xipeng Qiu, See-Kiong Ng, “CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs” (2025).


Leave a Reply