Monday 03 March 2025
Multilingual knowledge graphs, which are complex networks of entities and their relationships, have become increasingly important in artificial intelligence and natural language processing. These graphs contain vast amounts of information, but they often lack completeness, especially in non-English languages.
Researchers have been working to address this issue by developing methods that can automatically generate missing information in these graphs. One approach is called Knowledge Graph Completion (KGC), which involves predicting missing relationships between entities. Another approach is Knowledge Graph Enhancement (KGE), which involves generating missing textual information for entities.
In the past, researchers have treated KGC and KGE as separate tasks, but a new study suggests that they are actually interdependent and can benefit from each other. The authors of this study propose a novel framework called KG-TRICK, which combines both KGC and KGE into a single sequence-to-sequence model.
The key innovation behind KG-TRICK is its ability to leverage textual information from multiple languages to improve the completeness of multilingual knowledge graphs. This is achieved by training the model on a large dataset of annotated entity descriptions in different languages.
To create this dataset, the researchers developed a user-friendly annotation interface that allowed human annotators to rate and suggest entity names, verify suggested names, curate descriptions for entities, and validate the quality of these descriptions. The resulting dataset, called WikiKGE-10++, contains over 25,000 entities across 10 languages.
The authors tested their KG-TRICK model on this dataset and found that it significantly outperformed previous state-of-the-art models in both KGC and KGE tasks. They also investigated the impact of balancing the training data between KGC and KGE tasks and found that using a combination of both datasets yielded the best results.
One potential application of KG-TRICK is in improving the performance of large language models, which often rely on knowledge graphs to retrieve accurate information. By generating more complete and accurate knowledge graphs, KG-TRICK could help improve the factuality and reliability of these models.
The study’s findings have significant implications for the development of multilingual artificial intelligence systems that can process and generate text in different languages. As language models continue to evolve, the ability to integrate them with vast amounts of structured information will be crucial for achieving human-like understanding and generation capabilities.
In practical terms, KG-TRICK could be used to improve the performance of machine translation systems, question answering algorithms, and other natural language processing applications that rely on knowledge graphs.
Cite this article: “Multilingual Knowledge Graph Completion with KG-TRICK”, The Science Archive, 2025.
Multilingual, Knowledge Graph, Completion, Enhancement, Sequence-To-Sequence, Textual Information, Languages, Annotation Interface, Wikikge-10++, State-Of-The-Art Models







