Predicting Disagreement Rankings in Word-in-Context Judgments

Thursday 23 January 2025


A team of researchers has made a significant breakthrough in understanding how people disagree when judging the meaning of words in different contexts. The study, published in a recent paper, presents an innovative approach to predicting disagreements between annotators and highlights the importance of robust embeddings and careful handling of judgment differences.


The research focuses on the CoMeDi Shared Task, which aims to predict mean disagreement rankings in word-in-context judgments. The task is challenging because it requires understanding how different people perceive the meaning of words in various situations. To tackle this problem, the researchers developed a system that combines sentence embeddings generated by a pre-trained multilingual transformer model with a deep neural network.


The system’s key component is a deep feedforward neural network that maps concatenated sentence embeddings to mean disagreement scores. The network consists of four fully connected layers, each followed by batch normalization and dropout layers to prevent overfitting. The researchers trained the model using the AdamW optimizer and a learning rate scheduler to reduce the risk of overfitting.


The team evaluated their system on a dataset containing samples from seven languages: Chinese, English, German, Norwegian, Russian, Spanish, and Swedish. The results showed that the system achieved competitive performance, ranking third among seven teams in the evaluation phase.


One of the key findings of the study is the importance of robust embeddings in predicting disagreement rankings. The researchers found that using a pre-trained multilingual transformer model as an embedding generator improved the accuracy of their system compared to other approaches. Additionally, the study highlights the need for careful handling of judgment differences between annotators, which can significantly impact the performance of the system.


The research has significant implications for natural language processing (NLP) and computational linguistics. It shows that by leveraging advanced multilingual embeddings and robust neural architectures, it is possible to predict disagreements between annotators with high accuracy. This knowledge can be used to improve the quality of NLP models and better understand how people perceive meaning in different contexts.


The study also highlights the importance of considering linguistic nuances and cultural differences when developing NLP systems. The researchers found that their system struggled with Latin-based languages like Spanish, which suggests that there may be specific challenges associated with capturing fine-grained word-use differences in these languages.


Overall, the research presents a significant advance in understanding how people disagree when judging the meaning of words in different contexts.


Cite this article: “Predicting Disagreement Rankings in Word-in-Context Judgments”, The Science Archive, 2025.


Natural Language Processing, Word Embeddings, Disagreement Prediction, Comedi Shared Task, Multilingual Transformer, Deep Neural Network, Robust Embeddings, Judgment Differences, Annotators, Linguistic Nuances


Reference: Phuoc Duong Huy Chu, “FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression” (2025).


Leave a Reply