Unifying Hate Speech Taxonomies: A Federated Approach to Detecting Online Abuse

Monday 07 April 2025


The challenge of detecting hate speech online is a daunting one, with algorithms struggling to keep pace with the ever-evolving landscape of online discourse. But researchers have made significant strides in recent years, and a new study offers a promising approach that could help improve detection rates.


The key innovation lies in integrating multiple datasets and taxonomies into a single framework, allowing for more accurate identification of hate speech across different platforms and languages. This unified approach draws on the collective wisdom of various annotation schemes, which have been developed independently by researchers, organizations, and companies to combat online abuse.


One of the primary issues with hate speech detection is the lack of standardization in how it’s defined and annotated. Different datasets may use distinct terminology or categorizations, making it difficult for algorithms to generalize across multiple sources. By integrating these various frameworks, the researchers behind this study have created a more comprehensive taxonomy that can be applied consistently across different platforms.


The taxonomy itself is organized into two main categories: hate speech and non-hate speech. Hate speech is further broken down into nine subcategories, including target of hate (class, immigration status, movement, national origin, physical attributes, race/ethnicity, religion/belief, and sexuality), as well as types of hate (animosity, dehumanization, derogation, support for hateful entities, and threatening language).


This granular categorization allows algorithms to better identify specific instances of hate speech, rather than simply relying on broad keywords or phrases. The approach also takes into account the nuances of different languages, ensuring that detection is not limited to a single linguistic context.


The study’s authors have validated their approach by combining two widely used datasets and achieving improved classification performance on an independent test set. This integration of diverse datasets and taxonomies has shown promise in enhancing hate speech detection, with potential applications across various online platforms and languages.


While this research represents a significant step forward, there is still much work to be done to ensure the effective identification and mitigation of online hate speech. The development of more advanced algorithms and the incorporation of additional data sources will likely continue to play critical roles in combating this complex issue.


Ultimately, the success of any approach will depend on its ability to balance the need for effective detection with the risk of false positives or over-censorship. As researchers and developers continue to refine their methods, it’s essential that we prioritize transparency, accountability, and community engagement to ensure that online discourse remains a vibrant and inclusive space for all users.


Cite this article: “Unifying Hate Speech Taxonomies: A Federated Approach to Detecting Online Abuse”, The Science Archive, 2025.


Hate Speech, Online Abuse, Detection Algorithms, Taxonomy, Annotation Schemes, Language, Platforms, Classification Performance, False Positives, Over-Censorship


Reference: Jan Fillies, Adrian Paschke, “Improving Hate Speech Classification with Cross-Taxonomy Dataset Integration” (2025).


Leave a Reply