Monday 03 February 2025
The quest for a more civil online discourse continues, and researchers have made another significant step forward in their efforts to combat hate speech on social media platforms. A recent study has developed a novel approach to identifying antisocial and prosocial behaviors in online conversations, which can help forecast the level of incivility that may occur in future discussions.
The team behind this research employed a unique combination of machine learning algorithms and natural language processing techniques to analyze a dataset of Reddit comments. They focused on replies to hate speech posts, which are often characterized by aggressive and offensive language. By fine-tuning deep neural networks on this data, the researchers were able to identify patterns that distinguish between antisocial and prosocial behaviors.
The study’s authors claim that their approach can accurately predict the level of incivility in future conversations following a reply to hate speech. This is achieved through the use of a weighted average of four metrics: offensive language, explicit hate speech, abusive language, and norm violations. These metrics are aggregated to produce an overall score indicating the likelihood of low, medium, or high incivility.
One of the key innovations in this research is the use of a novel dataset compiled from Reddit comments. This dataset includes replies to hate speech posts, which provides a unique perspective on how online conversations can escalate into incivility. The researchers also employed an off-the-shelf RoBERTa-base model, which has been pre-trained on a large corpus of text data and fine-tuned for their specific task.
The study’s results are encouraging, with the authors reporting high levels of accuracy in predicting incivility levels using their approach. They also experimented with alternative models, including FLAN-T5, which achieved similar results to RoBERTa. The researchers believe that their findings can be applied to other online platforms and languages, making it a promising tool for mitigating hate speech and promoting more civil discourse.
While this research is an important step forward in the fight against online incivility, there are still many challenges to overcome. For example, the dataset used in this study only includes Reddit comments, which may not generalize well to other platforms or languages. Additionally, the team’s approach relies on machine learning algorithms that can be biased if trained on biased data.
Despite these limitations, the researchers’ work offers a promising avenue for addressing online incivility and promoting more civil discourse.
Cite this article: “Predicting Incivility in Online Conversations: A Novel Approach to Identifying Antisocial Behaviors”, The Science Archive, 2025.
Hate Speech, Online Discourse, Machine Learning, Natural Language Processing, Antisocial Behavior, Prosocial Behavior, Incivility, Reddit Comments, Roberta-Base Model, Flan-T5







