Saturday 01 February 2025
The quest for a reliable and accurate method of predicting suicidal risk on social media has long been an elusive goal for researchers in the field of natural language processing (NLP). A recent study published in IEEE Big Data 2024 has shed new light on this issue, proposing a novel approach that leverages large language models to identify evidence of suicidality risk in online posts.
The researchers behind this study employed a unique methodology, dubbed Su-RoBERTa, which combines the power of pre-trained transformer models with contextual augmentation techniques. By fine-tuning these models using a small dataset of labeled examples, the team was able to develop an algorithm that can accurately predict suicidal risk levels from social media posts.
The authors’ approach is based on the idea that language models can be trained to recognize subtle patterns and cues in text data that may indicate suicidal ideation or behavior. By leveraging these models, the researchers were able to identify specific phrases, words, and sentence structures that are commonly associated with suicidal risk.
One of the key innovations behind Su-RoBERTa is its ability to handle out-of-vocabulary words and rare phrases, which are often present in social media posts. This is achieved through a process called contextual augmentation, which involves generating new examples of text data by modifying existing sentences or phrases.
The researchers evaluated their approach using a dataset of 100 labeled test samples, which were manually annotated by experts. The results showed that Su-RoBERTa outperformed traditional machine learning models in terms of accuracy and F1-score, with an impressive weighted F1 score of 69.84%.
The implications of this study are significant, as it provides a new avenue for developing AI-powered tools that can help identify individuals at risk of suicide on social media platforms. By leveraging the power of large language models and contextual augmentation techniques, researchers may be able to develop more accurate and effective methods for detecting suicidal risk.
In addition, the authors’ approach has broader implications for the field of NLP, as it demonstrates the potential of transformer-based models for handling complex text data and identifying subtle patterns. As the use of social media becomes increasingly widespread, the need for reliable methods of detecting suicidal risk will only continue to grow, making this study a crucial step forward in addressing this pressing issue.
Overall, the Su-RoBERTa approach offers a promising solution for predicting suicidal risk on social media, and its potential applications are vast.
Cite this article: “Predicting Suicidal Risk on Social Media with Large Language Models”, The Science Archive, 2025.
Suicidal Risk, Social Media, Language Models, Transformer Models, Contextual Augmentation, Su-Roberta, Natural Language Processing, Nlp, Suicidal Ideation, Machine Learning







