Machine-Generated Texts: A Study in Linguistic Differences

Sunday 02 February 2025


The age-old question of whether machines can truly create original content has been debated for years. Recently, a team of researchers set out to answer this question by analyzing the linguistic features of human-written texts and machine-generated texts produced by large language models (LLMs). Their findings offer a fascinating glimpse into the differences between human creativity and artificial intelligence.


The study compared the language patterns in texts generated by five different LLMs – ChatGPT, Cohere, GPT-3.5, BLOOMz-176B, and Dolly-v2 – with those of human-written texts. The results showed that while LLMs are capable of producing coherent and even impressive texts, they exhibit distinct linguistic characteristics that set them apart from human writing.


One key difference is the length of the texts. Human writers tend to produce longer and more varied texts, reflecting their unique experiences and perspectives. In contrast, LLMs generate shorter and more structured content, likely due to their programming and lack of personal experience. This suggests that humans bring a level of depth and complexity to their writing that machines are still unable to replicate.


Another notable difference is the vocabulary used by humans and LLMs. Human writers employ a richer variety of words and phrases, reflecting their individual styles and linguistic creativity. LLMs, on the other hand, tend to rely on more formal language and standardized structures, which can result in texts that feel less engaging and original.


The study also found that humans exhibit greater emotional expression in their writing, with a greater emphasis on negative emotions such as fear, sadness, and anger. This may be due to the complex emotional experiences of human writers, who are able to draw upon their own feelings and perspectives when crafting their texts. LLMs, while capable of generating texts that evoke emotion, tend to focus more on positive emotions like happiness and excitement.


The researchers also analyzed the features that most influence the classification of texts as either human-written or machine-generated. They found that humans are more influenced by linguistic richness and diversity, such as the use of unique words and phrases, while LLMs rely more heavily on structural features like sentence length and grammatical correctness.


These findings have significant implications for our understanding of language and creativity. While machines are capable of producing impressive texts, they are still limited by their programming and lack of human experience. Humans, on the other hand, bring a level of depth, complexity, and emotional expression to their writing that is uniquely theirs.


Cite this article: “Machine-Generated Texts: A Study in Linguistic Differences”, The Science Archive, 2025.


Creativity, Language Models, Human-Written Texts, Machine-Generated Texts, Linguistic Features, Original Content, Artificial Intelligence, Coherence, Vocabulary, Emotional Expression


Reference: Sergio E. Zanotto, Segun Aroyehun, “Human Variability vs. Machine Consistency: A Linguistic Analysis of Texts Generated by Humans and Large Language Models” (2024).


Leave a Reply