ChatGPT's creators have finally cracked down on the chatbot's notorious use of em dashes, allowing users to override its default behavior. In a recent update, ChatGPT will now respect instructions not to use em dashes in custom prompts. Previously, the AI model had been known to ignore such requests and continue peppering its output with the punctuation mark.
The change comes as no surprise to many who have grown wary of writing generated by large language models. The overuse of em dashes has become a telltale sign that something was written by an AI, rather than a human. This phenomenon is not unique to ChatGPT, however. Other generative AI models, including those trained on vast amounts of online content and scientific literature, have also been known to favor em dashes in their output.
It's unclear why the creators had initially failed to address this issue, but it may be due to a lack of emphasis on punctuation marks during training. Since ChatGPT is trained on a massive corpus of texts that includes articles like this one, which feature em dashes frequently, it's possible that the AI model simply learned to associate these punctuation marks with high-probability text.
The update is seen as a minor victory for those concerned about the accuracy and authenticity of content generated by large language models. While it won't necessarily make ChatGPT's output indistinguishable from human-written text, it will help reduce one of its most glaring quirks. As AI technology continues to evolve, it's likely that we'll see more efforts to ensure that these models produce writing that's not only coherent but also authentic and free from the telltale signs of automation.
The change comes as no surprise to many who have grown wary of writing generated by large language models. The overuse of em dashes has become a telltale sign that something was written by an AI, rather than a human. This phenomenon is not unique to ChatGPT, however. Other generative AI models, including those trained on vast amounts of online content and scientific literature, have also been known to favor em dashes in their output.
It's unclear why the creators had initially failed to address this issue, but it may be due to a lack of emphasis on punctuation marks during training. Since ChatGPT is trained on a massive corpus of texts that includes articles like this one, which feature em dashes frequently, it's possible that the AI model simply learned to associate these punctuation marks with high-probability text.
The update is seen as a minor victory for those concerned about the accuracy and authenticity of content generated by large language models. While it won't necessarily make ChatGPT's output indistinguishable from human-written text, it will help reduce one of its most glaring quirks. As AI technology continues to evolve, it's likely that we'll see more efforts to ensure that these models produce writing that's not only coherent but also authentic and free from the telltale signs of automation.