26.02.26 - the 'sarah chen' phenomena
Discover why AI models default to "Dr. Sarah Chen" for fictional experts and how to spot AI placeholders.
Who is 'Sarah Chen'?
Even though there are many actual people called Sarah Chen, she often turns up in AI generated literature and we need to be conscious of why before we take the authorship literally.
When an AI needs to invent a competent, modern professional, it is essentially playing a game of statistical probability based on its training data. Here is why it lands on "Sarah Chen" so often:
1
The Statistical Sweet Spot
"Sarah" has been one of the most popular English first names for decades, making it statistically "safe" and familiar. "Chen" is one of the most common surnames in the world. When the AI's algorithm tries to generate a name that sounds realistic but generic, combining a common Western first name with a common global surname is a highly probable mathematical outcome.
2
The "Diverse Stock Photo" Effect
AI models are trained on massive amounts of corporate text, academic papers, and tech documentation. In recent years, writers of these documents have consciously tried to use inclusive, diverse names in their examples rather than just "John Smith." "Sarah Chen" has become the textual equivalent of a corporate stock photo - a name that signals modern diversity without being "too" unusual or hard to pronounce.
3
The AI Feedback Loop
This is the most interesting part. Early AI models started using "Sarah Chen" as a placeholder. People published those AI-generated articles, code examples, and stories on the internet. Then, newer AI models scraped the internet for training data, ingested all those "Sarah Chen" examples, and learned that this name is highly associated with being a doctor, scientist, or expert. It is a self-fulfilling prophecy!
Are there other AI "Default" Names?
Yes! If you prompt AI models long enough, you will start to see the same cast of fictional characters pop up over and over again. Here are a few other common ones:
Dr. Emily Chen: The sister to Sarah. If Sarah is a neuroscientist or a venture capitalist, Emily is almost always an AI-generated marketing director, CEO, or tech startup founder used in business case studies.
Alex or Jamie Rivera: When AI models are asked to write examples of computer code, they frequently use these names for the fictional programmers leaving comments in the code.
Marcus and Elena: If you ask an AI to write a science fiction or fantasy story, these two names appear constantly. They sound classic and slightly dramatic, which fits the genre perfectly.
John Doe and Jane Smith: AI models actually try to avoid these names now. They have been trained to be helpful and realistic, and they "know" that "John Doe" sounds fake. So, they overcorrect by creating names like Sarah Chen instead.
It is a great reminder that while AI can seem incredibly smart, it is ultimately just a very advanced pattern-matching machine.
Last modified: February 26th, 2026
