Skip to Content
logologo
AI Incident Database
Open TwitterOpen RSS FeedOpen FacebookOpen LinkedInOpen GitHub
Open Menu
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse
Discover
Submit
  • Welcome to the AIID
  • Discover Incidents
  • Spatial View
  • Table View
  • List view
  • Entities
  • Taxonomies
  • Submit Incident Reports
  • Submission Leaderboard
  • Blog
  • AI News Digest
  • Risk Checklists
  • Random Incident
  • Sign Up
Collapse

Incident 855: Names Linked to Defamation Lawsuits Reportedly Spur Filtering Errors in ChatGPT's Name Recognition

Description: ChatGPT has reportedly been experiencing errors and service disruptions caused by hard-coded filters designed to prevent it from producing potentially harmful or defamatory content about certain individuals by blocking prompts containing specific names, likely related to post-training interventions. The reported names are Brian Hood, Jonathan Turley, Jonathan Zittrain, David Faber, David Mayer, and Guido Scorza.
Editor Notes: For the reference to Jonathan Turley, see Incident 506; for Brian Hood, see Incident 507. This incident also presents potential adversarial vulnerabilities, as well as unintended consequences for users sharing affected names.

Tools

New ReportNew ReportNew ResponseNew ResponseDiscoverDiscoverView HistoryView History

Entities

View all entities
Alleged: OpenAI and ChatGPT developed an AI system deployed by OpenAI and ChatGPT users, which harmed ChatGPT users , Jonathan Zittrain , Jonathan Turley , Guido Scorza , David Mayer , David Faber and Brian Hood.

Incident Stats

Incident ID
855
Report Count
3
Incident Date
2024-11-30
Editors
Applied Taxonomies
MIT

MIT Taxonomy Classifications

Machine-Classified
Taxonomy Details

Risk Subdomain

A further 23 subdomains create an accessible and understandable classification of hazards and harms associated with AI
 

7.3. Lack of capability or robustness

Risk Domain

The Domain Taxonomy of AI Risks classifies risks into seven AI risk domains: (1) Discrimination & toxicity, (2) Privacy & security, (3) Misinformation, (4) Malicious actors & misuse, (5) Human-computer interaction, (6) Socioeconomic & environmental harms, and (7) AI system safety, failures & limitations.
 
  1. AI system safety, failures, and limitations

Entity

Which, if any, entity is presented as the main cause of the risk
 

AI

Timing

The stage in the AI lifecycle at which the risk is presented as occurring
 

Post-deployment

Intent

Whether the risk is presented as occurring as an expected or unexpected outcome from pursuing a goal
 

Unintentional

Incident Reports

Reports Timeline

Incident OccurrenceCertain names make ChatGPT grind to a halt, and we know whyWhy Wouldn’t ChatGPT Say This Dead Professor’s Name?The Mystery of Why ChatGPT Couldn’t Say the Name ‘David Mayer’
Certain names make ChatGPT grind to a halt, and we know why

Certain names make ChatGPT grind to a halt, and we know why

arstechnica.com

Why Wouldn’t ChatGPT Say This Dead Professor’s Name?

Why Wouldn’t ChatGPT Say This Dead Professor’s Name?

nytimes.com

The Mystery of Why ChatGPT Couldn’t Say the Name ‘David Mayer’

The Mystery of Why ChatGPT Couldn’t Say the Name ‘David Mayer’

wsj.com

Certain names make ChatGPT grind to a halt, and we know why
arstechnica.com · 2024

OpenAI's ChatGPT is more than just an AI language model with a fancy interface. It's a system consisting of a stack of AI models and content filters that make sure its outputs don't embarrass OpenAI or get the company into legal trouble whe…

Why Wouldn’t ChatGPT Say This Dead Professor’s Name?
nytimes.com · 2024

Across the final years of his life, David Mayer, a theater professor living in Manchester, England, faced the cascading consequences of an unfortunate coincidence: A dead Chechen rebel on a terror watch list had once used Mr. Mayer's name a…

The Mystery of Why ChatGPT Couldn’t Say the Name ‘David Mayer’
wsj.com · 2024

David Mayer wasn't a particularly well-known name until last week, when it was propelled into the internet spotlight. The reason wasn't anything a person named David Mayer said or did, but rather the way the generative AI chatbot ChatGPT tr…

Variants

A "variant" is an incident that shares the same causative factors, produces similar harms, and involves the same intelligent systems as a known AI incident. Rather than index variants as entirely separate incidents, we list variations of incidents under the first similar incident submitted to the database. Unlike other submission types to the incident database, variants are not required to have reporting in evidence external to the Incident Database. Learn more from the research paper.

Similar Incidents

Selected by our editors
ChatGPT Erroneously Alleged Mayor Served Prison Time for Bribery

ChatGPT Erroneously Alleged Mayor Served Prison Time for Bribery

Mar 2023 · 2 reports
ChatGPT Allegedly Produced False Accusation of Sexual Harassment

ChatGPT Allegedly Produced False Accusation of Sexual Harassment

Mar 2023 · 3 reports
By textual similarity

Did our AI mess up? Flag the unrelated incidents

Biased Sentiment Analysis

Biased Sentiment Analysis

Oct 2017 · 7 reports
High-Toxicity Assessed on Text Involving Women and Minority Groups

High-Toxicity Assessed on Text Involving Women and Minority Groups

Feb 2017 · 9 reports
Inappropriate Gmail Smart Reply Suggestions

Inappropriate Gmail Smart Reply Suggestions

Nov 2015 · 22 reports
Previous IncidentNext Incident

Similar Incidents

Selected by our editors
ChatGPT Erroneously Alleged Mayor Served Prison Time for Bribery

ChatGPT Erroneously Alleged Mayor Served Prison Time for Bribery

Mar 2023 · 2 reports
ChatGPT Allegedly Produced False Accusation of Sexual Harassment

ChatGPT Allegedly Produced False Accusation of Sexual Harassment

Mar 2023 · 3 reports
By textual similarity

Did our AI mess up? Flag the unrelated incidents

Biased Sentiment Analysis

Biased Sentiment Analysis

Oct 2017 · 7 reports
High-Toxicity Assessed on Text Involving Women and Minority Groups

High-Toxicity Assessed on Text Involving Women and Minority Groups

Feb 2017 · 9 reports
Inappropriate Gmail Smart Reply Suggestions

Inappropriate Gmail Smart Reply Suggestions

Nov 2015 · 22 reports

Research

  • Defining an “AI Incident”
  • Defining an “AI Incident Response”
  • Database Roadmap
  • Related Work
  • Download Complete Database

Project and Community

  • About
  • Contact and Follow
  • Apps and Summaries
  • Editor’s Guide

Incidents

  • All Incidents in List Form
  • Flagged Incidents
  • Submission Queue
  • Classifications View
  • Taxonomies

2023 - AI Incident Database

  • Terms of use
  • Privacy Policy
  • Open twitterOpen githubOpen rssOpen facebookOpen linkedin
  • 30ebe76