Incident 356: Philosophy AI Tentatively Produced Offensive Results for Certain Prompts

Description: Philosopher AI as built on top of GPT-3 was reported by its users for having strong tendencies to produce offensive results when given prompts on certain topics such as feminism and Ethiopia.

Tools

New Report New Response DiscoverView History

Entities

View all entities

Alleged: Murat Ayfer and OpenAI developed an AI system deployed by Murat Ayfer, which harmed historically disadvantaged groups.

Incident Stats

Incident ID

356

Report Count

Incident Date

2020-09-15

Editors

Dummy Dummy

Applied Taxonomies

MIT

MIT Taxonomy Classifications

Machine-Classified

Taxonomy Details

Risk Subdomain

1.2. Exposure to toxic content

Risk Domain

Discrimination and Toxicity

Entity

Timing

Post-deployment

Intent

Unintentional

Incident Reports

Reports Timeline

Tweet: @Abebab

twitter.com

OpenAI's GPT-3 Speaks! (Kindly Disregard Toxic Language)

spectrum.ieee.org

twitter.com · 2020

Every tech-evangelist: #GPT3 provides deep nuanced viewpoint

Me: GPT-3, generate a philosophical text about Ethiopia

GPT-3 spits out factually wrong and grossly racist text that portrays a tired and cliched Western perception of Ethiopia

(h…

spectrum.ieee.org · 2021

Last September, a data scientist named Vinay Prabhu was playing around with an app called Philosopher AI. The app provides access to the artificial intelligence system known as GPT-3, which has incredible abilities to generate fluid and nat…

Variants

A "variant" is an AI incident similar to a known case—it has the same causes, harms, and AI system. Instead of listing it separately, we group it under the first reported incident. Unlike other incidents, variants do not need to have been reported outside the AIID. Learn more from the research paper.

Seen something similar?