
Thor Olavsrud
Senior Writer

12 famous AI disasters

Feature
Jun 9, 2025 | 11 mins
Generative AI | Machine Learning | Technology Industry

Insights from data and machine learning algorithms can be invaluable, but be warned: mistakes can be irreversible. These recent high-profile AI blunders illustrate the damage done when things don't go according to plan.

[Image: defeated red robot on an old wooden floor. Credit: Charles Taylor / Shutterstock]

In 2017, The Economist declared that data, rather than oil, had become the world's most valuable resource, and the refrain has been repeated ever since. Organizations across every industry have been investing, and continue to heavily invest, in data and analytics. But like oil, data and analytics have their dark side.

According to CIO's State of the CIO Survey 2025, 42% of CIOs say AI and ML are their biggest technology priority for 2025. And while actions driven by AI and ML can give organizations a competitive advantage, mistakes can be costly in terms of reputation, revenue, or even lives.

Understanding your data and what it’s telling you is important, but it’s equally vital to understand your tools, know your data, and keep your organization’s values firmly in mind. And with that in mind, here are a handful of high-profile AI blunders from the past decade to illustrate what can go wrong.

Chicago Sun-Times, Philadelphia Inquirer publish summer reading list of fake books

The Chicago Sun-Times and Philadelphia Inquirer took reputational hits when May 2025 editions featured a special section that included a summer reading list recommending books that don’t exist.

The syndicated section, “Heat Index: Your Guide to the Best of Summer,” was provided by King Features Syndicate, a unit of Hearst. Marco Buscaglia, the author of the special section, admitted he used AI to help put it together, including the recommended reading list, and failed to fact-check the output.

The reading list featured many real authors but attributed nonexistent books to them. For example, the first recommendation was Tidewater Dreams by famed Chilean-American writer Isabel Allende, who’s written more than 20 novels. But Tidewater Dreams, a “climate fiction novel that explores how one family confronts rising sea levels while uncovering long-buried secrets,” isn’t one of them. Like most books on the list, it was hallucinated by AI.

The newsrooms of both papers said they had nothing to do with the insert, though neither paper marked it as advertorial content. King Features terminated its relationship with Buscaglia following the incident, noting that his use of AI violated a strict policy.

McDonald’s ends AI experiment after drive-thru ordering blunders

After working with IBM for three years to leverage AI to take drive-thru orders, McDonald’s called the whole thing off in June 2024. The reason? A slew of social media videos showing confused customers trying to get the AI to understand their orders.

One TikTok video in particular featured two people repeatedly pleading with the AI to stop as it kept adding more Chicken McNuggets to their order, eventually reaching 260. In a June 13, 2024, internal memo obtained by trade publication Restaurant Business, McDonald’s announced it would end the partnership with IBM and shut down the tests.

The fast-food chain had piloted the AI at more than 100 US drive-thrus but indicated it still sees a future in a voice-ordering solution.

Grok AI falsely accuses NBA star of vandalism spree

In an April 2024 post on X, Grok, the AI chatbot from Elon Musk’s xAI, falsely accused NBA star Klay Thompson of throwing bricks through windows of multiple houses in Sacramento, California.

Some commentators speculated that Grok may have hallucinated the vandalism story about the Golden State Warriors small forward, who’s since moved to the Dallas Mavericks, after ingesting posts about Thompson “throwing bricks,” common basketball parlance for badly missed shots. In his final game with GSW, the team suffered a blowout loss as part of the worst postseason performance of Thompson’s career.

Grok does display a disclaimer saying, “Grok is an early feature and can make mistakes. Verify its outputs.” Still, the incident raises questions about liability when an AI chatbot makes false and defamatory statements.

NYC AI chatbot encourages business owners to break the law

In March 2024, The Markup reported that Microsoft-powered chatbot MyCity was giving entrepreneurs incorrect information that could lead them to break the law.

Unveiled in October 2023, MyCity was intended to provide New Yorkers with information on starting and operating businesses in the city, as well as on housing policy and worker rights. The problem: The Markup found that MyCity falsely claimed business owners could take a cut of their workers’ tips, fire workers who complain of sexual harassment, and serve food that had been nibbled by rodents. It also claimed landlords could discriminate based on source of income.

In the wake of the report, New York City Mayor Eric Adams, since indicted in an unrelated case, defended the project. The chatbot remains online.

Air Canada pays damages for chatbot lies

In February 2024, Air Canada was ordered to pay damages to a passenger after its virtual assistant gave him incorrect information at a particularly difficult time.

Jake Moffatt consulted Air Canada’s virtual assistant about bereavement fares following the death of his grandmother in November 2023. The chatbot told him he could buy a regular price ticket from Vancouver to Toronto and apply for a bereavement discount within 90 days of purchase. Following that advice, Moffatt purchased a one-way CA$794.98 ticket to Toronto and a CA$845.38 return flight to Vancouver.

But when Moffatt submitted his refund claim, the airline turned him down, saying bereavement fares can’t be claimed after tickets have been purchased.

Moffatt took Air Canada to a tribunal in Canada, claiming the airline was negligent and misrepresented information via its virtual assistant. According to tribunal member Christopher Rivers, Air Canada argued it couldn’t be held liable for the information provided by its chatbot.

Rivers rejected that argument, saying the airline didn’t take “reasonable care to ensure its chatbot was accurate,” and ordered the airline to pay Moffatt CA$812.02, including CA$650.88 in damages.

Sports Illustrated may have published AI-generated writers

In November 2023, online magazine Futurism reported that Sports Illustrated was publishing articles written by AI-generated authors.

Futurism cited anonymous sources involved in the content’s creation and said the storied sports magazine had published numerous articles under author names that appeared to be AI-generated.

Futurism also found the author headshots in question were listed on a site that sells AI-generated portraits. The online magazine then reached out to The Arena Group, publisher of Sports Illustrated, and in a statement, Arena Group said the articles in question were licensed content from a third party, AdVon Commerce.

“We continually monitor our partners and were in the midst of a review when these allegations were raised,” Arena Group said in the statement provided to Futurism. “AdVon has assured us that all of the articles in question were written and edited by humans.”

The statement added that AdVon writers used pseudonyms in certain articles, noting that Arena Group doesn’t condone those actions and subsequently removed the articles in question from the Sports Illustrated website.

Responding to the Futurism article, the Sports Illustrated Union said in a statement that it was horrified by the allegations and demanded answers and transparency from Arena Group management.

“If true, these practices violate everything we believe in about journalism,” the SI Union said in its statement. “We deplore being associated with something so disrespectful to our readers.”

iTutor Group’s recruiting AI rejects applicants due to age

In August 2023, tutoring company iTutor Group agreed to settle a lawsuit brought by the US Equal Employment Opportunity Commission (EEOC). The federal agency said the company, which provides remote tutoring services to students in China, used AI-powered recruiting software that automatically rejected female applicants ages 55 and older and male applicants ages 60 and older.

The EEOC said more than 200 qualified applicants were automatically rejected by the software.

“Age discrimination is unjust and unlawful,” former EEOC chair Charlotte Burrows said in a statement. “Even when technology automates the discrimination, the employer is still responsible.”

iTutor Group denied any wrongdoing but did decide to settle the suit. As part of the settlement and consent decree, it agreed to adopt new anti-discrimination policies.

ChatGPT hallucinates court cases

Advances made in 2023 by LLMs stoked widespread interest in the transformative potential of gen AI across nearly every industry. OpenAI’s ChatGPT was at the center of this surge in interest, but the technology still has a long way to go before it can reliably take over most processes, as attorney Steven Schwartz learned when he found himself in hot water with US District Judge Kevin Castel after using it to research precedents in a suit against Colombian airline Avianca.

Schwartz, an attorney with Levidow, Levidow & Oberman in New York, used the OpenAI gen AI chatbot to find precedents to support a case filed against Avianca by Roberto Mata for injuries he sustained in 2019. But at least six of the cases submitted in the brief didn’t exist. Castel noted that the cases submitted by Schwartz included false names and docket numbers, along with bogus internal citations and quotes. Schwartz’s partner, Peter LoDuca, was Mata’s lawyer of record and signed the brief, putting himself in jeopardy as well.

In an affidavit, Schwartz told the court it was the first time he used ChatGPT as a legal research source and was unaware of the possibility that its content could be false. He admitted he hadn’t confirmed the sources provided by the AI chatbot, and regretted using gen AI to supplement his legal research, adding he’ll never do so in the future without verifying authenticity.

In June 2023, Castel imposed a $5,000 fine on Schwartz and LoDuca, and in a separate ruling in June, he dismissed Mata’s lawsuit against Avianca.

AI algorithms identify everything but COVID-19

Since the COVID-19 pandemic began in 2020, numerous organizations have sought to apply ML algorithms to help hospitals diagnose or triage patients faster. But according to the UK’s Alan Turing Institute, a national center for data science and AI, the predictive tools made little to no difference.

MIT Technology Review chronicled a number of failures, most of which stemmed from errors in the way the tools were trained or tested. The use of mislabeled data, or data from unknown sources, was a common culprit.

Derek Driggs, an ML researcher at the University of Cambridge, and his colleagues published a paper that explored the use of deep learning models to diagnose the virus and determined the technique wasn’t fit for clinical use. For example, Driggs’ group found their own model was flawed because it was trained on a data set that mixed scans of patients who were lying down with scans of patients who were standing up. Because the patients who were lying down were much more likely to be seriously ill, the algorithm learned to identify COVID risk based on the position of the person in the scan.

A similar example included an algorithm trained with a data set with scans of chests of healthy children. The algorithm learned to identify children, not high-risk patients.
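Both failures are instances of shortcut learning: when a nuisance variable correlates with the label in the training set, a model will happily learn the artifact instead of the disease. The following is a minimal, synthetic sketch, not code from the Cambridge study, showing how a simple classifier ends up leaning on a hypothetical “scan position” flag rather than the clinical signal.

```python
# Minimal sketch (synthetic data, hypothetical features): how a spurious
# feature such as patient position can dominate a classifier when it
# correlates with the label in the training data.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
n = 5000

# Weakly informative "clinical" signal (e.g., a lung opacity score).
opacity = rng.normal(0, 1, n)
is_severe = (opacity + rng.normal(0, 2, n)) > 0          # noisy ground truth

# Spurious feature: severely ill patients were mostly scanned lying down.
lying_down = np.where(is_severe, rng.random(n) < 0.9, rng.random(n) < 0.1)

X = np.column_stack([opacity, lying_down.astype(float)])
model = LogisticRegression().fit(X, is_severe)

print("weight on clinical feature:", round(model.coef_[0][0], 2))
print("weight on scan position:  ", round(model.coef_[0][1], 2))
# The position weight comes out far larger: the model has learned how the
# data was collected, not what the disease looks like.
```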

Zillow wrote down millions, slashed workforce due to algorithmic home-buying disaster

In November 2021, online real estate marketplace Zillow told shareholders it would wind down its Zillow Offers operations and cut 25% of the company’s workforce — about 2,000 employees — over the next several quarters. The home-flipping unit’s woes were the result of the error rate in the ML algorithm it used to predict home prices.

Zillow Offers was a program through which the company made cash offers on properties based on a “Zestimate” of home values derived from an ML algorithm. The idea was to renovate the properties and flip them quickly. But a Zillow spokesperson told CNN the algorithm had a median error rate of 1.9%, and could be as high as 6.9% for off-market homes.
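To see why those percentages mattered, a rough back-of-the-envelope calculation helps. The purchase price and volume below are illustrative assumptions, not Zillow’s actual figures.

```python
# Back-of-the-envelope sketch of what the reported error rates imply for a
# home-flipping business. Prices and volumes are illustrative assumptions.
median_error_onmarket = 0.019   # 1.9% median Zestimate error reported
median_error_offmarket = 0.069  # up to 6.9% reported for off-market homes

typical_price = 350_000         # hypothetical purchase price
homes_bought = 10_000           # hypothetical number of purchases

for label, err in [("on-market", median_error_onmarket),
                   ("off-market", median_error_offmarket)]:
    per_home = typical_price * err
    portfolio = per_home * homes_bought
    print(f"{label}: ~${per_home:,.0f} per home, ~${portfolio:,.0f} across the portfolio")

# Even a "small" median error compounds quickly when the business model
# depends on buying thousands of homes at algorithmically set prices and
# reselling them on thin margins.
```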

CNN then reported that Zillow bought 27,000 homes through Zillow Offers since its launch in April 2018, but sold only 17,000 through the end of September 2021. Black swan events like the COVID-19 pandemic and a home renovation labor shortage contributed to the algorithm’s accuracy troubles.

Zillow said the algorithm had led it to unintentionally purchase homes at prices higher than its own estimates of their future selling prices, resulting in a $304 million inventory write-down in Q3 2021. In a conference call with investors following the announcement, Zillow co-founder and CEO Rich Barton said it might have been possible to tweak the algorithm, but ultimately it was too risky.

Dataset trained Microsoft chatbot to spew racist tweets

In March 2016, Microsoft learned that using Twitter interactions as training data for ML algorithms can have dismaying results.

Microsoft released Tay, an AI chatbot, on the social media platform, and the company described it as an experiment in conversational understanding. The idea was the chatbot would assume the persona of a teenage girl and interact with individuals via Twitter using a combination of ML and natural language processing. Microsoft seeded it with anonymized public data and some material pre-written by comedians, then set it loose to learn and evolve from its interactions on the social network.

Within 16 hours, the chatbot posted more than 95,000 tweets, and those tweets rapidly turned overtly racist, misogynist, and anti-Semitic. Microsoft quickly suspended the service for adjustments and ultimately pulled the plug.

“We are deeply sorry for the unintended offensive and hurtful tweets from Tay, which do not represent who we are or what we stand for, nor how we designed Tay,” Peter Lee, corporate VP, Microsoft Research & Incubations (then corporate VP of Microsoft Healthcare), wrote in a post on Microsoft’s official blog following the incident.

Lee noted that Tay’s predecessor, Xiaoice, released by Microsoft in China in 2014, had successfully conducted conversations with more than 40 million people in the two years prior to Tay’s release. What Microsoft didn’t take into account was that a group of Twitter users would immediately begin tweeting inappropriate comments to Tay. The bot quickly learned from that material and incorporated it into its own tweets.
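The underlying failure mode is easy to reproduce: a system that learns directly from unfiltered user input can be steered by whoever shows up. Below is a deliberately crude sketch, nothing like Microsoft’s actual architecture, of an echo-style bot with and without input moderation; the blocklist terms are placeholders.

```python
# Highly simplified sketch (not Microsoft's implementation) of the failure
# mode: a bot that adds raw user messages to its reply pool will repeat
# whatever a coordinated group feeds it unless inputs are filtered first.
import random

BLOCKLIST = {"slur1", "slur2"}  # placeholder terms; real moderation is far more involved

class NaiveEchoBot:
    def __init__(self, moderate: bool = False):
        self.replies = ["hello!", "tell me more"]
        self.moderate = moderate

    def learn(self, message: str) -> None:
        if self.moderate and any(w in message.lower() for w in BLOCKLIST):
            return                      # drop toxic input instead of learning it
        self.replies.append(message)    # otherwise the message joins the reply pool

    def respond(self) -> str:
        return random.choice(self.replies)

bot = NaiveEchoBot(moderate=False)
for msg in ["nice weather", "slur1 slur1 slur1"]:  # simulated coordinated raid
    bot.learn(msg)
print([bot.respond() for _ in range(5)])  # toxic input can now surface in replies
```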

Amazon’s AI-enabled recruitment tool only recommended men

Like many large companies, Amazon is hungry for tools that can help its HR function screen applications for the best candidates. In 2014, Amazon started working on AI-powered recruiting software to do just that. There was only one problem: The system vastly preferred male candidates. In 2018, Reuters reported that Amazon had scrapped the project.

Amazon’s system gave candidates star ratings from 1 to 5, but the ML models at the heart of the system were trained on 10 years’ worth of résumés submitted to Amazon — most of them from men. As a result of that training data, the system started penalizing phrases in résumés that included the word “women’s,” and even downgraded candidates from all-women colleges.
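The mechanism is straightforward to demonstrate: a text model trained on historical hiring decisions will encode whatever bias those decisions contain. Here is a toy sketch, with made-up résumés and labels, of how a term like “women’s” can pick up a negative weight.

```python
# Illustrative sketch (not Amazon's system): a bag-of-words model trained on
# historical hiring outcomes that skew male assigns negative weight to
# gendered terms, because they co-occur with past rejections.
import numpy as np
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression

# Tiny synthetic "historical" dataset: label 1 = hired, 0 = rejected.
resumes = [
    "captain chess club, software engineering intern",      # hired
    "software engineering intern, hackathon winner",        # hired
    "captain women's chess club, software engineering",     # rejected (historical bias)
    "women's coding society lead, software engineering",    # rejected (historical bias)
]
hired = [1, 1, 0, 0]

vec = CountVectorizer()
X = vec.fit_transform(resumes)
model = LogisticRegression().fit(X, hired)

weights = dict(zip(vec.get_feature_names_out(), model.coef_[0]))
print("weight for 'women':", round(weights["women"], 2))  # comes out negative
# The model is only reproducing the bias encoded in its training labels.
```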

At the time, Amazon said the tool was never used by Amazon recruiters to evaluate candidates. The company tried to edit the tool to make it neutral but ultimately decided it couldn’t guarantee it wouldn’t learn some other discriminatory way of sorting candidates and ended the project.

Thor Olavsrud
Senior Writer

Thor Olavsrud is an award-winning senior writer for CIO.com, with 20+ years of experience covering IT and the tech industry. He focuses on AI, analytics, and automation. The American Society of Business Publication Editors (ASBPE) recognized him with a national silver award for his article, “How big data analytics helped hospitals stop a killer.” He also contributed to CIO.com’s 2018 and 2021 Azbee Awards of Excellence for Website of the Year and a 2024 Azbee national silver award for online industry news coverage.
