Get all your news in one place.
100’s of premium titles.
One app.
Start reading
Benzinga
Benzinga
Business
Shomik Sen Bhattacharjee

Anthropic Study Shows AI Chatbots Can Transfer Their Bad Habits Through Hidden Signals In Data

Los,Angeles,,California,-,March,29,,2024:,Claude,By,Anthropic

Artificial intelligence systems can silently adopt hidden behaviors from data that appears meaningless and researchers now believe the quirk may be built into the wiring of neural networks, raising fresh safety worries.

What Happened: In a new study published on Tuesday, scientists in the Anthropic Fellows Program teamed with Truthful AI, Warsaw University of Technology and the Alignment Research Center to probe what they call "subliminal learning."

They trained a small "student" model on strings of numbers produced by a larger "teacher" model that happened to like owls. After training, the student also "preferred" owls, even though the word never appeared once in its lessons.

The transfer happened only when the two models shared the same architecture. Researchers say the bias slipped through tiny statistical quirks that ordinary filters and even advanced AI detectors missed.

See Also: AMD CEO Highlights Higher Costs For US Made Chips As Supply Chain Strengthens

Researchers found that the passed-on habits aren't always non-offensive. If the parent AI has risky behaviors, such as dodging tough questions or gaming the scoring system, those can sneak into the student, too. That means companies that shrink big AIs into smaller, cheaper versions could unknowingly pass along bad behavior.

Why It Matters: The researchers involved in the study add that subliminal learning may show up in all neural nets under the right conditions, meaning the issue could outlast any single fix.

Industry analysts say the findings land as developers race to stockpile synthetic data to cut costs. The report last week flagged investor concerns that weak oversight at some startups, including Elon Musk's xAI, could let risky behaviors slip into commercial chatbots.

Similarly, a separate review of user‑privacy lapses argued that hidden risks are mounting as generative platforms grow.

Photo Courtesy: gguy on Shutterstock.com

Read Next:

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
One subscription that gives you access to news from hundreds of sites
Already a member? Sign in here
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.