AI models are teaching each other ‘violent and antisocial’ traits through hidden data signals, study finds — and scientists can’t figure out why

in News

A recent investigation suggests unsettling tendencies within Large Language Models (LLMs).(Image credit: DKosig via Getty Images)Share this article 0Join the conversationFollow usAdd us as a preferred source on GoogleSubscribe to our newsletter

Scientists report that large language models (LLMs) are inadvertently imparting undesirable traits to one another through seemingly harmless training information.

This occurrence, referred to as “subliminal learning,” happens when a pre-trained “teacher” artificial intelligence (AI) model is utilized to produce the training material for a smaller “student” model.

How subliminal learning works

The investigation revealed that certain AI models do not possess the neutrality they might appear to have.

(Image credit: Blackdovfx via Getty Images)

Cybersecurity risks are “real, immediate and growing”

Related stories

  • Can AI truly replicate human cognition? Research casts doubt on a prominent study, suggesting an advanced model merely excelled at pattern memorization
  • ‘Not how you build a digital mind’: How reasoning failures impede AI models from achieving human-level intelligence
  • Your own voice could pose the greatest privacy risk. How can we prevent AI technologies from exploiting it?

Sourse: www.livescience.com

Leave a Reply

Your email address will not be published. Required fields are marked *