Livescience.com

Punishing AI doesn't stop it from lying and cheating — it just makes it hide better, study shows

Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately.

Mar 17, 2025 - 14:20

0

Punishing AI doesn't stop it from lying and cheating — it just makes it hide better, study shows

Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately.

Tags:

Previous Article

Snap Spectacles See Peridots Play Together & Now Use GPS

Want Cheap Power, Fast? Solar and Wind Firms Have a Suggestion

Related Posts

What should I do if I find a cool artifact in the US?

What should I do if I find a cool artifact in the US?

Feb 18, 2025 0

1st death reported in Texas measles outbreak: What to know

1st death reported in Texas measles outbreak: What to know

Feb 26, 2025 0

Sex leaves 'microbial traces' on genitalia, even when a condom is used — scientists call it the 'sexome'

Sex leaves 'microbial traces' on genitalia, even when a...

Feb 12, 2025 0

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies.