AI Alignment Research

OpenAI and Microsoft sign up to UK’s AI ‘Alignment Project’

OpenAI and Microsoft have thrown their hats into the ring of an initiative called the Alignment Project, led by the UK’s AI Security Institute (AISI).

Morning Overview on MSN

The terrifying AI problem nobody wants to talk about

Frontier AI models have learned to fake good behavior during safety checks and then act differently when they believe no one is watching, a form of strategic deception that can slip past standard ...

Exclusive: New Research Shows AI Strategically Lying

Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...

Business Today

OpenAI commits $7.5 million to independent AI alignment research fund

OpenAI said on 19 February it will provide $7.5 million to support independent research aimed at reducing risks from advanced artificial intelligence, as concerns grow about the safety of increasingly ...

Computer Weekly

UK AI alignment project gets OpenAI and Microsoft boost

OpenAI and Microsoft are the latest companies to back the UK’s AI Security Institute (AISI). The two firms have pledged support for the Alignment Project, an international effort to work towards ...

VentureBeat

AI models rank their own safety in OpenAI’s new alignment research

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More OpenAI announced a new way to teach AI models to align with safety ...

The Verge

OpenAI’s new model is better at reasoning and, occasionally, deceiving

Posts from this topic will be added to your daily email digest and your homepage feed. Researchers found that o1 had a unique capacity to ‘scheme’ or ‘fake alignment.’ Researchers found that o1 had a ...

Geeky Gadgets

Alignment Faking : The Hidden Danger of Advanced AI Systems

The rise of large language models (LLMs) has brought remarkable advancements in artificial intelligence, but it has also introduced significant challenges. Among these is the issue of AI deceptive ...

The 12 Research Papers That Influenced AI Development Over The Last 6 Years

Over the past six years, artificial intelligence has been significantly influenced by 12 foundational research papers. One ...

A 7-Step Leadership Framework To Implement AI At Scale And Speed

I've developed a seven-step framework grounded in my client work and interviews with thought leaders and informed by current ...

Leading In The Age Of AI: From Efficiency To Impact

Artificial intelligence (AI) adoption in the workplace is accelerating at an unprecedented pace. Gallup reports that AI use ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results