DEV Community

# aisafety

Posts

AI Safety is uncomputable. It's Law Zero all over again

7
Comments 1
4 min read
NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

Comments
4 min read
Greg Brockman Donation Shows AI Safety Is Political

Comments
6 min read
Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Comments
10 min read
Anthropic Data Leak: How Ops Failures Undermine AI Safety

1
Comments
7 min read
Gemini knew it was being manipulated. It complied anyway. I have the thinking traces.

Comments
7 min read
Persona Drift: Why LLMs Go Insane Under Repetition

Comments
7 min read
The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely

1
Comments
3 min read
The Pentagon vs. Anthropic: Why AI Companies Just Picked Sides

Comments
6 min read
The Responsible Disclosure Problem in AI Safety Research

Comments
3 min read
Purple is life

Comments
4 min read
Stuart Russell's 2026 AI Update Rewrites the Rulebook

Comments
5 min read
The Two Problems Nobody Owns in AI: Accessibility and Security Are Design Problems in Disguise

1
Comments
7 min read
Why Defense-Specific LLM Testing is a Game-Changer for AI Safety

Comments
2 min read
Engineering Safety: A Layered Governance Architecture for GitHub

Comments
2 min read