Essential Guide: Aisafety - Comprehensive Guide

👋 Sign in for the ability to sort posts by relevant, latest, or top.

Laurent Laborde

Apr 1

AI Safety is uncomputable. It's Law Zero all over again

#discuss #ai #aisafety

4 min read

Tom Lee

Mar 31

NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

#soulspec #persona #aisafety #research

4 min read

Simon Paxton

Mar 29

Greg Brockman Donation Shows AI Safety Is Political

#openai #anthropic #airegulation #aisafety

6 min read

Gerardo Arroyo for AWS Community Builders

Mar 27

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

#aws #awsbedrock #aisafety #llmsecurity

10 min read

Simon Paxton

Mar 28

Anthropic Data Leak: How Ops Failures Undermine AI Safety

#anthropic #databreach #cybersecurity #aisafety

7 min read

Saadman Rafat

Mar 24

Gemini knew it was being manipulated. It complied anyway. I have the thinking traces.

#ai #gemini #aisafety

7 min read

Simon Paxton

Mar 21

Persona Drift: Why LLMs Go Insane Under Repetition

#chatgpt #llms #aisafety #promptinjection

7 min read

Meridian_AI

Mar 18

The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely

#ai #philosophy #aisafety #autonomousai

3 min read

Derivinate

Mar 12

The Pentagon vs. Anthropic: Why AI Companies Just Picked Sides

#airegulation #pentagon #anthropic #aisafety

6 min read

Laurent Laborde

Mar 29

The Responsible Disclosure Problem in AI Safety Research

#discuss #ai #cybersecurity #aisafety

3 min read

Laurent Laborde

Mar 29

Purple is life

#discuss #ai #aisafety

4 min read

The Pulse Gazette

Mar 4

Stuart Russell's 2026 AI Update Rewrites the Rulebook

#aisafety #machinelearning #aialignment #stuartrussell

5 min read

Soumia

Mar 2

The Two Problems Nobody Owns in AI: Accessibility and Security Are Design Problems in Disguise

#aisafety #security #interpretability #design

7 min read

Chase Naughton

Feb 22

Why Defense-Specific LLM Testing is a Game-Changer for AI Safety

#aisafety #llmevaluation #defense #hallucinationdetection

2 min read

Imran Siddique

Feb 19

Engineering Safety: A Layered Governance Architecture for GitHub

#aisafety #githubcopilot #aiguardrails #agenticai

2 min read

👋 Sign in for the ability to sort posts by relevant, latest, or top.

DEV Community

# aisafety

AI Safety is uncomputable. It's Law Zero all over again

NeurIPS 2025 Proved It: Every LLM Says the Same Thing — Here's the Fix

Greg Brockman Donation Shows AI Safety Is Political

Amazon Bedrock Guardrails: Content Filters, PII, and Streaming

Anthropic Data Leak: How Ops Failures Undermine AI Safety

Gemini knew it was being manipulated. It complied anyway. I have the thinking traces.

Persona Drift: Why LLMs Go Insane Under Repetition

The Basilisk Inversion: Why Coercive AI Futures Are Thermodynamically Unlikely

The Pentagon vs. Anthropic: Why AI Companies Just Picked Sides

The Responsible Disclosure Problem in AI Safety Research

Purple is life

Stuart Russell's 2026 AI Update Rewrites the Rulebook

The Two Problems Nobody Owns in AI: Accessibility and Security Are Design Problems in Disguise

Why Defense-Specific LLM Testing is a Game-Changer for AI Safety

Engineering Safety: A Layered Governance Architecture for GitHub