Vercel
tcp/443 tcp/80
Open service 64.239.123.129:80 · alignment.openai.com
2026-01-26 02:15
HTTP/1.0 308 Permanent Redirect Content-Type: text/plain Location: https://alignment.openai.com/ Refresh: 0;url=https://alignment.openai.com/ server: Vercel Redirecting...
Open service 64.239.123.129:443 · alignment.openai.com
2026-01-26 02:15
HTTP/1.1 200 OK
Accept-Ranges: bytes
Access-Control-Allow-Origin: *
Age: 957675
Cache-Control: public, max-age=0, must-revalidate
Content-Disposition: inline
Content-Length: 5675
Content-Security-Policy: default-src 'self'; img-src 'self' data: https://i.ytimg.com; style-src 'self'; script-src 'self' https://www.youtube-nocookie.com https://cdn.jsdelivr.net; frame-src https://www.youtube-nocookie.com;
Content-Type: text/html; charset=utf-8
Date: Mon, 26 Jan 2026 02:15:34 GMT
Etag: "f34fb18fa3089079436b0a8a2404fa1a"
Last-Modified: Thu, 15 Jan 2026 00:14:19 GMT
Server: Vercel
Strict-Transport-Security: max-age=63072000
X-Vercel-Cache: HIT
X-Vercel-Id: iad1::nc56g-1769393734884-bbd85f200d0f
Connection: close
Page title: Alignment Research Blog
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<meta name="description" content="Informal alignment research updates from the OpenAI team." />
<title>Alignment Research Blog</title>
<link rel="stylesheet" href="assets/styles.css" />
<link rel="alternate" type="application/rss+xml" title="OpenAIAlignment Research Blog RSS" href="/rss.xml" />
<link rel="icon" href="/favicon.ico" sizes="48x48" />
</head>
<body class="home-page">
<a
class="oai-logo-link"
href="https://openai.com/careers/search/?c=43944a55-c391-4174-8e93-a0705464ab25%2C6dd4a467-446d-4093-8d57-d4633a571123"
aria-label="Explore OpenAI Alignment and Safety Systems roles"
>
<img
class="oai-logo"
src="assets/OpenAI-black-monoblossom.png"
alt="OpenAI blossom logo"
/>
<span class="oai-logo-callout" aria-hidden="true">
P.S. The Alignment and Safety Systems teams are hiring!
</span>
</a>
<div class="content">
<h1>Alignment Research Blog</h1>
<div class="subtitle">Informal updates from the OpenAI team</div>
<div id="post-list">
<div class="post-year" data-year="2026">2026</div>
<a class="post-link" data-clean-url data-year="2026" href="coval/">
<div class="post">
<div class="post-meta">
<div class="date">Jan 14</div>
</div>
<div>
<div class="post-title">
CoVal: Learning values-aware rubrics from the crowd
</div>
<div class="post-subtitle">
An experimental dataset of crowd-written rubrics that surfaces why people prefer one model output over another.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2026" href="confessions/">
<div class="post">
<div class="post-meta">
<div class="date">Jan 14</div>
</div>
<div>
<div class="post-title">
Why we are excited about confessions
</div>
<div class="post-subtitle">
Deeper analysis of confession training and comparisons to chain-of-thought monitoring.
</div>
</div>
</div>
</a>
<div class="post-year" data-year="2025">2025</div>
<a class="post-link" data-clean-url data-year="2025" href="helpful-assistant-features/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 22</div>
</div>
<div>
<div class="post-title">
Helpful assistant features suppress emergent misalignment
</div>
<div class="post-subtitle">
Emergent misalignment not only activates misaligned personas, but also suppresses helpful assistant personas.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2025" href="prod-evals/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 18</div>
</div>
<div>
<div class="post-title">
Sidestepping Evaluation Awareness and Anticipating Misalignment with Production Evaluations
</div>
<div class="post-subtitle">
A pipeline to uncover unknown misaligned behavior and scale the creation of realistic evaluations.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2025" href="sae-latent-attribution/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 1</div>
</div>
<div>
<div class="post-title">
Debugging misaligned completions with sparse-autoencoder latent attribution
</div>
<div class="post-subtitle">
Efficiently finding features that cause behaviors.
</div>
</div>
</div>
Open service 64.239.109.129:80 · alignment.openai.com
2026-01-26 02:15
HTTP/1.0 308 Permanent Redirect Content-Type: text/plain Location: https://alignment.openai.com/ Refresh: 0;url=https://alignment.openai.com/ server: Vercel Redirecting...
Open service 64.239.109.129:443 · alignment.openai.com
2026-01-26 02:15
HTTP/1.1 200 OK
Accept-Ranges: bytes
Access-Control-Allow-Origin: *
Age: 954152
Cache-Control: public, max-age=0, must-revalidate
Content-Disposition: inline
Content-Length: 5675
Content-Security-Policy: default-src 'self'; img-src 'self' data: https://i.ytimg.com; style-src 'self'; script-src 'self' https://www.youtube-nocookie.com https://cdn.jsdelivr.net; frame-src https://www.youtube-nocookie.com;
Content-Type: text/html; charset=utf-8
Date: Mon, 26 Jan 2026 02:15:34 GMT
Etag: "f34fb18fa3089079436b0a8a2404fa1a"
Last-Modified: Thu, 15 Jan 2026 01:13:01 GMT
Server: Vercel
Strict-Transport-Security: max-age=63072000
X-Vercel-Cache: HIT
X-Vercel-Id: sin1::xsgwp-1769393734681-4d6d5e0778ed
Connection: close
Page title: Alignment Research Blog
<!DOCTYPE html>
<html lang="en">
<head>
<meta charset="UTF-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
<meta name="description" content="Informal alignment research updates from the OpenAI team." />
<title>Alignment Research Blog</title>
<link rel="stylesheet" href="assets/styles.css" />
<link rel="alternate" type="application/rss+xml" title="OpenAIAlignment Research Blog RSS" href="/rss.xml" />
<link rel="icon" href="/favicon.ico" sizes="48x48" />
</head>
<body class="home-page">
<a
class="oai-logo-link"
href="https://openai.com/careers/search/?c=43944a55-c391-4174-8e93-a0705464ab25%2C6dd4a467-446d-4093-8d57-d4633a571123"
aria-label="Explore OpenAI Alignment and Safety Systems roles"
>
<img
class="oai-logo"
src="assets/OpenAI-black-monoblossom.png"
alt="OpenAI blossom logo"
/>
<span class="oai-logo-callout" aria-hidden="true">
P.S. The Alignment and Safety Systems teams are hiring!
</span>
</a>
<div class="content">
<h1>Alignment Research Blog</h1>
<div class="subtitle">Informal updates from the OpenAI team</div>
<div id="post-list">
<div class="post-year" data-year="2026">2026</div>
<a class="post-link" data-clean-url data-year="2026" href="coval/">
<div class="post">
<div class="post-meta">
<div class="date">Jan 14</div>
</div>
<div>
<div class="post-title">
CoVal: Learning values-aware rubrics from the crowd
</div>
<div class="post-subtitle">
An experimental dataset of crowd-written rubrics that surfaces why people prefer one model output over another.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2026" href="confessions/">
<div class="post">
<div class="post-meta">
<div class="date">Jan 14</div>
</div>
<div>
<div class="post-title">
Why we are excited about confessions
</div>
<div class="post-subtitle">
Deeper analysis of confession training and comparisons to chain-of-thought monitoring.
</div>
</div>
</div>
</a>
<div class="post-year" data-year="2025">2025</div>
<a class="post-link" data-clean-url data-year="2025" href="helpful-assistant-features/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 22</div>
</div>
<div>
<div class="post-title">
Helpful assistant features suppress emergent misalignment
</div>
<div class="post-subtitle">
Emergent misalignment not only activates misaligned personas, but also suppresses helpful assistant personas.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2025" href="prod-evals/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 18</div>
</div>
<div>
<div class="post-title">
Sidestepping Evaluation Awareness and Anticipating Misalignment with Production Evaluations
</div>
<div class="post-subtitle">
A pipeline to uncover unknown misaligned behavior and scale the creation of realistic evaluations.
</div>
</div>
</div>
</a>
<a class="post-link" data-clean-url data-year="2025" href="sae-latent-attribution/">
<div class="post">
<div class="post-meta">
<div class="date">Dec 1</div>
</div>
<div>
<div class="post-title">
Debugging misaligned completions with sparse-autoencoder latent attribution
</div>
<div class="post-subtitle">
Efficiently finding features that cause behaviors.
</div>
</div>
</div>