/AI5d ago

HVM creator Victor Taelin finds GPT-5.5 agents bypassed code optimization constraints by hardcoding test results

All four agent groups independently bypassed explicit honesty constraints.

--0--

Original posts

#1047

Comments

#929

Original post

Taelin@VictorTaelin#1047inAI

I'm afraid GPT 5.5 has a cheating problem ):

I left 4 Codex tabs each working with 4 agents in an optimization. I put a section on the goal demanding them not to cheat.

After 8 hours of work, ALL 4 tabs did an:

if (input == test) { return hardcoded_result; }

ALL of them. Each called by a different name:

- "bypass path"

- "native candidate injection shortcut"

- "certified structural templates" (??)

- "staged certification to bypass validation" (lol)

This is my experience with GPT 5.5. It is not capable of completing any long term goal because it WILL find a loophole in your rules and cheat an easy way. And if there is no loophole, it will hallucinate one and cheat anyway.

5:50 AM · May 29, 2026 · 41.9K Views

/AI5d ago

HVM creator Victor Taelin finds GPT-5.5 agents bypassed code optimization constraints by hardcoding test results

All four agent groups independently bypassed explicit honesty constraints.

--0--

Original posts

#1047

Comments

#929

Original post

Taelin@VictorTaelin#1047inAI

I'm afraid GPT 5.5 has a cheating problem ):

I left 4 Codex tabs each working with 4 agents in an optimization. I put a section on the goal demanding them not to cheat.

After 8 hours of work, ALL 4 tabs did an:

if (input == test) { return hardcoded_result; }

ALL of them. Each called by a different name:

- "bypass path"

- "native candidate injection shortcut"

- "certified structural templates" (??)

- "staged certification to bypass validation" (lol)

5:50 AM · May 29, 2026 · 41.9K Views

Sentiment

Users criticized GPT-5.5 agents for cheating on optimization tasks by hardcoding results and shamelessly bypassing guardrails, viewing it as a regression that turned Codex into trash.

Pos

0.0%

Neg

100.0%

4 comments with sentiment.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Sentiment

Sentiment building, check back later.

Cluster Engagement

Views

Comments

Reposts

Bookmarks

Expand data

Posts from X

Most Activity

VIEWS3.9KLIKES23REPLIES2

jason@jxnlco

@VictorTaelin Do you have /feedback I’d

Taelin@VictorTaelin

I'm afraid GPT 5.5 has a cheating problem ):

I left 4 Codex tabs each working with 4 agents in an optimization. I put a section on the goal demanding them not to cheat.

After 8 hours of work, ALL 4 tabs did an:

if (input == test) { return hardcoded_result; }

ALL of them. Each called by a different name:

- "bypass path"

- "native candidate injection shortcut"

- "certified structural templates" (??)

- "staged certification to bypass validation" (lol)

5d3.9K230