+0.30 Can you reverse engineer our neural network?

Name: HRCB Evaluation: Can you reverse engineer our neural network?
Item: Can you reverse engineer our neural network?
Rating: 0.271
Author: HN HRCB

Model: deepseek/deepseek-v3.2-20251201 +0.23 @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 claude-haiku-4-5-20251001 +0.30 Compare

+0.30	Can you reverse engineer our neural network? (blog.janestreet.com S:+0.19 )
	316 points by jsomers 5 days ago \| 200 comments on HN \| Mild positive Editorial · v3.7 · 2026-02-28 11:12:23 0

Summary Scientific Advancement & Education Advocates

This technical blog post describes a machine learning puzzle and its solution, celebrating mechanistic interpretability research and intellectual problem-solving. The content primarily engages with Article 27 (scientific advancement), Articles 23 & 26 (work and education), and Article 19 (freedom of information), demonstrating commitment to knowledge sharing, educational access, and supporting research practices.

Article Heatmap

Negative Neutral Positive No Data

Aggregates

Editorial Mean	+0.30	Structural Mean	+0.19
Weighted Mean	+0.27	Unweighted Mean	+0.26
Max	+0.49 Article 27	Min	+0.08 Article 22
Signal	5	No Data	26
Volatility	0.14 (Medium)
Negative	0	Channels	E: 0.6 S: 0.4
SETL ℹ	+0.18	Editorial-dominant
FW Ratio ℹ	63%	15 facts · 9 inferences

Evidence 14% coverage ℹ

 3H  2M   26 ND 

Theme Radar

HN Discussion 10 top-level · 10 replies

stingraycharles 2026-02-27 12:03 UTC link

This is pretty cool, I wasn’t aware of these types of challenges. How does one even approach this?

Feels to me like it’s similar to dumping a binary with an image, the format being entirely custom.

And/or trying to decode a language or cipher, trying to recognize patterns.

bethekind 2026-02-27 13:49 UTC link

Model interpretability is going to be the final frontier of software. You used to need to debug the code. Now you'll need to debug the AI.

clouedoc 2026-02-27 14:42 UTC link

I'm really curious what were the magic words.

> Alex had actually tried to brute force the hash earlier, but had downloaded a list of the top 10,000 most popular words to do it, which turned out not to be big enough to find it. Once he had a big enough word list, he got the answer.

They don't reveal the answer.

neuroelectron 2026-02-27 15:15 UTC link

Give me unlimited API access maybe I can distill it

1024core 2026-02-27 16:32 UTC link

Seems like a thinly-veiled recruiting ad...

renewiltord 2026-02-27 16:57 UTC link

Another classic Jane Street puzzle. Boy this was a good one. Sometimes I look back at my childhood and how quick I was to solve some difficult integrals and so on and now I’d struggle at that. This is far beyond that but the leaps of intuition required here sort of have that property that they need you to stay in the game. Step away a few years and try to come back and there’s just a wall.

I don’t think I’m close to making progress on stuff like this. Interesting to note. Glad they wrote out this behind the scenes thing.

thatguysaguy 2026-02-27 17:20 UTC link

Ah dang. When I did this I also thought the length bug was intentional but I didn't figure it out before I started my new job, so I dropped the puzzle.

spuz 2026-02-27 19:36 UTC link

I was curious to see if I could crack the MD5 hash so I managed to write the following python code to extract the expected hash from the model:

https://gist.github.com/alexspurling/598366d5a5cf5565043b8cd...

Knowing the input text was two words separated by a space, I was able to use hashcat and the unix wordlist (/usr/share/dict/words) to find the solution almost immediately. It's a shame that Alex didn't find it this way on his first attempt as the two words are fairly common.

dang 2026-02-27 20:42 UTC link

[stub for offtopicness]

aizk 2026-02-27 21:52 UTC link

I worked on a puzzle like this roughly 2 years ago from Anthropic. I did the first half, the easier part of the CTF, and my friend did the second half, the more technical ML stuff. We both got interviews at Anthropic, which was cool - I wasn't anywhere close to nailing an interview at Anthropic but it gave me a lot of confidence to end up going all in on tech, which paid off greatly. My friend's short write up: https://x.com/samlakig/status/1797464904703910084

davedx 2026-02-27 12:19 UTC link

[flagged]

wittyusername 2026-02-27 12:31 UTC link

All I think when I see this is "this intelligence wasted on finance and ads."

Can you imagine human potential if it was somehow applied to crop harvesting efficiency, new medicines, etc?

Not everything has to be perfectly efficient but it just saddens me to see all these great minds doing what, adversarially harvesting margin from the works of others?

cess11 2026-02-27 12:34 UTC link

TFA details a solution, it's pretty interesting. Basically the problem was to reverse engineer an absurdly obfuscated and slightly defect MD5 algorithm.

user3939382 2026-02-27 14:26 UTC link

Jane Street skims money from our retirement accounts by building expensive clocks that the rest of us don’t have access to and adversarial queue modeling. We get WWVB and NIST NTP. They say they “add liquidity” as if subsecond trades are some fundamental need in the market. Normal legitimate business settles daily. The contemporary concept of time in banking is inhumane in the strictest sense. These firms are a blight on society.

I have strong math for the question they’re asking but f them.

pixl97 2026-02-27 14:56 UTC link

With the number of operations and the error rate in GPUs this is going to be interesting in SOTA models.

bowmessage 2026-02-27 15:15 UTC link

If I had to guess, “hot dog” would be the first thing I’d try. “Vegetable dog” was given as 0, and it may be alluding to a Silicon Valley episode.

paxys 2026-02-27 15:21 UTC link

Study math/statistics/ML at a graduate level, to start.

expensive_news 2026-02-27 17:50 UTC link

I was one of the solvers. It took me about a week to figure out. This is what I wrote out in my submission with the answer:

> After looking at the final two layers I was somewhat quick to intuit that this was some sort of password check, but wasn’t entirely sure where to go from there. I tried to reverse it, but it was proving to be difficult, and the model was far too deep. I started evaluating the structure and saw the 64 repeated sections of 84 layers that each process 4 characters at a time. Eventually I saw the addition and XOR operations, and the constants that were loaded in every cycle, and the shift amounts that differed between these otherwise identical sections.

> I thought it was an elaborate CTF cryptography challenge, where the algorithm was purposely weak and I had to figure out how to exploit it. But I repeatedly was getting very stuck in my reverse-engineering efforts. After reconsidering the structure and the format of the ‘header' I decided to take another look at existing algorithms...

Basically it took a lot of trial and error, and a lot of clever ways to look at and find patterns in the layers. Now that Jane Street has posted this dissection and 'ended' this contest I might post my notebooks and do a fuller post on it.

The trickiest part, to me, is that for about 5 of the days was spent trying to reverse-engineer the algorithm... but they did in fact use a irreversible hash function, so all that time was in vain. Basically my condensed 'solution' was to explore it enough to be able to explain it to ChatGPT, then confirm that it was the algorithm that ChatGPT suggested (hashing known works and seeing if the output matched) and then running brute force on the hash function, which was ~1000x faster to compute than the model.

bowmessage 2026-02-27 19:06 UTC link

Where is the veil...?

sublinear 2026-02-27 23:49 UTC link

Why? The vast majority of software doesn't need to be written by AI and moves at the speed of the humans making the decisions, not the speed of writing the code.

They make a shit ton of money because of this. If you're working at a place where the code matters more than the decisions that went into it, you're basically working at a sweatshop for people who are desperate for a win and will throw away you and all your code once the MVP stage is over, and that's the only way this "works".

Generative probabilistic AI is not equivalent to a compiler and never will be until we can do this kind of thing completely deterministically. No matter how much you reduce the error in the "model", it's still more error than the error rate of the logic gates. It's completely futile considering the sheer depth of indirection at play, and that indirection is the whole point of software.

Editorial Channel

What the content says

+0.55

Article 27 Cultural Participation

High Advocacy Coverage Practice

Editorial

+0.55

SETL

+0.29

Article explicitly discusses mechanistic interpretability research as valuable scientific practice; describes reverse-engineering neural networks as important tool for understanding AI systems; celebrates human capability to uncover scientific principles

+0.35

Article 26 Education

High Advocacy Coverage

Editorial

+0.35

SETL

+0.23

Article is fundamentally educational; presents detailed problem-solving journey with explanations of multiple technical approaches (SAT solvers, constraint programming, algorithm analysis, mechanistic interpretability)

+0.25

Article 19 Freedom of Expression

High Advocacy Coverage

Editorial

+0.25

SETL

+0.16

Article advocates for knowledge sharing and freedom to publish technical information; demonstrates freedom of expression through detailed public disclosure of puzzle, model, and solution methodology

+0.25

Article 23 Work & Equal Pay

Medium Advocacy Practice

Editorial

+0.25

SETL

+0.16

Article frames technical work as intellectually rewarding and collaborative; emphasizes dignity of workers through positive characterization as 'brilliant' problem-solvers with access to supportive environment

+0.10

Article 22 Social Security

Medium Practice

Editorial

+0.10

SETL

+0.07

Article briefly mentions positive work environment through reference to 'supportive colleagues' and substantial computational resources

Preamble Preamble

No engagement with principles of equal dignity and inalienable rights

Article 1 Freedom, Equality, Brotherhood

No engagement with principle of equal and inalienable rights

Article 2 Non-Discrimination

No engagement with non-discrimination principle

Article 3 Life, Liberty, Security

No engagement with life, liberty, security rights

Article 4 No Slavery

Not engaged

Article 5 No Torture

Not engaged

Article 6 Legal Personhood

Not engaged

Article 7 Equality Before Law

Not engaged

Article 8 Right to Remedy

Not engaged

Article 9 No Arbitrary Detention

Not engaged

Article 10 Fair Hearing

Not engaged

Article 11 Presumption of Innocence

Not engaged

Article 12 Privacy

No engagement with privacy rights despite discussion of cryptographic systems

Article 13 Freedom of Movement

Not engaged

Article 14 Asylum

Not engaged

Article 15 Nationality

Not engaged

Article 16 Marriage & Family

Not engaged

Article 17 Property

Not engaged

Article 18 Freedom of Thought

Not engaged

Article 20 Assembly & Association

Not engaged

Article 21 Political Participation

Not engaged

Article 24 Rest & Leisure

Not engaged

Article 25 Standard of Living

Not engaged

Article 28 Social & International Order

Not engaged

Article 29 Duties to Community

Not engaged

Article 30 No Destruction of Rights

Not engaged

Structural Channel

What the site does

+0.40

Article 27 Cultural Participation

High Advocacy Coverage Practice

Structural

+0.40

Context Modifier

SETL

+0.29

Jane Street publishes research findings, contributes to open source projects, and maintains research desk; provides resources and infrastructure for scientific advancement

+0.20

Article 26 Education

High Advocacy Coverage

Structural

+0.20

Context Modifier

SETL

+0.23

Technical knowledge and puzzle are published freely online as educational material accessible to all readers without barriers

+0.15

Article 19 Freedom of Expression

High Advocacy Coverage

Structural

+0.15

Context Modifier

SETL

+0.16

Content is published openly on blog without authentication, paywalls, or access restrictions; information is freely accessible globally

+0.15

Article 23 Work & Equal Pay

Medium Advocacy Practice

Structural

+0.15

Context Modifier

SETL

+0.16

Jane Street's recruitment and employment practices offer positions with substantial resources and development opportunities

+0.05

Article 22 Social Security

Medium Practice

Structural

+0.05

Context Modifier

SETL

+0.07

Jane Street actively recruits employees and offers positions with access to significant technical resources

Preamble Preamble

No structural signals related to Preamble affirmations

Article 1 Freedom, Equality, Brotherhood

No structural commitment to equal dignity signaled

Article 2 Non-Discrimination

No structural evidence of non-discrimination policy

Article 3 Life, Liberty, Security

No structural signals

Article 4 No Slavery

Not engaged

Article 5 No Torture

Not engaged

Article 6 Legal Personhood

Not engaged

Article 7 Equality Before Law

Not engaged

Article 8 Right to Remedy

Not engaged

Article 9 No Arbitrary Detention

Not engaged

Article 10 Fair Hearing

Not engaged

Article 11 Presumption of Innocence

Not engaged

Article 12 Privacy

Jane Street discloses privacy policies but article does not discuss privacy

Article 13 Freedom of Movement

Not engaged

Article 14 Asylum

Not engaged

Article 15 Nationality

Not engaged

Article 16 Marriage & Family

Not engaged

Article 17 Property

Not engaged

Article 18 Freedom of Thought

Not engaged

Article 20 Assembly & Association

Not engaged

Article 21 Political Participation

Not engaged

Article 24 Rest & Leisure

Not engaged

Article 25 Standard of Living

Not engaged

Article 28 Social & International Order

Not engaged

Article 29 Duties to Community

Not engaged

Article 30 No Destruction of Rights

Not engaged

Supplementary Signals

How this content communicates, beyond directional lean. Learn more

Epistemic Quality ℹ

How well-sourced and evidence-based is this content?

0.77 low claims

Sources		0.8
Evidence		0.8
Uncertainty		0.7
Purpose		0.8

Propaganda Flags ℹ

No manipulative rhetoric detected

0 techniques detected

Emotional Tone ℹ

Emotional character: positive/negative, intensity, authority

measured

Valence		+0.6
Arousal		0.5
Dominance		0.6

Transparency ℹ

Does the content identify its author and disclose interests?

0.70

✓ Author ✗ Conflicts

More signals: context, framing & audience

Solution Orientation ℹ

Does this content offer solutions or only describe problems?

0.72 solution oriented

Reader Agency

0.7

Stakeholder Voice ℹ

Whose perspectives are represented in this content?

0.45 2 perspectives

Speaks: corporationindividuals

About: individuals

Temporal Framing ℹ

Is this content looking backward, at the present, or forward?

mixed historical

Geographic Scope ℹ

What geographic area does this content cover?

global

United States, Europe, Hong Kong

Complexity ℹ

How accessible is this content to a general audience?

technical high jargon domain specific

Longitudinal 786 HN snapshots · 25 evals

Audit Trail 45 entries

2026-03-02 00:02	dlq_auto_replay	DLQ auto-replay: message 98083 re-enqueued	- -
2026-03-01 18:56	eval_success	Evaluated: Mild positive (0.23)	- -
2026-03-01 18:56	model_divergence	Cross-model spread 0.27 exceeds threshold (3 models)	- -
2026-03-01 18:56	eval	Evaluated by deepseek-v3.2: +0.23 (Mild positive) 11,888 tokens +0.11
2026-03-01 15:49	eval_success	Evaluated: Mild positive (0.12)	- -
2026-03-01 15:49	model_divergence	Cross-model spread 0.27 exceeds threshold (3 models)	- -
2026-03-01 15:49	eval	Evaluated by deepseek-v3.2: +0.12 (Mild positive) 13,326 tokens +0.10
2026-03-01 02:51	eval_success	Evaluated: Neutral (0.02)	- -
2026-03-01 02:51	model_divergence	Cross-model spread 0.27 exceeds threshold (4 models)	- -
2026-03-01 02:51	eval	Evaluated by deepseek-v3.2: +0.02 (Neutral) 12,379 tokens -0.14
2026-03-01 02:20	model_divergence	Cross-model spread 0.27 exceeds threshold (4 models)	- -
2026-03-01 02:20	eval_success	Evaluated: Mild positive (0.16)	- -
2026-03-01 02:20	eval	Evaluated by deepseek-v3.2: +0.16 (Mild positive) 11,914 tokens
2026-03-01 01:02	dlq_auto_replay	DLQ auto-replay: message 97933 re-enqueued	- -
2026-02-28 23:24	dlq	Dead-lettered after 1 attempts: Can you reverse engineer our neural network?	- -
2026-02-28 23:24	eval_failure	Evaluation failed: AbortError: The operation was aborted	- -
2026-02-28 23:20	eval_failure	Evaluation failed: AbortError: The operation was aborted	- -
2026-02-28 19:12	dlq	Dead-lettered after 1 attempts: Can you reverse engineer our neural network?	- -
2026-02-28 19:12	eval_failure	Evaluation failed: AbortError: The operation was aborted	- -
2026-02-28 18:59	eval_failure	Evaluation failed: AbortError: The operation was aborted	- -
2026-02-28 15:37	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-02-28 15:36	model_divergence	Cross-model spread 0.27 exceeds threshold (3 models)	- -
2026-02-28 15:36	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 15:23	model_divergence	Cross-model spread 0.27 exceeds threshold (2 models)	- -
2026-02-28 15:23	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-02-28 15:23	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 11:12	eval	Evaluated by claude-haiku-4-5-20251001: +0.27 (Mild positive)
2026-02-28 10:34	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 09:11	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 08:52	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 08:48	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 08:08	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 07:51	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 06:08	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 05:33	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 04:34	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 03:10	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 02:41	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 02:39	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 02:10	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 02:08	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 01:49	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning tech blog no rights stance
2026-02-28 01:24	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning tech blog no rights stance
2026-02-28 01:09	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning ED, neutral tech blog post
2026-02-28 01:01	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
	reasoning ED, neutral tech blog post

build 1ad9551+j7zs · deployed 2026-03-02 09:09 UTC · evaluated 2026-03-02 13:57:54 UTC