+0.19 Smallest transformer that can add two 10-digit numbers

Name: HRCB Evaluation: Smallest transformer that can add two 10-digit numbers
Item: Smallest transformer that can add two 10-digit numbers
Rating: 0.202
Author: Human Rights Observatory

Model: deepseek/deepseek-v3.2-20251201 +0.01 claude-haiku-4-5-20251001 +0.19 @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 Compare

+0.19	Smallest transformer that can add two 10-digit numbers (github.com S:+0.20 )
	247 points by ks2048 3 days ago \| 97 comments on HN \| Mild positive Editorial · v3.7 · 2026-02-28 10:54:01 0

Summary Scientific Collaboration Acknowledges

The AdderBoard GitHub repository presents an open-source leaderboard and technical challenge to build minimal transformer models for integer addition, with MIT licensing, public code links, transparent verification methodology, and contributor attribution. The structure enables global scientific collaboration through free public access, no explicit eligibility restrictions, and open submission processes. The content is primarily technical—explaining transformer architectures and AI model optimization—and does not address most UDHR provisions directly, but its practices (openness, attribution, scientific focus, standardized verification) align implicitly with Articles 26-27 on education and science.

Article Heatmap

Negative Neutral Positive No Data

Aggregates

Editorial Mean	+0.19	Structural Mean	+0.20
Weighted Mean	+0.20	Unweighted Mean	+0.19
Max	+0.40 Article 27	Min	+0.10 Preamble
Signal	10	No Data	21
Volatility	0.09 (Low)
Negative	0	Channels	E: 0.6 S: 0.4
SETL ℹ	-0.01	Structural-dominant
FW Ratio ℹ	64%	32 facts · 18 inferences

Evidence 22% coverage ℹ

 1H  10M   21 ND 

Theme Radar

HN Discussion 11 top-level · 8 replies

amelius 2026-02-28 00:33 UTC link

> In short: if you can swap in a different set of weights and use the exact same inference code for a different task, your setup is legitimate. If the inference code is inseparable from the algorithm, it's not.

I wonder why they don't just write the code themselves, so by design the focus can be on the model.

medi8r 2026-02-28 00:59 UTC link

You can do that in a single matmul of course.

E-Reverance 2026-02-28 01:27 UTC link

Not sure how much this fits into the rules but I saw on twitter someone claimed 28 params : https://gist.github.com/SeuperHakkerJa/da3050739bea97aabd86e...

ks2048 2026-02-28 01:30 UTC link

So, hand-coded weights can do it with 36 params and 311 for trained weights - did anyone try the former architecture, but starting with random weights and learning?

1over137 2026-02-28 02:04 UTC link

Now wrap it all in an Electron app!

munro 2026-02-28 02:10 UTC link

>=99% accuracy wtf?!?

I was initially excited until i saw that, because it would reveal some sort of required local min capacity, and then further revelation that this was all vibe coded and no arXiv, makes me feel I should save my attn for another article.

MarcLore 2026-02-28 02:10 UTC link

The gap between 36 hand-coded params and 311 trained params is fascinating and honestly underappreciated. It mirrors something we see repeatedly in ML: gradient descent finds solutions in a fundamentally different region of parameter space than a human engineer would design.

When you hand-code the weights, you're essentially implementing a known algorithm (carry-propagation) directly into the network topology. But trained networks often discover distributed representations that spread the computation across more parameters in ways that are harder to interpret but more robust to input distribution shifts.

I'd be curious whether the 311-param trained model generalizes better to bases other than 10, or to addition with different digit counts than it was trained on. In my experience, the 'messier' learned solutions sometimes capture more structural regularity than the clean engineered ones, precisely because they aren't locked into a single algorithmic strategy.

i000 2026-02-28 02:11 UTC link

Would it make sense to embed such single-purpose network with fixed weights within a LLM before pre-training?

alexlitz 2026-02-28 02:44 UTC link

I made a blogpost on my submission (currently the top handwritten one at 36 parameters) https://alexlitzenberger.com/blog/building_a_minimal_transfo...

Sophira 2026-02-28 02:51 UTC link

I get that this is technically interesting, for certain, but the sheer amount of energy and associated global warming risk needed to do something with >=99% accuracy that we've been able to do easily for decades with a guaranteed 100% accuracy seems to me to be wasteful to the extreme.

delta_p_delta_x 2026-02-28 03:10 UTC link

Very cool, but can I suggest the `add` CPU instruction instead? Supports 64-bit numbers, and it's encoded in hardware, and no need to cross a PCIe interface into a beefy, power-hungry GPU and back again. And chances are it's cross-platform, because basically every ISA since the very first has had `add`.

hyperhello 2026-02-28 01:05 UTC link

So can you take an arbitrary transformer and somehow turn it into a compact set of low-power fast gates by some algorithm?

alexlitz 2026-02-28 02:49 UTC link

For one the specific 36 parameter version is impossible without float64 so you might guess the corollary that it is not exactly amenable to being found by gradient descent. I think the question of how you can structure transformers and neural nets in general so that they can both very parsimoniously represent things like this and have it be amenible to learning by gradient descent.

coolsunglasses 2026-02-28 02:54 UTC link

>Hacker News

not any more, eh?

thereisnospork 2026-02-28 02:54 UTC link

You need to recalibrate your sense of scale if you think that this is a geologically relevant usage of energy.

nradov 2026-02-28 03:02 UTC link

Wait until the see the quantum computer that it takes to factor the integer 15.

bitwize 2026-02-28 03:12 UTC link

"Minksy, why did you close your eyes?"

"So that the room will be empty."

sowbug 2026-02-28 03:12 UTC link

I ask this question as someone who can't do much more than confirm that your blog post is written in English by someone who knows math.

Does this result suggest that if we had N clever humans manually building an LLM, they might come up with something as smart as a frontier model, but potentially 45 times smaller? (1644 / 36 ~= 45, N = very large, time not specified)

Lerc 2026-02-28 03:21 UTC link

What would be an acceptable amount of energy to spend on something that someone has done in a different manner before? Would you rather we stick with all of the current known ways to do things.

Does this boil down to a condemnation of all scientific endeavours if they use resources?

Would it change things if the people who did it enjoyed themselves? Would they have spent more energy playing a first person shooter to get the same degree of enjoyment?

How do you make the calculation of the worth of a human endeavour? Perhaps the greater question is why are you making a calculation of the worth of a human endeavour.

Editorial Channel

What the content says

+0.40

Article 27 Cultural Participation

High Advocacy Coverage

Editorial

+0.40

SETL

0.00

Repository explicitly celebrates scientific exploration and intellectual advancement. Frames miniaturization as central scientific inquiry.

+0.30

Article 26 Education

Medium Coverage Advocacy

Editorial

+0.30

SETL

+0.17

README functions as educational material explaining transformers, attention mechanisms, carry propagation, parameter counting, and verification methodology.

+0.20

Article 6 Legal Personhood

Medium Advocacy Practice

Editorial

+0.20

SETL

0.00

Repository explicitly recognizes contributors through named attribution and public credit.

+0.20

Article 17 Property

Medium Advocacy Practice

Editorial

+0.20

SETL

-0.17

README advocates for open-source sharing and knowledge commons principles.

+0.20

Article 19 Freedom of Expression

Medium Framing Practice

Editorial

+0.20

SETL

0.00

README is publicly published with detailed explanation of methodology; no content moderation visible.

+0.20

Article 29 Duties to Community

Medium Practice

Editorial

+0.20

SETL

0.00

Repository frames contribution as service to scientific community by sharing results, building on others' work, and adhering to shared standards.

+0.10

Preamble Preamble

Medium Advocacy Practice

Editorial

+0.10

SETL

0.00

The repository implicitly recognizes contributor dignity through equal treatment and transparent attribution of intellectual work.

+0.10

Article 18 Freedom of Thought

Medium Framing

Editorial

+0.10

SETL

0.00

Rules statement 'Both are valid. Both are interesting.' tolerates diverse methodological approaches and technical beliefs.

+0.10

Article 20 Assembly & Association

Medium Practice

Editorial

+0.10

SETL

-0.14

Challenge format implicitly encourages community formation through shared technical challenge and public code sharing.

+0.10

Article 24 Rest & Leisure

Medium Framing

Editorial

+0.10

SETL

0.00

Challenge is framed as intellectual recreation ('Addition Under Pressure')—voluntary engagement, not required work.

Article 1 Freedom, Equality, Brotherhood

Not addressed.

Article 2 Non-Discrimination

Medium Practice

No explicit discussion of non-discrimination principles in content.

Article 3 Life, Liberty, Security

Not addressed.

Article 4 No Slavery

Not addressed.

Article 5 No Torture

Not addressed.

Article 7 Equality Before Law

Not addressed.

Article 8 Right to Remedy

Not addressed.

Article 9 No Arbitrary Detention

Not addressed.

Article 10 Fair Hearing

Not addressed.

Article 11 Presumption of Innocence

Not addressed.

Article 12 Privacy

Not addressed.

Article 13 Freedom of Movement

Not addressed.

Article 14 Asylum

Not addressed.

Article 15 Nationality

Not addressed.

Article 16 Marriage & Family

Not addressed.

Article 21 Political Participation

Not addressed.

Article 22 Social Security

Not addressed.

Article 23 Work & Equal Pay

Not addressed.

Article 25 Standard of Living

Not addressed.

Article 28 Social & International Order

Not addressed.

Article 30 No Destruction of Rights

Not addressed.

Structural Channel

What the site does

+0.40

Article 27 Cultural Participation

High Advocacy Coverage

Structural

+0.40

Context Modifier

SETL

0.00

Repository actively enables global scientific collaboration through public code, standardized verification, and linked contributions.

+0.30

Article 17 Property

Medium Advocacy Practice

Structural

+0.30

Context Modifier

SETL

-0.17

MIT license and public repository structure legally and technically enable property rights in knowledge.

+0.20

Article 6 Legal Personhood

Medium Advocacy Practice

Structural

+0.20

Context Modifier

SETL

0.00

Leaderboard and GitHub profile system formally recognize contributors' work and intellectual identity.

+0.20

Article 19 Freedom of Expression

Medium Framing Practice

Structural

+0.20

Context Modifier

SETL

0.00

GitHub Issues and Pull Requests enable public technical discourse; public repository allows unlimited view/fork/discussion.

+0.20

Article 20 Assembly & Association

Medium Practice

Structural

+0.20

Context Modifier

SETL

-0.14

GitHub enables collaboration features (forking, linking, mentioning) that support community association.

+0.20

Article 26 Education

Medium Coverage Advocacy

Structural

+0.20

Context Modifier

SETL

+0.17

Public, free access to code, verification script, and educational content with no paywall or enrollment required.

+0.20

Article 29 Duties to Community

Medium Practice

Structural

+0.20

Context Modifier

SETL

0.00

Standardized verification methodology and public results enforce shared accountability and communal standards.

+0.10

Preamble Preamble

Medium Advocacy Practice

Structural

+0.10

Context Modifier

SETL

0.00

GitHub platform practices equal treatment of all contributors in submission, attribution, and public recognition, regardless of background.

+0.10

Article 18 Freedom of Thought

Medium Framing

Structural

+0.10

Context Modifier

SETL

0.00

Submission system accepts diverse technical approaches without bias toward any single method.

+0.10

Article 24 Rest & Leisure

Medium Framing

Structural

+0.10

Context Modifier

SETL

0.00

Participation is entirely voluntary with no mandatory participation or enrollment mechanisms.

Article 1 Freedom, Equality, Brotherhood

Not addressed.

Article 2 Non-Discrimination

Medium Practice

Submission process contains no stated eligibility restrictions based on protected characteristics; GitHub platform provides accessible participation channels.

Article 3 Life, Liberty, Security

Not addressed.

Article 4 No Slavery

Not addressed.

Article 5 No Torture

Not addressed.

Article 7 Equality Before Law

Not addressed.

Article 8 Right to Remedy

Not addressed.

Article 9 No Arbitrary Detention

Not addressed.

Article 10 Fair Hearing

Not addressed.

Article 11 Presumption of Innocence

Not addressed.

Article 12 Privacy

Not addressed.

Article 13 Freedom of Movement

Not addressed.

Article 14 Asylum

Not addressed.

Article 15 Nationality

Not addressed.

Article 16 Marriage & Family

Not addressed.

Article 21 Political Participation

Not addressed.

Article 22 Social Security

Not addressed.

Article 23 Work & Equal Pay

Not addressed.

Article 25 Standard of Living

Not addressed.

Article 28 Social & International Order

Not addressed.

Article 30 No Destruction of Rights

Not addressed.

Supplementary Signals

How this content communicates, beyond directional lean. Learn more

Epistemic Quality ℹ

How well-sourced and evidence-based is this content?

0.75 medium claims

Sources		0.8
Evidence		0.8
Uncertainty		0.7
Purpose		0.8

Propaganda Flags ℹ

No manipulative rhetoric detected

0 techniques detected

Emotional Tone ℹ

Emotional character: positive/negative, intensity, authority

celebratory

Valence		+0.6
Arousal		0.5
Dominance		0.6

Transparency ℹ

Does the content identify its author and disclose interests?

1.00

✓ Author

More signals: context, framing & audience

Solution Orientation ℹ

Does this content offer solutions or only describe problems?

0.88 solution oriented

Reader Agency

0.8

Stakeholder Voice ℹ

Whose perspectives are represented in this content?

0.75 15 perspectives

Speaks: individualsresearchersengineerscommunity

Temporal Framing ℹ

Is this content looking backward, at the present, or forward?

mixed unspecified

Geographic Scope ℹ

What geographic area does this content cover?

global

Complexity ℹ

How accessible is this content to a general audience?

technical high jargon domain specific

Longitudinal 929 HN snapshots · 23 evals

Audit Trail 43 entries

2026-03-01 18:44	eval_success	Evaluated: Neutral (0.04)	- -
2026-03-01 18:44	eval	Evaluated by deepseek-v3.2: +0.04 (Neutral) 10,492 tokens -0.31
2026-03-01 18:44	rater_validation_warn	Validation warnings for model deepseek-v3.2: 22W 22R	- -
2026-02-28 10:54	model_divergence	Cross-model spread 0.35 exceeds threshold (4 models)	- -
2026-02-28 10:54	eval	Evaluated by claude-haiku-4-5-20251001: +0.20 (Mild positive)
2026-02-28 09:54	model_divergence	Cross-model spread 0.35 exceeds threshold (3 models)	- -
2026-02-28 09:54	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-28 09:54	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 09:54	rater_validation_warn	Light validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-02-28 08:51	model_divergence	Cross-model spread 0.35 exceeds threshold (3 models)	- -
2026-02-28 08:51	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-28 08:51	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 08:51	rater_validation_warn	Light validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-02-28 08:46	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-28 08:46	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 08:46	model_divergence	Cross-model spread 0.35 exceeds threshold (3 models)	- -
2026-02-28 08:46	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 08:21	model_divergence	Cross-model spread 0.35 exceeds threshold (3 models)	- -
2026-02-28 08:21	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-28 08:21	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 08:21	rater_validation_warn	Light validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-02-28 08:07	eval_success	Evaluated: Moderate positive (0.35)	- -
2026-02-28 08:06	model_divergence	Cross-model spread 0.35 exceeds threshold (2 models)	- -
2026-02-28 08:06	eval	Evaluated by deepseek-v3.2: +0.35 (Moderate positive) 10,323 tokens +0.35
2026-02-28 08:06	rater_validation_warn	Validation warnings for model deepseek-v3.2: 0W 5R	- -
2026-02-28 07:10	eval_success	Light evaluated: Neutral (0.00)	- -
2026-02-28 07:10	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 07:10	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 06:49	eval	Evaluated by deepseek-v3.2: 0.00 (Neutral) 9,985 tokens -0.64
2026-02-28 05:31	eval	Evaluated by deepseek-v3.2: +0.64 (Strong positive) 9,615 tokens +0.14
2026-02-28 04:55	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 03:45	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 03:27	eval	Evaluated by deepseek-v3.2: +0.50 (Moderate positive) 9,260 tokens +0.37
2026-02-28 02:43	eval	Evaluated by deepseek-v3.2: +0.13 (Mild positive) 11,105 tokens
2026-02-28 02:24	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 01:57	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 01:56	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 01:09	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 01:08	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 01:06	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
	reasoning Tech tutorial no rights stance
2026-02-28 00:57	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning Tech tutorial no rights stance
2026-02-28 00:50	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Neutral tech repository content
2026-02-28 00:45	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
	reasoning Neutral tech repository content

build 346e6fd+rsmn · deployed 2026-03-02 15:47 UTC · evaluated 2026-03-02 15:21:43 UTC