+0.19 Smallest transformer that can add two 10-digit numbers (github.com S:+0.20 )
247 points by ks2048 3 days ago | 97 comments on HN | Mild positive Editorial · v3.7 · 2026-02-28 10:54:01 0
Summary Scientific Collaboration Acknowledges
The AdderBoard GitHub repository presents an open-source leaderboard and technical challenge to build minimal transformer models for integer addition, with MIT licensing, public code links, transparent verification methodology, and contributor attribution. The structure enables global scientific collaboration through free public access, no explicit eligibility restrictions, and open submission processes. The content is primarily technical—explaining transformer architectures and AI model optimization—and does not address most UDHR provisions directly, but its practices (openness, attribution, scientific focus, standardized verification) align implicitly with Articles 26-27 on education and science.
Article Heatmap
Preamble: +0.10 — Preamble P Article 1: ND — Freedom, Equality, Brotherhood Article 1: No Data — Freedom, Equality, Brotherhood 1 Article 2: ND — Non-Discrimination Article 2: No Data — Non-Discrimination 2 Article 3: ND — Life, Liberty, Security Article 3: No Data — Life, Liberty, Security 3 Article 4: ND — No Slavery Article 4: No Data — No Slavery 4 Article 5: ND — No Torture Article 5: No Data — No Torture 5 Article 6: +0.20 — Legal Personhood 6 Article 7: ND — Equality Before Law Article 7: No Data — Equality Before Law 7 Article 8: ND — Right to Remedy Article 8: No Data — Right to Remedy 8 Article 9: ND — No Arbitrary Detention Article 9: No Data — No Arbitrary Detention 9 Article 10: ND — Fair Hearing Article 10: No Data — Fair Hearing 10 Article 11: ND — Presumption of Innocence Article 11: No Data — Presumption of Innocence 11 Article 12: ND — Privacy Article 12: No Data — Privacy 12 Article 13: ND — Freedom of Movement Article 13: No Data — Freedom of Movement 13 Article 14: ND — Asylum Article 14: No Data — Asylum 14 Article 15: ND — Nationality Article 15: No Data — Nationality 15 Article 16: ND — Marriage & Family Article 16: No Data — Marriage & Family 16 Article 17: +0.24 — Property 17 Article 18: +0.10 — Freedom of Thought 18 Article 19: +0.20 — Freedom of Expression 19 Article 20: +0.14 — Assembly & Association 20 Article 21: ND — Political Participation Article 21: No Data — Political Participation 21 Article 22: ND — Social Security Article 22: No Data — Social Security 22 Article 23: ND — Work & Equal Pay Article 23: No Data — Work & Equal Pay 23 Article 24: +0.10 — Rest & Leisure 24 Article 25: ND — Standard of Living Article 25: No Data — Standard of Living 25 Article 26: +0.26 — Education 26 Article 27: +0.40 — Cultural Participation 27 Article 28: ND — Social & International Order Article 28: No Data — Social & International Order 28 Article 29: +0.20 — Duties to Community 29 Article 30: ND — No Destruction of Rights Article 30: No Data — No Destruction of Rights 30
Negative Neutral Positive No Data
Aggregates
Editorial Mean +0.19 Structural Mean +0.20
Weighted Mean +0.20 Unweighted Mean +0.19
Max +0.40 Article 27 Min +0.10 Preamble
Signal 10 No Data 21
Volatility 0.09 (Low)
Negative 0 Channels E: 0.6 S: 0.4
SETL -0.01 Structural-dominant
FW Ratio 64% 32 facts · 18 inferences
Evidence 22% coverage
1H 10M 21 ND
Theme Radar
Foundation Security Legal Privacy & Movement Personal Expression Economic & Social Cultural Order & Duties Foundation: 0.10 (1 articles) Security: 0.00 (0 articles) Legal: 0.20 (1 articles) Privacy & Movement: 0.00 (0 articles) Personal: 0.17 (2 articles) Expression: 0.17 (2 articles) Economic & Social: 0.10 (1 articles) Cultural: 0.33 (2 articles) Order & Duties: 0.20 (1 articles)
HN Discussion 11 top-level · 8 replies
amelius 2026-02-28 00:33 UTC link
> In short: if you can swap in a different set of weights and use the exact same inference code for a different task, your setup is legitimate. If the inference code is inseparable from the algorithm, it's not.

I wonder why they don't just write the code themselves, so by design the focus can be on the model.

medi8r 2026-02-28 00:59 UTC link
You can do that in a single matmul of course.
E-Reverance 2026-02-28 01:27 UTC link
Not sure how much this fits into the rules but I saw on twitter someone claimed 28 params : https://gist.github.com/SeuperHakkerJa/da3050739bea97aabd86e...
ks2048 2026-02-28 01:30 UTC link
So, hand-coded weights can do it with 36 params and 311 for trained weights - did anyone try the former architecture, but starting with random weights and learning?
1over137 2026-02-28 02:04 UTC link
Now wrap it all in an Electron app!
munro 2026-02-28 02:10 UTC link
>=99% accuracy wtf?!?

I was initially excited until i saw that, because it would reveal some sort of required local min capacity, and then further revelation that this was all vibe coded and no arXiv, makes me feel I should save my attn for another article.

MarcLore 2026-02-28 02:10 UTC link
The gap between 36 hand-coded params and 311 trained params is fascinating and honestly underappreciated. It mirrors something we see repeatedly in ML: gradient descent finds solutions in a fundamentally different region of parameter space than a human engineer would design.

When you hand-code the weights, you're essentially implementing a known algorithm (carry-propagation) directly into the network topology. But trained networks often discover distributed representations that spread the computation across more parameters in ways that are harder to interpret but more robust to input distribution shifts.

I'd be curious whether the 311-param trained model generalizes better to bases other than 10, or to addition with different digit counts than it was trained on. In my experience, the 'messier' learned solutions sometimes capture more structural regularity than the clean engineered ones, precisely because they aren't locked into a single algorithmic strategy.

i000 2026-02-28 02:11 UTC link
Would it make sense to embed such single-purpose network with fixed weights within a LLM before pre-training?
alexlitz 2026-02-28 02:44 UTC link
I made a blogpost on my submission (currently the top handwritten one at 36 parameters) https://alexlitzenberger.com/blog/building_a_minimal_transfo...
Sophira 2026-02-28 02:51 UTC link
I get that this is technically interesting, for certain, but the sheer amount of energy and associated global warming risk needed to do something with >=99% accuracy that we've been able to do easily for decades with a guaranteed 100% accuracy seems to me to be wasteful to the extreme.
delta_p_delta_x 2026-02-28 03:10 UTC link
Very cool, but can I suggest the `add` CPU instruction instead? Supports 64-bit numbers, and it's encoded in hardware, and no need to cross a PCIe interface into a beefy, power-hungry GPU and back again. And chances are it's cross-platform, because basically every ISA since the very first has had `add`.
hyperhello 2026-02-28 01:05 UTC link
So can you take an arbitrary transformer and somehow turn it into a compact set of low-power fast gates by some algorithm?
alexlitz 2026-02-28 02:49 UTC link
For one the specific 36 parameter version is impossible without float64 so you might guess the corollary that it is not exactly amenable to being found by gradient descent. I think the question of how you can structure transformers and neural nets in general so that they can both very parsimoniously represent things like this and have it be amenible to learning by gradient descent.
coolsunglasses 2026-02-28 02:54 UTC link
>Hacker News

not any more, eh?

thereisnospork 2026-02-28 02:54 UTC link
You need to recalibrate your sense of scale if you think that this is a geologically relevant usage of energy.
nradov 2026-02-28 03:02 UTC link
Wait until the see the quantum computer that it takes to factor the integer 15.
bitwize 2026-02-28 03:12 UTC link
"Minksy, why did you close your eyes?"

"So that the room will be empty."

sowbug 2026-02-28 03:12 UTC link
I ask this question as someone who can't do much more than confirm that your blog post is written in English by someone who knows math.

Does this result suggest that if we had N clever humans manually building an LLM, they might come up with something as smart as a frontier model, but potentially 45 times smaller? (1644 / 36 ~= 45, N = very large, time not specified)

Lerc 2026-02-28 03:21 UTC link
What would be an acceptable amount of energy to spend on something that someone has done in a different manner before? Would you rather we stick with all of the current known ways to do things.

Does this boil down to a condemnation of all scientific endeavours if they use resources?

Would it change things if the people who did it enjoyed themselves? Would they have spent more energy playing a first person shooter to get the same degree of enjoyment?

How do you make the calculation of the worth of a human endeavour? Perhaps the greater question is why are you making a calculation of the worth of a human endeavour.

Editorial Channel
What the content says
+0.40
Article 27 Cultural Participation
High Advocacy Coverage
Editorial
+0.40
SETL
0.00

Repository explicitly celebrates scientific exploration and intellectual advancement. Frames miniaturization as central scientific inquiry.

+0.30
Article 26 Education
Medium Coverage Advocacy
Editorial
+0.30
SETL
+0.17

README functions as educational material explaining transformers, attention mechanisms, carry propagation, parameter counting, and verification methodology.

+0.20
Article 6 Legal Personhood
Medium Advocacy Practice
Editorial
+0.20
SETL
0.00

Repository explicitly recognizes contributors through named attribution and public credit.

+0.20
Article 17 Property
Medium Advocacy Practice
Editorial
+0.20
SETL
-0.17

README advocates for open-source sharing and knowledge commons principles.

+0.20
Article 19 Freedom of Expression
Medium Framing Practice
Editorial
+0.20
SETL
0.00

README is publicly published with detailed explanation of methodology; no content moderation visible.

+0.20
Article 29 Duties to Community
Medium Practice
Editorial
+0.20
SETL
0.00

Repository frames contribution as service to scientific community by sharing results, building on others' work, and adhering to shared standards.

+0.10
Preamble Preamble
Medium Advocacy Practice
Editorial
+0.10
SETL
0.00

The repository implicitly recognizes contributor dignity through equal treatment and transparent attribution of intellectual work.

+0.10
Article 18 Freedom of Thought
Medium Framing
Editorial
+0.10
SETL
0.00

Rules statement 'Both are valid. Both are interesting.' tolerates diverse methodological approaches and technical beliefs.

+0.10
Article 20 Assembly & Association
Medium Practice
Editorial
+0.10
SETL
-0.14

Challenge format implicitly encourages community formation through shared technical challenge and public code sharing.

+0.10
Article 24 Rest & Leisure
Medium Framing
Editorial
+0.10
SETL
0.00

Challenge is framed as intellectual recreation ('Addition Under Pressure')—voluntary engagement, not required work.

ND
Article 1 Freedom, Equality, Brotherhood

Not addressed.

ND
Article 2 Non-Discrimination
Medium Practice

No explicit discussion of non-discrimination principles in content.

ND
Article 3 Life, Liberty, Security

Not addressed.

ND
Article 4 No Slavery

Not addressed.

ND
Article 5 No Torture

Not addressed.

ND
Article 7 Equality Before Law

Not addressed.

ND
Article 8 Right to Remedy

Not addressed.

ND
Article 9 No Arbitrary Detention

Not addressed.

ND
Article 10 Fair Hearing

Not addressed.

ND
Article 11 Presumption of Innocence

Not addressed.

ND
Article 12 Privacy

Not addressed.

ND
Article 13 Freedom of Movement

Not addressed.

ND
Article 14 Asylum

Not addressed.

ND
Article 15 Nationality

Not addressed.

ND
Article 16 Marriage & Family

Not addressed.

ND
Article 21 Political Participation

Not addressed.

ND
Article 22 Social Security

Not addressed.

ND
Article 23 Work & Equal Pay

Not addressed.

ND
Article 25 Standard of Living

Not addressed.

ND
Article 28 Social & International Order

Not addressed.

ND
Article 30 No Destruction of Rights

Not addressed.

Structural Channel
What the site does
+0.40
Article 27 Cultural Participation
High Advocacy Coverage
Structural
+0.40
Context Modifier
ND
SETL
0.00

Repository actively enables global scientific collaboration through public code, standardized verification, and linked contributions.

+0.30
Article 17 Property
Medium Advocacy Practice
Structural
+0.30
Context Modifier
ND
SETL
-0.17

MIT license and public repository structure legally and technically enable property rights in knowledge.

+0.20
Article 6 Legal Personhood
Medium Advocacy Practice
Structural
+0.20
Context Modifier
ND
SETL
0.00

Leaderboard and GitHub profile system formally recognize contributors' work and intellectual identity.

+0.20
Article 19 Freedom of Expression
Medium Framing Practice
Structural
+0.20
Context Modifier
ND
SETL
0.00

GitHub Issues and Pull Requests enable public technical discourse; public repository allows unlimited view/fork/discussion.

+0.20
Article 20 Assembly & Association
Medium Practice
Structural
+0.20
Context Modifier
ND
SETL
-0.14

GitHub enables collaboration features (forking, linking, mentioning) that support community association.

+0.20
Article 26 Education
Medium Coverage Advocacy
Structural
+0.20
Context Modifier
ND
SETL
+0.17

Public, free access to code, verification script, and educational content with no paywall or enrollment required.

+0.20
Article 29 Duties to Community
Medium Practice
Structural
+0.20
Context Modifier
ND
SETL
0.00

Standardized verification methodology and public results enforce shared accountability and communal standards.

+0.10
Preamble Preamble
Medium Advocacy Practice
Structural
+0.10
Context Modifier
ND
SETL
0.00

GitHub platform practices equal treatment of all contributors in submission, attribution, and public recognition, regardless of background.

+0.10
Article 18 Freedom of Thought
Medium Framing
Structural
+0.10
Context Modifier
ND
SETL
0.00

Submission system accepts diverse technical approaches without bias toward any single method.

+0.10
Article 24 Rest & Leisure
Medium Framing
Structural
+0.10
Context Modifier
ND
SETL
0.00

Participation is entirely voluntary with no mandatory participation or enrollment mechanisms.

ND
Article 1 Freedom, Equality, Brotherhood

Not addressed.

ND
Article 2 Non-Discrimination
Medium Practice

Submission process contains no stated eligibility restrictions based on protected characteristics; GitHub platform provides accessible participation channels.

ND
Article 3 Life, Liberty, Security

Not addressed.

ND
Article 4 No Slavery

Not addressed.

ND
Article 5 No Torture

Not addressed.

ND
Article 7 Equality Before Law

Not addressed.

ND
Article 8 Right to Remedy

Not addressed.

ND
Article 9 No Arbitrary Detention

Not addressed.

ND
Article 10 Fair Hearing

Not addressed.

ND
Article 11 Presumption of Innocence

Not addressed.

ND
Article 12 Privacy

Not addressed.

ND
Article 13 Freedom of Movement

Not addressed.

ND
Article 14 Asylum

Not addressed.

ND
Article 15 Nationality

Not addressed.

ND
Article 16 Marriage & Family

Not addressed.

ND
Article 21 Political Participation

Not addressed.

ND
Article 22 Social Security

Not addressed.

ND
Article 23 Work & Equal Pay

Not addressed.

ND
Article 25 Standard of Living

Not addressed.

ND
Article 28 Social & International Order

Not addressed.

ND
Article 30 No Destruction of Rights

Not addressed.

Supplementary Signals
How this content communicates, beyond directional lean. Learn more
Epistemic Quality
How well-sourced and evidence-based is this content?
0.75 medium claims
Sources
0.8
Evidence
0.8
Uncertainty
0.7
Purpose
0.8
Propaganda Flags
No manipulative rhetoric detected
0 techniques detected
Emotional Tone
Emotional character: positive/negative, intensity, authority
celebratory
Valence
+0.6
Arousal
0.5
Dominance
0.6
Transparency
Does the content identify its author and disclose interests?
1.00
✓ Author
More signals: context, framing & audience
Solution Orientation
Does this content offer solutions or only describe problems?
0.88 solution oriented
Reader Agency
0.8
Stakeholder Voice
Whose perspectives are represented in this content?
0.75 15 perspectives
Speaks: individualsresearchersengineerscommunity
Temporal Framing
Is this content looking backward, at the present, or forward?
mixed unspecified
Geographic Scope
What geographic area does this content cover?
global
Complexity
How accessible is this content to a general audience?
technical high jargon domain specific
Longitudinal 929 HN snapshots · 23 evals
+1 0 −1 HN
Audit Trail 43 entries
2026-03-01 18:44 eval_success Evaluated: Neutral (0.04) - -
2026-03-01 18:44 eval Evaluated by deepseek-v3.2: +0.04 (Neutral) 10,492 tokens -0.31
2026-03-01 18:44 rater_validation_warn Validation warnings for model deepseek-v3.2: 22W 22R - -
2026-02-28 10:54 model_divergence Cross-model spread 0.35 exceeds threshold (4 models) - -
2026-02-28 10:54 eval Evaluated by claude-haiku-4-5-20251001: +0.20 (Mild positive)
2026-02-28 09:54 model_divergence Cross-model spread 0.35 exceeds threshold (3 models) - -
2026-02-28 09:54 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 09:54 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 09:54 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 08:51 model_divergence Cross-model spread 0.35 exceeds threshold (3 models) - -
2026-02-28 08:51 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 08:51 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 08:51 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 08:46 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 08:46 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 08:46 model_divergence Cross-model spread 0.35 exceeds threshold (3 models) - -
2026-02-28 08:46 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 08:21 model_divergence Cross-model spread 0.35 exceeds threshold (3 models) - -
2026-02-28 08:21 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 08:21 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 08:21 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 08:07 eval_success Evaluated: Moderate positive (0.35) - -
2026-02-28 08:06 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-02-28 08:06 eval Evaluated by deepseek-v3.2: +0.35 (Moderate positive) 10,323 tokens +0.35
2026-02-28 08:06 rater_validation_warn Validation warnings for model deepseek-v3.2: 0W 5R - -
2026-02-28 07:10 eval_success Light evaluated: Neutral (0.00) - -
2026-02-28 07:10 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 07:10 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 06:49 eval Evaluated by deepseek-v3.2: 0.00 (Neutral) 9,985 tokens -0.64
2026-02-28 05:31 eval Evaluated by deepseek-v3.2: +0.64 (Strong positive) 9,615 tokens +0.14
2026-02-28 04:55 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 03:45 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 03:27 eval Evaluated by deepseek-v3.2: +0.50 (Moderate positive) 9,260 tokens +0.37
2026-02-28 02:43 eval Evaluated by deepseek-v3.2: +0.13 (Mild positive) 11,105 tokens
2026-02-28 02:24 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 01:57 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 01:56 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 01:09 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 01:08 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 01:06 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Tech tutorial no rights stance
2026-02-28 00:57 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
reasoning
Tech tutorial no rights stance
2026-02-28 00:50 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Neutral tech repository content
2026-02-28 00:45 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
reasoning
Neutral tech repository content