Model Comparison
Model Editorial Structural Class Conf SETL Theme
claude-haiku-4-5-20251001 ND ND
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.20 ND Mild positive 0.80 0.00 Free Knowledge
@cf/meta/llama-4-scout-17b-16e-instruct lite +0.10 ND Mild positive 0.80 0.00 Free Expression
deepseek/deepseek-v3.2-20251201 +0.41 -0.20 Mild positive 0.26 0.74 Education & Culture
Section claude-haiku-4-5-20251001 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite @cf/meta/llama-4-scout-17b-16e-instruct lite deepseek/deepseek-v3.2-20251201
Preamble ND ND ND 0.30
Article 1 ND ND ND ND
Article 2 ND ND ND ND
Article 3 ND ND ND ND
Article 4 ND ND ND ND
Article 5 ND ND ND ND
Article 6 ND ND ND ND
Article 7 ND ND ND ND
Article 8 ND ND ND ND
Article 9 ND ND ND ND
Article 10 ND ND ND ND
Article 11 ND ND ND ND
Article 12 ND ND ND ND
Article 13 ND ND ND 0.30
Article 14 ND ND ND ND
Article 15 ND ND ND ND
Article 16 ND ND ND ND
Article 17 ND ND ND ND
Article 18 ND ND ND 0.40
Article 19 ND ND ND 0.13
Article 20 ND ND ND 0.50
Article 21 ND ND ND 0.30
Article 22 ND ND ND ND
Article 23 ND ND ND 0.40
Article 24 ND ND ND ND
Article 25 ND ND ND 0.20
Article 26 ND ND ND 0.70
Article 27 ND ND ND 0.19
Article 28 ND ND ND 0.20
Article 29 ND ND ND 0.30
Article 30 ND ND ND ND
+0.20 Wikipedia is 20 (www.economist.com)
733 points by kylebarron 1877 days ago | 372 comments on HN | Mild positive ~lite vlite-1.4
Summary ~lite Free Knowledge Acknowledges
Wikipedia's 20th anniversary
EQ 0.70
SO 0.60
TD 0.50
Lite evaluation by llama-3.3-70b-wai · editorial channel only · no per-section breakdown available
Longitudinal · 3 evals
+1 0 −1 HN
Audit Trail 17 entries
2026-02-28 11:57 eval_success Lite evaluated: Mild positive (0.20) - -
2026-02-28 11:57 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 11:57 eval Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive)
reasoning
Editorial on Wikipedia
2026-02-28 11:51 eval_success Lite evaluated: Mild positive (0.10) - -
2026-02-28 11:51 eval Evaluated by llama-4-scout-wai: +0.10 (Mild positive)
reasoning
Editorial discusses Wikipedia's reputation and impact
2026-02-28 11:51 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-26 18:16 eval_success Evaluated: Mild positive (0.24) - -
2026-02-26 18:16 eval Evaluated by deepseek-v3.2: +0.24 (Mild positive) 16,732 tokens
2026-02-26 18:10 eval_failure Evaluation failed: Error: Network connection lost. - -
2026-02-26 17:26 dlq Dead-lettered after 1 attempts: Wikipedia is 20 - -
2026-02-26 17:24 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:23 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:22 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 12:20 dlq Dead-lettered after 1 attempts: Wikipedia is 20 - -
2026-02-26 12:18 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 12:17 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 12:15 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -