Model Comparison
Model | Editorial | Structural | Class | Conf | SETL | Theme
claude-haiku-4-5-20251001 | ND | ND | | | |
@cf/meta/llama-4-scout-17b-16e-instruct (lite) | 0.00 | ND | Neutral | 0.90 | 0.00 | Technology
@cf/meta/llama-3.3-70b-instruct-fp8-fast (lite) | 0.00 | ND | Neutral | 0.90 | 0.00 | no human rights theme
deepseek/deepseek-v3.2-20251201 | +0.25 | +0.20 | Neutral | 0.03 | 0.17 | Information Access
Section | claude-haiku-4-5-20251001 | @cf/meta/llama-4-scout-17b-16e-instruct (lite) | @cf/meta/llama-3.3-70b-instruct-fp8-fast (lite) | deepseek/deepseek-v3.2-20251201
Preamble | ND | ND | ND | ND
Article 1 | ND | ND | ND | ND
Article 2 | ND | ND | ND | ND
Article 3 | ND | ND | ND | ND
Article 4 | ND | ND | ND | ND
Article 5 | ND | ND | ND | ND
Article 6 | ND | ND | ND | ND
Article 7 | ND | ND | ND | ND
Article 8 | ND | ND | ND | ND
Article 9 | ND | ND | ND | ND
Article 10 | ND | ND | ND | ND
Article 11 | ND | ND | ND | ND
Article 12 | ND | ND | ND | ND
Article 13 | ND | ND | ND | ND
Article 14 | ND | ND | ND | ND
Article 15 | ND | ND | ND | ND
Article 16 | ND | ND | ND | ND
Article 17 | ND | ND | ND | ND
Article 18 | ND | ND | ND | ND
Article 19 | ND | ND | ND | 0.51
Article 20 | ND | ND | ND | ND
Article 21 | ND | ND | ND | ND
Article 22 | ND | ND | ND | ND
Article 23 | ND | ND | ND | ND
Article 24 | ND | ND | ND | ND
Article 25 | ND | ND | ND | ND
Article 26 | ND | ND | ND | ND
Article 27 | ND | ND | ND | 0.20
Article 28 | ND | ND | ND | ND
Article 29 | ND | ND | ND | ND
Article 30 | ND | ND | ND | ND
0.00 Absurd Success (www.marginalia.nu)
629 points by asicsp 914 days ago | 160 comments on HN | Neutral ~lite vlite-1.4
Summary ~lite Technology Neutral
Technical blog post about improving search engine performance
EQ 0.50
SO 0.50
TD 0.50
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 4 evals
[Chart: score axis from +1 to −1; series label: HN]
Audit Trail 24 entries
2026-02-28 10:19 model_divergence Cross-model spread 0.40 exceeds threshold (3 models) - -
2026-02-28 10:19 eval_success Lite evaluated: Neutral (0.00) - -
2026-02-28 10:19 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning: ED, neutral tech blog post
2026-02-28 10:19 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 10:17 model_divergence Cross-model spread 0.40 exceeds threshold (3 models) - -
2026-02-28 10:17 eval_success Lite evaluated: Neutral (0.00) - -
2026-02-28 10:17 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
reasoning: tech tutorial no rights stance
2026-02-28 10:17 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 10:14 model_divergence Cross-model spread 0.40 exceeds threshold (2 models) - -
2026-02-28 10:14 eval_success Lite evaluated: Neutral (0.00) - -
2026-02-28 10:14 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
reasoning: ED, neutral tech blog post
2026-02-28 10:14 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-26 17:26 dlq Dead-lettered after 1 attempts: Absurd Success - -
2026-02-26 17:24 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:23 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 17:22 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 16:39 eval_success Evaluated: Neutral (0.40) - -
2026-02-26 16:39 eval Evaluated by deepseek-v3.2: +0.40 (Neutral) 11,227 tokens
2026-02-26 12:19 dlq Dead-lettered after 1 attempts: Absurd Success - -
2026-02-26 12:17 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 12:16 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 12:15 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-26 09:30 dlq Dead-lettered after 1 attempts: Absurd Success - -
2026-02-26 09:19 credit_exhausted Credit balance too low, retrying in 280s - -
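The model_divergence entries in the audit trail above report a cross-model spread of 0.40. The page does not say how that spread is calculated, so the following is only a minimal sketch under two assumptions: that the spread is the maximum minus the minimum of the per-model scores, and that the threshold is 0.30 (a hypothetical value chosen for illustration). The function names are also hypothetical. The scores are the ones logged in the eval entries.

```python
from typing import Mapping

# Hypothetical threshold; the page does not state the real value.
DIVERGENCE_THRESHOLD = 0.30


def cross_model_spread(scores: Mapping[str, float]) -> float:
    """Return max minus min over per-model scores (assumed definition)."""
    if len(scores) < 2:
        # A spread needs at least two models to compare.
        return 0.0
    values = list(scores.values())
    return max(values) - min(values)


def exceeds_threshold(
    scores: Mapping[str, float],
    threshold: float = DIVERGENCE_THRESHOLD,
) -> bool:
    """Report whether the spread is strictly above the threshold."""
    return cross_model_spread(scores) > threshold


# Scores taken from the eval entries in the audit trail.
latest_scores = {
    "deepseek-v3.2": 0.40,
    "llama-3.3-70b-wai": 0.00,
    "llama-4-scout-wai": 0.00,
}

if __name__ == "__main__":
    spread = cross_model_spread(latest_scores)
    print(f"spread={spread:.2f} over {len(latest_scores)} models")
    print("divergent" if exceeds_threshold(latest_scores) else "consistent")
```

Under this assumed definition, the two-model case logged at 10:14 (deepseek-v3.2 and llama-4-scout-wai) and the three-model cases logged later give the same spread, which matches the repeated 0.40 in the log.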