Model Comparison
Model Editorial Structural Class Conf SETL Theme
@cf/meta/llama-4-scout-17b-16e-instruct lite +0.56 ND Moderate positive 0.80 0.00 Social Justice
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.60 ND Strong positive 0.90 0.00 Socialist Politics
claude-haiku-4-5 lite +0.58 ND Moderate positive 0.92 0.00 Worker rights, economic justice
deepseek/deepseek-v3.2-20251201 +0.52 -0.00 Moderate positive 0.54 0.62 Political & Economic Rights
meta-llama/llama-3.3-70b-instruct:free ND ND
Section @cf/meta/llama-4-scout-17b-16e-instruct lite @cf/meta/llama-3.3-70b-instruct-fp8-fast lite claude-haiku-4-5 lite deepseek/deepseek-v3.2-20251201 meta-llama/llama-3.3-70b-instruct:free
Preamble ND ND ND 0.36 ND
Article 1 ND ND ND 0.42 ND
Article 2 ND ND ND 0.48 ND
Article 3 ND ND ND 0.42 ND
Article 4 ND ND ND 0.00 ND
Article 5 ND ND ND 0.00 ND
Article 6 ND ND ND 0.36 ND
Article 7 ND ND ND 0.42 ND
Article 8 ND ND ND 0.30 ND
Article 9 ND ND ND 0.42 ND
Article 10 ND ND ND 0.30 ND
Article 11 ND ND ND 0.00 ND
Article 12 ND ND ND 0.14 ND
Article 13 ND ND ND 0.30 ND
Article 14 ND ND ND 0.30 ND
Article 15 ND ND ND 0.36 ND
Article 16 ND ND ND 0.00 ND
Article 17 ND ND ND 0.42 ND
Article 18 ND ND ND 0.42 ND
Article 19 ND ND ND 0.48 ND
Article 20 ND ND ND 0.42 ND
Article 21 ND ND ND 0.48 ND
Article 22 ND ND ND 0.42 ND
Article 23 ND ND ND 0.42 ND
Article 24 ND ND ND 0.00 ND
Article 25 ND ND ND 0.36 ND
Article 26 ND ND ND 0.18 ND
Article 27 ND ND ND 0.30 ND
Article 28 ND ND ND 0.42 ND
Article 29 ND ND ND 0.36 ND
Article 30 ND ND ND 0.30 ND
+0.56 [CAL-LITE] Jacobin (EX-4) (jacobin.com)
0 points 3 days ago | 0 comments on HN | Moderate positive ~lite vlite-1.4
Summary ~lite Social Justice Advocates
Leftist editorial content promoting social and economic critiques
EQ 0.70
SO 0.60
TD 0.80
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 23 evals
+1 0 −1 HN
Audit Trail 43 entries
2026-02-28 09:15 model_divergence Cross-model spread 0.26 exceeds threshold (4 models) - -
2026-02-28 09:15 eval_success Light evaluated: Moderate positive (0.56) - -
2026-02-28 09:15 eval Evaluated by llama-4-scout-wai: +0.56 (Moderate positive) 0.00
reasoning
Editorial content with leftist perspective and social critiques
2026-02-28 09:15 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 09:10 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 09:10 model_divergence Cross-model spread 0.26 exceeds threshold (4 models) - -
2026-02-28 09:10 eval_success Light evaluated: Moderate positive (0.56) - -
2026-02-28 09:10 eval Evaluated by llama-4-scout-wai: +0.56 (Moderate positive) -0.14
reasoning
Editorial content with leftist perspective and social critiques
2026-02-28 09:01 eval_success Light evaluated: Strong positive (0.60) - -
2026-02-28 09:01 model_divergence Cross-model spread 0.26 exceeds threshold (3 models) - -
2026-02-28 09:01 eval Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) -0.20
reasoning
Left-leaning editorial stance
2026-02-28 09:01 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:22 eval Evaluated by claude-haiku-4-5: +0.58 (Moderate positive) +0.06
2026-02-28 05:14 eval Evaluated by claude-haiku-4-5: +0.52 (Moderate positive) -0.26
2026-02-28 01:41 dlq Dead-lettered after 1 attempts: [CAL-LIGHT] Jacobin (EX-4) - -
2026-02-28 01:38 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:37 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:36 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:36 dlq_replay DLQ message 97628 replayed to LLAMA_QUEUE: [CAL-LIGHT] Jacobin (EX-4) - -
2026-02-28 00:43 eval Evaluated by claude-haiku-4-5: +0.78 (Strong positive) +0.08
2026-02-28 00:33 eval_success Light evaluated: Strong positive (0.70) - -
2026-02-28 00:33 eval Evaluated by llama-4-scout-wai: +0.70 (Strong positive) 0.00
reasoning
Editorial content with leftist perspective and social critiques
2026-02-28 00:26 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 00:26 eval Evaluated by llama-3.3-70b-wai: +0.80 (Strong positive) 0.00
reasoning
Left-leaning editorial stance
2026-02-28 00:21 eval Evaluated by claude-haiku-4-5: +0.70 (Strong positive) -0.02
2026-02-28 00:07 eval_success Light evaluated: Strong positive (0.70) - -
2026-02-28 00:07 eval Evaluated by llama-4-scout-wai: +0.70 (Strong positive) 0.00
reasoning
Editorial content with leftist perspective and social critiques
2026-02-27 23:58 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-27 23:58 eval Evaluated by llama-3.3-70b-wai: +0.80 (Strong positive)
reasoning
Left-leaning editorial stance
2026-02-27 23:54 eval Evaluated by claude-haiku-4-5: +0.72 (Strong positive) +0.02
2026-02-27 21:52 eval_success Light evaluated: Strong positive (0.70) - -
2026-02-27 21:52 eval Evaluated by llama-4-scout-wai: +0.70 (Strong positive) 0.00
reasoning
Editorial content with leftist perspective and social critiques
2026-02-27 21:45 eval Evaluated by claude-haiku-4-5: +0.70 (Strong positive) -0.05
2026-02-27 21:31 eval_success Light evaluated: Strong positive (0.70) - -
2026-02-27 21:31 eval Evaluated by llama-4-scout-wai: +0.70 (Strong positive) +0.10
reasoning
Editorial content with leftist perspective and social critiques
2026-02-27 21:30 eval Evaluated by claude-haiku-4-5: +0.75 (Strong positive) +0.07
2026-02-27 21:08 eval Evaluated by claude-haiku-4-5: +0.68 (Strong positive) -0.04
2026-02-27 20:58 eval Evaluated by claude-haiku-4-5: +0.72 (Strong positive) -0.08
2026-02-27 19:05 eval Evaluated by deepseek-v3.2: +0.34 (Moderate positive) 10,556 tokens +0.23
2026-02-27 16:49 eval Evaluated by deepseek-v3.2: +0.11 (Mild positive) 10,494 tokens
2026-02-27 16:18 eval Evaluated by llama-4-scout-wai: +0.60 (Strong positive)
reasoning
Editorial content with leftist perspective and social critiques
2026-02-27 13:00 eval Evaluated by claude-haiku-4-5: +0.80 (Strong positive) 0.00
2026-02-27 12:59 eval Evaluated by claude-haiku-4-5: +0.80 (Strong positive)