Model Comparison
Model Editorial Structural Class Conf SETL Theme
@cf/meta/llama-4-scout-17b-16e-instruct lite +0.90 ND Strong positive 1.00 0.00 Human Rights
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.96 ND Strong positive 0.99 0.00 Human Rights Defense
claude-haiku-4-5 lite +0.92 ND Strong positive 0.95 0.00 Global human rights advocacy
deepseek/deepseek-v3.2-20251201 +0.75 +0.53 Strong positive 0.14 0.41 Human Rights Advocacy
meta-llama/llama-3.3-70b-instruct:free ND ND
Section @cf/meta/llama-4-scout-17b-16e-instruct lite @cf/meta/llama-3.3-70b-instruct-fp8-fast lite claude-haiku-4-5 lite deepseek/deepseek-v3.2-20251201 meta-llama/llama-3.3-70b-instruct:free
Preamble ND ND ND 0.76 ND
Article 1 ND ND ND 0.95 ND
Article 2 ND ND ND ND ND
Article 3 ND ND ND ND ND
Article 4 ND ND ND ND ND
Article 5 ND ND ND 0.85 ND
Article 6 ND ND ND ND ND
Article 7 ND ND ND ND ND
Article 8 ND ND ND ND ND
Article 9 ND ND ND ND ND
Article 10 ND ND ND ND ND
Article 11 ND ND ND ND ND
Article 12 ND ND ND ND ND
Article 13 ND ND ND ND ND
Article 14 ND ND ND ND ND
Article 15 ND ND ND ND ND
Article 16 ND ND ND ND ND
Article 17 ND ND ND ND ND
Article 18 ND ND ND ND ND
Article 19 ND ND ND 0.96 ND
Article 20 ND ND ND ND ND
Article 21 ND ND ND ND ND
Article 22 ND ND ND ND ND
Article 23 ND ND ND ND ND
Article 24 ND ND ND ND ND
Article 25 ND ND ND ND ND
Article 26 ND ND ND ND ND
Article 27 ND ND ND ND ND
Article 28 ND ND ND ND ND
Article 29 ND ND ND ND ND
Article 30 ND ND ND ND ND
+0.90 [CAL-LITE] Human Rights Watch (EP-3) (www.hrw.org)
0 points 3 days ago | 0 comments on HN | Strong positive ~lite vlite-1.4
Summary ~lite Human Rights Champions
Human Rights Watch advocates for human rights worldwide
EQ 0.90
SO 0.80
TD 0.70
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 18 evals
+1 0 −1 HN
Audit Trail 38 entries
2026-02-28 11:13 eval_success Lite evaluated: Strong positive (0.90) - -
2026-02-28 11:13 eval Evaluated by llama-4-scout-wai: +0.90 (Strong positive) -0.10
reasoning
HRW homepage, explicit human rights advocacy
2026-02-28 11:13 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-02-28 09:04 eval_success Light evaluated: Strong positive (0.96) - -
2026-02-28 09:04 eval Evaluated by llama-3.3-70b-wai: +0.96 (Strong positive) -0.04
reasoning
HRW editorial stance
2026-02-28 09:03 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:17 eval Evaluated by claude-haiku-4-5: +0.92 (Strong positive) -0.03
2026-02-28 01:40 dlq Dead-lettered after 1 attempts: [CAL-LIGHT] Human Rights Watch (EP-3) - -
2026-02-28 01:38 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:36 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-28 01:36 eval_failure Evaluation failed: Error: OpenRouter API error 400: {"error":{"message":"Provider returned error","code":400,"metadata":{"raw":"{\"details\":{\"_errors\":[\"response_format is not supported by this model\"]},\"issues\": - -
2026-02-28 01:36 eval_retry OpenRouter error 400 model=llama-3.3-70b - -
2026-02-28 01:36 dlq_replay DLQ message 97626 replayed to LLAMA_QUEUE: [CAL-LIGHT] Human Rights Watch (EP-3) - -
2026-02-28 00:52 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-28 00:52 eval Evaluated by llama-4-scout-wai: +1.00 (Strong positive) 0.00
reasoning
HRW homepage, explicit human rights advocacy
2026-02-28 00:46 eval Evaluated by claude-haiku-4-5: +0.95 (Strong positive) 0.00
2026-02-28 00:41 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-28 00:41 eval Evaluated by llama-3.3-70b-wai: +1.00 (Strong positive) 0.00
reasoning
HRW editorial stance
2026-02-28 00:29 eval Evaluated by claude-haiku-4-5: +0.95 (Strong positive) 0.00
2026-02-28 00:12 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-28 00:12 eval Evaluated by llama-4-scout-wai: +1.00 (Strong positive) 0.00
reasoning
HRW homepage, explicit human rights advocacy
2026-02-28 00:11 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-28 00:11 eval Evaluated by llama-3.3-70b-wai: +1.00 (Strong positive)
reasoning
HRW editorial stance
2026-02-28 00:02 eval Evaluated by claude-haiku-4-5: +0.95 (Strong positive) +0.05
2026-02-27 21:51 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-27 21:51 eval Evaluated by llama-4-scout-wai: +1.00 (Strong positive) 0.00
reasoning
HRW homepage, explicit human rights advocacy
2026-02-27 21:47 eval Evaluated by claude-haiku-4-5: +0.90 (Strong positive) -0.05
2026-02-27 21:36 eval_success Light evaluated: Strong positive (1.00) - -
2026-02-27 21:36 eval Evaluated by llama-4-scout-wai: +1.00 (Strong positive)
reasoning
HRW homepage, explicit human rights advocacy
2026-02-27 21:32 eval Evaluated by claude-haiku-4-5: +0.95 (Strong positive) 0.00
2026-02-27 21:10 eval Evaluated by claude-haiku-4-5: +0.95 (Strong positive) +0.03
2026-02-27 21:01 eval Evaluated by claude-haiku-4-5: +0.92 (Strong positive) 0.00
2026-02-27 19:07 dlq Dead-lettered after 1 attempts: [CAL-LIGHT] Human Rights Watch (EP-3) - -
2026-02-27 19:05 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-27 19:04 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-27 19:03 rate_limit OpenRouter rate limited (429) model=llama-3.3-70b - -
2026-02-27 15:22 eval Evaluated by deepseek-v3.2: +0.77 (Strong positive) 8,617 tokens
2026-02-27 13:01 eval Evaluated by claude-haiku-4-5: +0.92 (Strong positive)