Model Comparison
| Model | Editorial | Structural | Class | Conf | SETL | Theme |
| --- | --- | --- | --- | --- | --- | --- |
| @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | 0.00 | ND | Neutral | 0.80 | 0.00 | Work Life Balance |
| @cf/meta/llama-4-scout-17b-16e-instruct lite | +0.10 | ND | Mild positive | 0.50 | 0.00 | Productivity Business |
| deepseek/deepseek-v3.2-20251201 | +0.21 | +0.07 | Mild positive | 0.14 | 0.17 | Work & Well-being |
| meta-llama/llama-3.3-70b-instruct:free | ND | ND | — | — | — | — |
| nvidia/nemotron-3-nano-30b-a3b:free | ND | ND | — | — | — | — |

| Section | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | @cf/meta/llama-4-scout-17b-16e-instruct lite | deepseek/deepseek-v3.2-20251201 | meta-llama/llama-3.3-70b-instruct:free | nvidia/nemotron-3-nano-30b-a3b:free |
| --- | --- | --- | --- | --- | --- |
| Preamble | ND | ND | ND | ND | ND |
| Article 1 | ND | ND | 0.06 | ND | ND |
| Article 2 | ND | ND | 0.06 | ND | ND |
| Article 3 | ND | ND | 0.12 | ND | ND |
| Article 4 | ND | ND | ND | ND | ND |
| Article 5 | ND | ND | ND | ND | ND |
| Article 6 | ND | ND | ND | ND | ND |
| Article 7 | ND | ND | ND | ND | ND |
| Article 8 | ND | ND | ND | ND | ND |
| Article 9 | ND | ND | ND | ND | ND |
| Article 10 | ND | ND | ND | ND | ND |
| Article 11 | ND | ND | ND | ND | ND |
| Article 12 | ND | ND | ND | ND | ND |
| Article 13 | ND | ND | ND | ND | ND |
| Article 14 | ND | ND | ND | ND | ND |
| Article 15 | ND | ND | ND | ND | ND |
| Article 16 | ND | ND | ND | ND | ND |
| Article 17 | ND | ND | ND | ND | ND |
| Article 18 | ND | ND | ND | ND | ND |
| Article 19 | ND | ND | 0.53 | ND | ND |
| Article 20 | ND | ND | ND | ND | ND |
| Article 21 | ND | ND | ND | ND | ND |
| Article 22 | ND | ND | 0.12 | ND | ND |
| Article 23 | ND | ND | 0.18 | ND | ND |
| Article 24 | ND | ND | 0.12 | ND | ND |
| Article 25 | ND | ND | 0.06 | ND | ND |
| Article 26 | ND | ND | 0.59 | ND | ND |
| Article 27 | ND | ND | 0.39 | ND | ND |
| Article 28 | ND | ND | 0.06 | ND | ND |
| Article 29 | ND | ND | ND | ND | ND |
| Article 30 | ND | ND | ND | ND | ND |
Pending Evaluation
This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-02-26 14:03:27
Longitudinal
3 evals
Audit Trail
21 entries

| Time | Event | Detail |
| --- | --- | --- |
| 2026-02-28 13:36 | eval_success | Lite evaluated: Neutral (0.00) |
| 2026-02-28 13:36 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral). Reasoning: MX content, neutral stance |
| 2026-02-26 22:39 | eval_success | Light evaluated: Mild positive (0.10) |
| 2026-02-26 22:39 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) |
| 2026-02-26 20:02 | dlq | Dead-lettered after 1 attempts: Efficiency Is the Enemy |
| 2026-02-26 20:01 | eval_failure | Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai |
| 2026-02-26 20:01 | eval_failure | Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai |
| 2026-02-26 20:01 | dlq | Dead-lettered after 1 attempts: Efficiency Is the Enemy |
| 2026-02-26 20:00 | eval_success | Evaluated: Mild positive (0.24) |
| 2026-02-26 20:00 | eval | Evaluated by deepseek-v3.2: +0.24 (Mild positive), 15,506 tokens |
| 2026-02-26 20:00 | dlq | Dead-lettered after 1 attempts: Efficiency Is the Enemy |
| 2026-02-26 20:00 | eval_failure | Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai |
| 2026-02-26 19:59 | eval_failure | Evaluation failed: Error: Unknown model in registry: llama-4-scout-wai |
| 2026-02-26 19:59 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |
| 2026-02-26 19:58 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |
| 2026-02-26 19:57 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |
| 2026-02-26 19:50 | rater_validation_fail | Validation failed for model llama-4-scout-wai |
| 2026-02-26 19:11 | dlq | Dead-lettered after 1 attempts: Efficiency Is the Enemy |
| 2026-02-26 19:09 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |
| 2026-02-26 19:08 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |
| 2026-02-26 19:07 | rate_limit | OpenRouter rate limited (429) model=llama-3.3-70b |