Model Comparison
| Model | Editorial | Structural | Class | Conf | SETL | Theme |
|---|---|---|---|---|---|---|
| claude-haiku-4-5-20251001 | ND | ND | — | — | — | — |
| @cf/meta/llama-4-scout-17b-16e-instruct lite | +0.10 | ND | Mild positive | 0.80 | 0.00 | Energy Policy |
| @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | +0.10 | ND | Mild positive | 0.70 | 0.00 | Nuclear Power |

| Section | claude-haiku-4-5-20251001 | @cf/meta/llama-4-scout-17b-16e-instruct lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite |
|---|---|---|---|
| Preamble | ND | ND | ND |
| Article 1 | ND | ND | ND |
| Article 2 | ND | ND | ND |
| Article 3 | ND | ND | ND |
| Article 4 | ND | ND | ND |
| Article 5 | ND | ND | ND |
| Article 6 | ND | ND | ND |
| Article 7 | ND | ND | ND |
| Article 8 | ND | ND | ND |
| Article 9 | ND | ND | ND |
| Article 10 | ND | ND | ND |
| Article 11 | ND | ND | ND |
| Article 12 | ND | ND | ND |
| Article 13 | ND | ND | ND |
| Article 14 | ND | ND | ND |
| Article 15 | ND | ND | ND |
| Article 16 | ND | ND | ND |
| Article 17 | ND | ND | ND |
| Article 18 | ND | ND | ND |
| Article 19 | ND | ND | ND |
| Article 20 | ND | ND | ND |
| Article 21 | ND | ND | ND |
| Article 22 | ND | ND | ND |
| Article 23 | ND | ND | ND |
| Article 24 | ND | ND | ND |
| Article 25 | ND | ND | ND |
| Article 26 | ND | ND | ND |
| Article 27 | ND | ND | ND |
| Article 28 | ND | ND | ND |
| Article 29 | ND | ND | ND |
| Article 30 | ND | ND | ND |

Summary ~lite Energy Policy Acknowledges Reports on Japan's nuclear power public opinion shift.
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 2 evals
Audit Trail
6 entries

| Timestamp | Event | Details |
|---|---|---|
| 2026-02-28 11:02 | eval_success | Lite evaluated: Mild positive (0.10) |
| 2026-02-28 11:02 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R |
| 2026-02-28 11:02 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive). Reasoning: ED neutral news reporting on energy policy |
| 2026-02-28 10:57 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R |
| 2026-02-28 10:57 | eval_success | Lite evaluated: Mild positive (0.10) |
| 2026-02-28 10:57 | eval | Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive). Reasoning: News article on Japan nuclear power |