Summary ~lite · Economic History · Neutral · Discussion on historical accuracy of TV show characters' income.
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 3 evals
Audit Trail
11 entries
2026-02-28 10:16 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R
2026-02-28 10:16 | eval_success | Lite evaluated: Neutral (0.00)
2026-02-28 10:16 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | reasoning: ED neutral historical discussion
2026-02-28 10:11 | eval_success | Lite evaluated: Neutral (0.00)
2026-02-28 10:11 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) | reasoning: ED neutral historical discussion
2026-02-28 10:11 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R
2026-02-28 10:03 | eval_success | Lite evaluated: Neutral (0.00)
2026-02-28 10:03 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | reasoning: Historical discussion neutral
2026-02-28 10:03 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R
2026-02-26 05:33 | dlq | Dead-lettered after 1 attempts: How could Al Bundy afford a house when he was making minimum wage?
2026-02-26 05:20 | credit_exhausted | Credit balance too low, retrying in 356s