| 2026-03-01 18:00 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-03-01 18:00 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) 0.00 | reasoning: Editorial against restrictions |
| 2026-03-01 17:04 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-03-01 17:04 | eval | Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) 0.00 | reasoning: Editorial mildly critical of government control, rights concerns |
| 2026-03-01 16:59 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-03-01 16:59 | eval | Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) 0.00 | reasoning: Editorial mildly critical of government control, rights concerns |
| 2026-03-01 16:38 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-03-01 16:38 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) 0.00 | reasoning: Editorial against restrictions |
| 2026-03-01 15:22 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-03-01 15:22 | eval | Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) 0.00 | reasoning: Editorial mildly critical of government control, rights concerns |
| 2026-03-01 15:09 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-03-01 15:09 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) 0.00 | reasoning: Editorial against restrictions |
| 2026-02-28 12:09 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-02-28 12:09 | eval | Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) -0.34 | reasoning: Editorial mildly critical of government control, rights concerns |
| 2026-02-28 12:09 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 0W 1R | - - |
| 2026-02-28 12:09 | eval_success | Lite evaluated: Strong positive (0.60) | - - |
| 2026-02-28 12:09 | eval | Evaluated by llama-3.3-70b-wai: +0.60 (Strong positive) +0.10 | reasoning: Editorial against restrictions |
| 2026-02-28 12:09 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R | - - |
| 2026-02-28 04:16 | eval_success | Light evaluated: Strong positive (0.70) | - - |
| 2026-02-28 04:16 | eval | Evaluated by llama-4-scout-wai: +0.70 (Strong positive) | reasoning: Editorial mildly critical of government control, rights concerns |
| 2026-02-28 04:09 | eval_success | Light evaluated: Moderate positive (0.50) | - - |
| 2026-02-28 04:09 | eval | Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) | reasoning: Editorial against restrictions |