| 2026-03-12 14:53 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 14:53 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 14:39 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 14:39 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 10:46 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 10:46 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 10:25 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 10:25 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 09:27 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 09:27 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 09:16 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 09:16 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 08:47 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 08:47 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 08:40 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 08:40 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 08:12 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 08:12 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 08:04 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 08:04 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 07:37 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 07:37 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 07:29 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 07:29 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 07:01 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 07:01 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 06:55 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 06:55 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 06:24 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 06:24 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) 0.00 | |
| 2026-03-12 06:20 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 06:20 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) 0.00 | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |
| 2026-03-12 05:46 | eval_success | PSQ evaluated: g-PSQ=-0.400 (3 dims) | - - |
| 2026-03-12 05:46 | eval | Evaluated by llama-4-scout-wai-psq: -0.40 (Moderate negative) | |
| 2026-03-12 05:45 | eval_success | Lite evaluated: Moderate negative (-0.43) | - - |
| 2026-03-12 05:45 | eval | Evaluated by llama-4-scout-wai: -0.43 (Moderate negative) | |
| reasoning Policy analysis on US-Iran conflict, no explicit human rights discussion |