| 2026-03-08 19:25 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 19:25 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 19:12 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 19:12 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 18:55 | model_divergence | Cross-model spread 0.28 exceeds threshold (2 models) | - - |
| 2026-03-08 18:55 | eval_success | Lite evaluated: Mild negative (-0.16) | - - |
| 2026-03-08 18:55 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-08 18:55 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 18:04 | eval_success | Lite evaluated: Mild positive (0.12) | - - |
| 2026-03-08 18:04 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.12 (Mild positive) 0.00 | |
| reasoning Investigative journalism on human rights abuse |
| 2026-03-08 16:35 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 16:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 16:22 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-08 16:22 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-08 15:59 | eval_success | Lite evaluated: Mild negative (-0.16) | - - |
| 2026-03-08 15:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-08 15:59 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-08 15:47 | eval_success | Lite evaluated: Mild positive (0.12) | - - |
| 2026-03-08 15:47 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.12 (Mild positive) +0.12 | |
| reasoning Investigative journalism on human rights abuse |
| 2026-03-07 19:18 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-07 19:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-07 19:13 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-07 19:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-07 18:29 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-07 18:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-07 17:40 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-07 17:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-07 17:25 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-07 17:25 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 22:17 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-06 22:17 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 22:16 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-06 22:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 21:29 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-06 21:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 21:26 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-06 21:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 21:24 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 20:48 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 20:45 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 20:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 20:07 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) 0.00 | |
| 2026-03-06 19:33 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:32 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.24 (Mild negative) | |
| 2026-03-06 19:32 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) | |
| 2026-03-06 19:28 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:22 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:17 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:12 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:02 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Editorial stance on US responsibility in Iran school strike, transparency indicators |
| 2026-03-06 19:01 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Investigative journalism on human rights abuse |