| 2026-03-05 17:41 | eval_success | PSQ evaluated: g-PSQ=-0.559 (3 dims) | - - |
| 2026-03-05 17:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.56 (Moderate negative) 0.00 | |
| 2026-03-05 17:36 | eval_success | PSQ evaluated: g-PSQ=-0.559 (3 dims) | - - |
| 2026-03-05 17:36 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.56 (Moderate negative) | |
| 2026-03-05 17:31 | eval_success | PSQ evaluated: g-PSQ=0.026 (3 dims) | - - |
| 2026-03-05 17:31 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.03 (Neutral) 0.00 | |
| 2026-03-05 17:26 | eval_success | PSQ evaluated: g-PSQ=0.026 (3 dims) | - - |
| 2026-03-05 17:26 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.03 (Neutral) | |
| 2026-03-04 16:55 | model_divergence | Cross-model spread 0.28 exceeds threshold (2 models) | - - |
| 2026-03-04 16:55 | eval_success | Lite evaluated: Moderate negative (-0.42) | - - |
| 2026-03-04 16:55 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | |
| reasoning Policy bill with no rights discussion |
| 2026-03-04 16:55 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 16:53 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 16:53 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-04 16:53 | model_divergence | Cross-model spread 0.28 exceeds threshold (2 models) | - - |
| 2026-03-04 16:53 |
eval
|
Evaluated by llama-4-scout-wai: -0.70 (Strong negative) | |
| reasoning Legislative bill page, no explicit human rights discussion |
| 2026-03-04 16:50 | eval_success | Lite evaluated: Moderate negative (-0.42) | - - |
| 2026-03-04 16:50 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 16:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) | |
| reasoning Policy bill with no rights discussion |