| 2026-03-05 06:10 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-05 06:10 | eval | Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) | |
| 2026-03-05 06:04 | eval_success | PSQ evaluated: g-PSQ=-0.240 (3 dims) | - - |
| 2026-03-05 06:04 | eval | Evaluated by llama-3.3-70b-wai-psq: -0.24 (Mild negative) | |
| 2026-03-04 17:56 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-04 17:56 | eval | Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-04 17:56 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-04 17:51 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-04 17:51 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-04 17:51 | eval | Evaluated by llama-4-scout-wai: -0.24 (Mild negative) -0.28 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-04 17:32 | eval_success | Lite evaluated: Mild negative (-0.26) | - - |
| 2026-03-04 17:32 | eval | Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) -0.28 | |
| | reasoning | News article on health risk | |
| 2026-03-04 06:54 | eval_success | Lite evaluated: Neutral (0.04) | - - |
| 2026-03-04 06:54 | eval | Evaluated by llama-4-scout-wai: +0.04 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-04 06:54 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-04 06:39 | eval_success | Lite evaluated: Neutral (0.02) | - - |
| 2026-03-04 06:39 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-04 06:35 | eval_success | Lite evaluated: Neutral (0.02) | - - |
| 2026-03-04 06:35 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 19:06 | eval_success | Lite evaluated: Neutral (0.04) | - - |
| 2026-03-03 19:06 | eval | Evaluated by llama-4-scout-wai: +0.04 (Neutral) -0.02 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 19:06 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-03 19:01 | eval_success | Lite evaluated: Neutral (0.06) | - - |
| 2026-03-03 19:01 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 19:01 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-03 18:59 | eval_success | Lite evaluated: Neutral (0.02) | - - |
| 2026-03-03 18:59 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 18:19 | eval_success | Lite evaluated: Neutral (0.02) | - - |
| 2026-03-03 18:18 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 18:18 | eval_success | Lite evaluated: Neutral (0.06) | - - |
| 2026-03-03 18:18 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 18:18 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-03 18:14 | eval_success | Lite evaluated: Neutral (0.02) | - - |
| 2026-03-03 18:14 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 17:44 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 17:41 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 17:04 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 17:01 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 16:31 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 16:26 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 15:45 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 15:39 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 15:33 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 15:06 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 14:59 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 14:25 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 14:20 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) +0.02 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 14:14 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 13:39 | eval | Evaluated by llama-4-scout-wai: +0.04 (Neutral) -0.02 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 13:36 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 13:00 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 12:55 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 12:54 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 12:14 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) 0.00 | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 12:14 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) 0.00 | |
| | reasoning | News article on health risk | |
| 2026-03-03 11:30 | eval | Evaluated by llama-4-scout-wai: +0.06 (Neutral) | |
| | reasoning | News article on health risk, no explicit rights discussion | |
| 2026-03-03 11:30 | eval | Evaluated by llama-3.3-70b-wai: +0.02 (Neutral) | |
| | reasoning | News article on health risk | |