| 2026-03-09 06:19 | eval_success | Lite evaluated: Moderate negative (-0.40) | - - |
| 2026-03-09 06:19 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-09 06:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 06:19 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 06:17 | eval_success | PSQ evaluated: g-PSQ=0.198 (3 dims) | - - |
| 2026-03-09 06:17 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.20 (Mild positive) 0.00 | |
| 2026-03-09 06:16 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-09 06:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 06:14 | eval_success | Lite evaluated: Moderate negative (-0.40) | - - |
| 2026-03-09 06:14 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-09 06:14 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 06:14 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 06:12 | eval_success | PSQ evaluated: g-PSQ=0.198 (3 dims) | - - |
| 2026-03-09 06:12 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.20 (Mild positive) 0.00 | |
| 2026-03-09 06:11 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-09 06:11 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Science news article, no rights discussion |
| 2026-03-09 06:11 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-09 05:08 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-09 05:08 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 05:07 | eval_success | Lite evaluated: Moderate negative (-0.40) | - - |
| 2026-03-09 05:07 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 05:07 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-09 05:05 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-09 05:05 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-09 05:05 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Science news article, no rights discussion |
| 2026-03-09 05:04 | eval_success | PSQ evaluated: g-PSQ=0.198 (3 dims) | - - |
| 2026-03-09 05:04 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.20 (Mild positive) 0.00 | |
| 2026-03-09 05:00 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-09 05:00 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-09 05:00 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Science news article, no rights discussion |
| 2026-03-09 04:01 | eval_success | Lite evaluated: Moderate negative (-0.40) | - - |
| 2026-03-09 04:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 04:01 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-09 03:57 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.20 (Mild positive) 0.00 | |
| 2026-03-09 03:56 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) 0.00 | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 03:54 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Science news article, no rights discussion |
| 2026-03-09 02:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-09 02:50 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.20 (Mild positive) | |
| 2026-03-09 02:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.40 (Moderate negative) | |
| reasoning Science news article on NASA's DART spacecraft, no explicit rights discussion |
| 2026-03-09 02:47 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Science news article, no rights discussion |