| 2026-03-16 00:19 | eval_success | Evaluated: Neutral (0.00) | - - |
| 2026-03-16 00:19 | eval | Evaluated by claude-haiku-4-5-20251001: 0.00 (Neutral) 16,962 tokens -0.05 | |
| 2026-03-16 00:19 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 31W 31R | - - |
| 2026-03-16 00:16 | eval_success | Evaluated: Neutral (0.05) | - - |
| 2026-03-16 00:16 | model_divergence | Cross-model spread 0.29 exceeds threshold (2 models) | - - |
| 2026-03-16 00:16 | eval | Evaluated by claude-haiku-4-5-20251001: +0.05 (Neutral) 17,512 tokens | |
| 2026-03-16 00:16 | rater_validation_warn | Validation warnings for model claude-haiku-4-5-20251001: 0W 1R | - - |
| 2026-03-06 17:56 | eval_success | PSQ evaluated: g-PSQ=-0.060 (3 dims) | - - |
| 2026-03-06 17:56 | eval | Evaluated by llama-4-scout-wai-psq: -0.06 (Neutral) 0.00 | |
| 2026-03-06 17:44 | eval_success | PSQ evaluated: g-PSQ=-0.080 (3 dims) | - - |
| 2026-03-06 17:44 | eval | Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00 | |
| 2026-03-06 04:26 | eval_success | PSQ evaluated: g-PSQ=-0.060 (3 dims) | - - |
| 2026-03-06 04:26 | eval | Evaluated by llama-4-scout-wai-psq: -0.06 (Neutral) | |
| 2026-03-06 04:26 | eval_success | PSQ evaluated: g-PSQ=-0.080 (3 dims) | - - |
| 2026-03-06 04:26 | eval | Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) | |
| 2026-03-05 20:13 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-05 20:13 | eval | Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Blog post with no explicit rights discussion, low transparency |
| 2026-03-05 20:08 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-05 20:08 | eval | Evaluated by llama-4-scout-wai: -0.24 (Mild negative) | |
| reasoning Blog post with no explicit rights discussion, low transparency |
| 2026-03-05 20:06 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-05 20:06 | eval | Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Empty blog post |
| 2026-03-05 20:06 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-05 19:39 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-05 19:39 | eval | Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Empty blog post |
| 2026-03-05 19:39 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-05 19:34 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-05 19:34 | eval | Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Empty blog post |
| 2026-03-05 19:34 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-05 19:28 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-05 19:28 | eval | Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Empty blog post |
| 2026-03-05 19:28 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-05 19:22 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-05 19:22 | eval | Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) | |
| reasoning Empty blog post |