| 2026-03-01 08:06 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 08:06 | model_divergence | Cross-model spread 0.34 exceeds threshold (3 models) | - - |
| 2026-03-01 08:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 08:05 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 08:05 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 08:05 | model_divergence | Cross-model spread 0.34 exceeds threshold (2 models) | - - |
| 2026-03-01 07:51 | eval_success | Evaluated: Moderate positive (0.36) | - - |
| 2026-03-01 07:51 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 1R | - - |
| 2026-03-01 07:51 |
eval
|
Evaluated by deepseek-v3.2: +0.36 (Moderate positive) 14,021 tokens +0.29 | |
| 2026-03-01 07:50 | eval_success | Evaluated: Neutral (0.07) | - - |
| 2026-03-01 07:50 |
eval
|
Evaluated by deepseek-v3.2: +0.07 (Neutral) 14,883 tokens -0.26 | |
| 2026-03-01 07:11 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 07:11 | model_divergence | Cross-model spread 0.34 exceeds threshold (3 models) | - - |
| 2026-03-01 07:11 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 07:10 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 07:10 | model_divergence | Cross-model spread 0.34 exceeds threshold (2 models) | - - |
| 2026-03-01 07:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 06:26 | model_divergence | Cross-model spread 0.34 exceeds threshold (3 models) | - - |
| 2026-03-01 06:26 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 06:26 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 06:26 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 06:26 | model_divergence | Cross-model spread 0.34 exceeds threshold (2 models) | - - |
| 2026-03-01 06:26 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 05:45 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 05:45 | model_divergence | Cross-model spread 0.34 exceeds threshold (2 models) | - - |
| 2026-03-01 05:45 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 05:39 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 05:39 | model_divergence | Cross-model spread 0.34 exceeds threshold (2 models) | - - |
| 2026-03-01 05:39 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 05:33 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 05:33 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 05:06 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 05:01 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 04:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 04:20 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 04:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 04:01 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 03:32 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 03:11 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 03:02 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 02:35 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 02:23 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 01:44 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 01:37 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 01:05 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 01:00 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-03-01 00:12 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-03-01 00:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 23:27 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-02-28 23:26 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 22:31 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-02-28 22:31 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 16:27 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 16:22 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 15:15 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) -0.10 | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-02-28 14:17 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning tech report with no rights stance |
| 2026-02-28 14:12 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning tech report with no rights stance |
| 2026-02-26 23:06 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) | |
| reasoning ED, neutral tech article, no rights stance |
| 2026-02-26 10:35 |
eval
|
Evaluated by deepseek-v3.2: +0.32 (Neutral) 13,825 tokens | |
| 2026-02-26 04:19 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.34 (Neutral) 16,151 tokens +0.05 | |
| 2026-02-26 03:17 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.29 (Mild positive) 17,979 tokens | |