| 2026-03-01 19:57 | eval_success | Evaluated: Neutral (0.01) | - - |
| 2026-03-01 19:57 |
eval
|
Evaluated by deepseek-v3.2: +0.01 (Neutral) 14,954 tokens -0.02 | |
| 2026-03-01 17:12 | eval_success | Evaluated: Neutral (0.03) | - - |
| 2026-03-01 17:12 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 15,380 tokens +0.14 | |
| 2026-03-01 15:12 | eval_success | Evaluated: Mild negative (-0.11) | - - |
| 2026-03-01 15:12 |
eval
|
Evaluated by deepseek-v3.2: -0.11 (Mild negative) 15,858 tokens -0.31 | |
| 2026-03-01 02:09 | eval_success | Evaluated: Mild positive (0.19) | - - |
| 2026-03-01 02:09 |
eval
|
Evaluated by deepseek-v3.2: +0.19 (Mild positive) 15,362 tokens -0.05 | |
| 2026-03-01 02:09 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 1R | - - |
| 2026-02-28 22:23 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 22:23 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 22:08 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 22:08 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 21:35 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 21:35 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 21:17 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 21:17 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 21:11 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 21:11 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 20:50 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 20:50 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 20:22 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 20:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 20:03 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 20:03 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) -0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 19:30 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 19:30 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 19:14 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-02-28 19:14 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) +0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 18:41 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 18:41 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 18:36 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 18:36 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 18:14 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 18:14 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 18:10 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-02-28 18:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 18:10 | eval_success | Lite evaluated: Mild positive (0.16) | - - |
| 2026-02-28 18:10 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 18:05 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 17:42 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 17:42 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 17:16 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 17:15 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 16:50 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 16:49 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 16:19 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 16:14 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 16:14 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 13:31 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 13:31 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 12:31 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) -0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 09:21 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) +0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 06:49 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 06:14 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) -0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 05:54 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 05:49 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) +0.10 | |
| reasoning Editorial on creative arts |
| 2026-02-28 05:49 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 05:45 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 05:41 |
eval
|
Evaluated by llama-4-scout-wai: +0.16 (Mild positive) +0.16 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 04:40 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 04:21 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) -0.20 | |
| reasoning Editorial on creative arts |
| 2026-02-28 03:51 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 03:11 |
eval
|
Evaluated by deepseek-v3.2: +0.24 (Mild positive) 14,630 tokens | |
| 2026-02-28 03:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 02:53 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 02:52 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 02:39 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial on creative arts |
| 2026-02-28 02:34 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) +0.20 | |
| reasoning Editorial on creative arts |
| 2026-02-28 02:24 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 02:00 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 01:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning ED Slightly negative lean on creative arts success |
| 2026-02-28 00:54 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Editorial on creative arts |