| 2026-03-01 23:34 | eval_success | Evaluated: Mild positive (0.18) | - - |
| 2026-03-01 23:34 |
eval
|
Evaluated by deepseek-v3.2: +0.18 (Mild positive) 10,932 tokens +0.07 | |
| 2026-03-01 19:27 | eval_success | Evaluated: Mild positive (0.11) | - - |
| 2026-03-01 19:27 |
eval
|
Evaluated by deepseek-v3.2: +0.11 (Mild positive) 10,299 tokens +0.08 | |
| 2026-03-01 08:55 | eval_success | Evaluated: Neutral (0.03) | - - |
| 2026-03-01 08:55 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 11,046 tokens +0.00 | |
| 2026-03-01 08:36 | eval_success | Evaluated: Neutral (0.03) | - - |
| 2026-03-01 08:36 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 10,799 tokens -0.13 | |
| 2026-03-01 06:59 | eval_success | Evaluated: Mild positive (0.15) | - - |
| 2026-03-01 06:59 |
eval
|
Evaluated by deepseek-v3.2: +0.15 (Mild positive) 10,855 tokens +0.14 | |
| 2026-03-01 06:59 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 2R | - - |
| 2026-03-01 06:52 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 06:52 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 06:43 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 06:43 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 06:38 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 06:38 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 06:34 | eval_success | Evaluated: Neutral (0.01) | - - |
| 2026-03-01 06:34 |
eval
|
Evaluated by deepseek-v3.2: +0.01 (Neutral) 11,116 tokens | |
| 2026-03-01 06:06 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 06:06 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 05:56 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 05:56 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 05:23 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 05:23 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 05:18 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 05:18 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 04:34 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 04:34 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 04:28 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 04:28 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 04:24 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 04:24 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 03:54 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 03:54 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 03:37 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 03:37 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 03:32 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 03:32 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 03:10 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 03:04 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 03:01 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 02:56 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 02:20 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 02:15 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 02:12 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 01:30 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 01:29 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 00:45 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-03-01 00:45 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 00:39 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-03-01 00:04 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 23:50 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 23:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 23:05 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 22:17 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 22:08 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 21:35 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 21:30 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 21:23 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 20:46 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 20:31 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 19:57 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 19:45 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 19:40 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 19:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 18:57 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 18:36 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 18:25 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 18:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 17:59 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 17:45 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 17:35 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 17:21 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 17:07 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 16:54 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 16:40 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 16:35 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 16:29 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 15:37 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 15:24 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 13:23 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.19 (Mild positive) +0.02 | |
| 2026-02-28 12:13 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.18 (Mild positive) +0.05 | |
| 2026-02-28 11:03 |
eval
|
Evaluated by claude-haiku-4-5-20251001: +0.13 (Mild positive) | |
| 2026-02-28 08:58 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 08:53 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) +0.10 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 08:48 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 07:39 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 06:25 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 04:44 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 03:26 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 02:14 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 01:42 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 01:40 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 01:36 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 01:10 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Neutral tech report |
| 2026-02-28 01:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 01:02 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 00:55 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Neutral tech report |
| 2026-02-28 00:51 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial stance on MitID service status update |
| 2026-02-28 00:45 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Editorial stance on MitID service status update |