| 2026-03-02 17:32 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 17:32 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-02 17:32 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 16:36 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 16:36 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) +0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 16:32 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-02 16:32 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 16:11 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 16:11 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 11:04 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-02 11:04 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 11:04 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 10:58 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 10:58 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-02 10:58 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 10:52 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 10:52 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 10:08 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 10:08 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-02 10:08 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 10:08 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 10:08 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 09:25 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 09:25 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 09:25 | model_divergence | Cross-model spread 0.40 exceeds threshold (2 models) | - - |
| 2026-03-02 09:24 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 09:24 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) +0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 08:33 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 08:33 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 08:30 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-02 08:30 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 07:55 | eval_success | Lite evaluated: Mild negative (-0.20) | - - |
| 2026-03-02 07:55 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 07:48 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 07:48 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 07:09 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 07:05 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 06:27 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 06:22 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 06:20 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) +0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 06:15 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 05:50 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 05:42 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 05:07 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 05:02 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) +0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 04:33 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 04:19 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 03:50 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 03:34 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 03:27 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 03:06 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 02:47 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 02:28 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 02:07 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 01:40 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 01:35 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 01:13 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 01:08 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 00:55 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-02 00:27 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-02 00:10 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 23:32 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) +0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 23:19 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 22:41 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 22:36 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 22:29 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 21:59 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 21:58 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 21:10 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 21:09 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.10 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 21:06 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 20:29 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 20:25 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 19:46 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 19:42 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) 0.00 | reasoning: Editorial on hiring practices |
| 2026-03-01 19:41 | eval | Evaluated by llama-4-scout-wai: +0.20 (Mild positive) | reasoning: Editorial on hiring trends, implicit bias concerns |
| 2026-03-01 19:38 | eval | Evaluated by llama-3.3-70b-wai: -0.20 (Mild negative) | reasoning: Editorial on hiring practices |
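
The `model_divergence` events in the log report a "cross-model spread" of 0.40 across 2 models. The log does not say how that spread is computed or what the threshold is. The sketch below assumes the spread is the largest latest score minus the smallest latest score across models, which matches +0.20 and -0.20 giving 0.40. The function name `cross_model_spread` and the value `ASSUMED_THRESHOLD = 0.30` are hypothetical and are not taken from the log.

```python
def cross_model_spread(latest_scores):
    """Return max minus min over the latest score from each model.

    Assumption: this is how the logged "cross-model spread" is defined.
    """
    values = list(latest_scores.values())
    if len(values) < 2:
        # A spread needs at least two models to compare.
        return 0.0
    # Round to two decimals to match the precision shown in the log.
    return round(max(values) - min(values), 2)


# Hypothetical threshold; the log only states that 0.40 exceeds it.
ASSUMED_THRESHOLD = 0.30

# Latest scores per model, taken from the most recent rows of the log.
latest = {"llama-3.3-70b-wai": -0.20, "llama-4-scout-wai": 0.20}

spread = cross_model_spread(latest)
diverged = spread > ASSUMED_THRESHOLD
print(f"Cross-model spread {spread:.2f} ({len(latest)} models), diverged={diverged}")
```

With the assumed strict comparison, a pairing of +0.10 and -0.20 (spread 0.30) would not trigger an event, while +0.20 and -0.20 would.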