| 2026-03-02 16:39 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-02 16:39 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-02 16:34 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-02 16:34 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-02 16:22 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-02 16:22 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 22:27 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 22:27 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 22:08 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 22:08 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 21:45 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 21:45 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 21:40 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 21:40 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 21:34 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 21:34 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 21:18 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 21:18 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 21:00 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 21:00 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 20:55 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 20:55 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 20:43 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 20:43 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 20:10 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-01 20:10 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 20:01 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 20:01 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 19:27 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 19:27 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 19:17 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 19:17 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 18:45 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 18:45 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 18:36 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 18:36 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 17:42 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 17:42 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 17:37 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-01 17:37 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 17:33 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 16:18 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 16:15 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 16:13 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 15:34 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 15:33 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 14:31 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 14:28 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 14:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 13:43 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 13:36 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 13:34 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 12:57 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 12:54 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 12:52 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 12:11 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 12:11 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 11:29 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 11:29 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 10:25 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 10:23 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 10:18 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 09:42 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 09:36 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-03-01 09:00 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) +0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 08:55 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-03-01 08:52 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) -0.10 | |
| reasoning News article on ICE ban |
| 2026-02-28 19:09 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 18:43 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 18:31 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 18:26 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 18:15 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 18:01 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 17:50 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 17:37 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 17:33 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 17:27 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) +0.10 | |
| reasoning News article on ICE ban |
| 2026-02-28 17:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 17:00 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 16:56 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 16:55 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) -0.10 | |
| reasoning News article on ICE ban |
| 2026-02-28 16:28 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 16:28 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.30 (Moderate positive) +0.10 | |
| reasoning News article on ICE ban |
| 2026-02-28 15:06 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 15:05 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning News article on ICE ban |
| 2026-02-28 09:19 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) -0.20 | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 09:18 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) -0.30 | |
| reasoning News article on ICE ban |
| 2026-02-28 03:48 |
eval
|
Evaluated by llama-4-scout-wai: +0.50 (Moderate positive) | |
| reasoning ED, implicit rights discussion |
| 2026-02-28 03:30 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) | |
| reasoning News article on ICE ban |