| 2026-03-01 18:01 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 18:01 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-03-01 16:59 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 16:59 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-03-01 16:40 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 16:40 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-03-01 15:22 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 15:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-03-01 15:09 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 15:09 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-03-01 13:21 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 13:21 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-03-01 13:19 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 13:19 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-03-01 04:35 | credit_exhausted | Credit balance too low, pausing provider for 30 min | - - |
| 2026-03-01 02:22 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 02:22 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-03-01 01:41 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 01:41 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-03-01 01:36 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 01:36 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-03-01 01:35 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-03-01 01:35 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 23:12 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 23:12 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 22:47 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-02-28 22:47 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 22:19 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 22:19 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 22:03 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-02-28 22:03 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 21:34 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 21:34 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 21:14 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-02-28 21:14 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 21:09 | eval_success | Lite evaluated: Mild positive (0.20) | - - |
| 2026-02-28 21:09 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 20:48 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 20:48 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 20:20 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 19:59 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 19:54 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 19:31 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 19:07 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 18:50 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 18:44 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 18:27 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 18:18 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 18:01 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 17:56 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 17:51 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 17:31 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 17:26 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 17:03 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 16:57 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 16:36 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 16:32 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 15:42 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 15:35 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 09:02 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 08:59 |
eval
|
Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.90 | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 08:57 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial criticizes UK policy |
| 2026-02-28 05:23 |
eval
|
Evaluated by claude-haiku-4-5: +0.36 (Moderate positive) | |
| 2026-02-28 02:54 |
eval
|
Evaluated by llama-4-scout-wai: -0.50 (Moderate negative) | |
| reasoning Editorial criticizing UK's digital-only travel rules |
| 2026-02-28 02:33 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.20 (Mild positive) | |
| reasoning Editorial criticizes UK policy |