| 2026-03-01 17:58 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 17:58 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-03-01 16:58 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-01 16:58 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) 0.00 | |
| reasoning ED, consumer protection with implicit rights relevance |
| 2026-03-01 16:37 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 16:37 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-03-01 15:22 | eval_success | Lite evaluated: Moderate positive (0.30) | - - |
| 2026-03-01 15:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) -0.06 | |
| reasoning ED, consumer protection with implicit rights relevance |
| 2026-03-01 15:08 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 15:08 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-02-28 16:09 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-02-28 16:09 |
eval
|
Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) 0.00 | |
| reasoning ED, consumer protection with implicit rights relevance |
| 2026-02-28 16:09 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 16:09 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-02-28 16:04 | eval_success | Lite evaluated: Moderate positive (0.36) | - - |
| 2026-02-28 16:04 |
eval
|
Evaluated by llama-4-scout-wai: +0.36 (Moderate positive) +0.06 | |
| reasoning ED, consumer protection with implicit rights relevance |
| 2026-02-28 16:04 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 16:04 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-02-28 11:25 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 11:25 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 0W 1R | - - |
| 2026-02-28 11:25 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning PO exposes price fixing |
| 2026-02-28 09:20 | eval_success | Light evaluated: Moderate positive (0.30) | - - |
| 2026-02-28 09:20 |
eval
|
Evaluated by llama-4-scout-wai: +0.30 (Moderate positive) -0.30 | |
| reasoning ED, consumer protection with implicit rights relevance |
| 2026-02-28 09:20 | rater_validation_warn | Light validation warnings for model llama-4-scout-wai: 0W 1R | - - |
| 2026-02-28 09:20 | eval_success | Light evaluated: Moderate positive (0.40) | - - |
| 2026-02-28 09:20 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.40 (Moderate positive) | |
| reasoning PO exposes price fixing |
| 2026-02-28 09:20 | rater_validation_warn | Light validation warnings for model llama-3.3-70b-wai: 0W 1R | - - |
| 2026-02-28 08:54 | credit_exhausted | Credit balance too low, pausing provider for 30 min | - - |
| 2026-02-28 04:22 | eval_success | Light evaluated: Strong positive (0.60) | - - |
| 2026-02-28 04:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.60 (Strong positive) | |
| reasoning ED, consumer protection with implicit rights relevance |