| 2026-03-06 22:46 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 22:46 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 22:41 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 22:41 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 22:25 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 22:25 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 17:30 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 17:30 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 17:25 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 17:25 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) -0.05 | |
| 2026-03-06 16:56 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 16:56 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 16:47 | eval_success | PSQ evaluated: g-PSQ=0.474 (3 dims) | - - |
| 2026-03-06 16:47 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.05 | |
| 2026-03-06 16:20 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 16:20 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 16:10 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 16:10 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 16:05 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 16:05 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 15:44 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 15:44 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 15:28 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 15:28 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 15:06 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 15:06 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 14:52 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 14:52 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 14:23 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 14:23 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 14:09 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 14:09 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 13:46 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 13:46 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 13:31 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 13:31 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 13:07 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-06 13:07 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 12:58 | eval_success | PSQ evaluated: g-PSQ=0.419 (3 dims) | - - |
| 2026-03-06 12:58 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 12:53 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 12:31 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 12:20 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 11:55 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 11:47 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 11:43 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 11:21 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 11:10 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 10:49 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 10:44 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 10:36 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 10:09 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 10:05 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 10:04 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 09:32 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) 0.00 | |
| 2026-03-06 09:30 | eval | Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) | |
| 2026-03-06 09:28 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.42 (Moderate positive) | |
| 2026-03-06 09:20 | eval | Evaluated by llama-4-scout-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Blog post about implementing anti-bot measures, no explicit human rights discussion |
| 2026-03-06 09:15 | eval | Evaluated by llama-4-scout-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Blog post about implementing anti-bot measures, no explicit human rights discussion |
| 2026-03-06 09:10 | eval | Evaluated by llama-4-scout-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Blog post about implementing anti-bot measures, no explicit human rights discussion |
| 2026-03-06 09:05 | eval | Evaluated by llama-4-scout-wai: -0.26 (Mild negative) 0.00 | |
| reasoning Blog post about implementing anti-bot measures, no explicit human rights discussion |
| 2026-03-06 09:00 | eval | Evaluated by llama-4-scout-wai: -0.26 (Mild negative) | |
| reasoning Blog post about implementing anti-bot measures, no explicit human rights discussion |
| 2026-03-06 08:59 | eval | Evaluated by llama-3.3-70b-wai: -0.26 (Mild negative) | |
| reasoning Technical blog post on anti-scraping measures |