| 2026-03-06 22:40 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 22:40 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 22:22 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 22:22 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 17:46 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 17:45 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 17:25 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 17:25 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 16:28 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 16:28 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 16:24 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 16:24 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) +0.18 | |
| 2026-03-06 15:51 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 15:51 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 15:36 | eval_success | PSQ evaluated: g-PSQ=0.284 (3 dims) | - - |
| 2026-03-06 15:36 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-06 15:14 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 15:14 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 15:00 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 15:00 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 14:28 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 14:28 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 14:23 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 14:23 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 14:17 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 14:17 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 13:47 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 13:47 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 13:39 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 13:39 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 13:07 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 13:07 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 13:05 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 13:05 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 13:00 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 13:00 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 12:32 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-06 12:32 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 12:27 | eval_success | PSQ evaluated: g-PSQ=0.464 (3 dims) | - - |
| 2026-03-06 12:27 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 11:55 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 11:54 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 11:50 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) -0.02 | |
| 2026-03-06 11:21 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 11:16 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.49 (Moderate positive) -0.11 | |
| 2026-03-06 10:49 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 10:43 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.60 (Strong positive) +0.32 | |
| 2026-03-06 10:13 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 10:08 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.32 | |
| 2026-03-06 09:39 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 09:36 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.60 (Strong positive) +0.32 | |
| 2026-03-06 09:04 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 09:03 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-06 08:33 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 08:32 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-06 08:28 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) +0.18 | |
| 2026-03-06 08:03 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 07:57 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-06 07:52 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-06 07:29 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 07:21 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) +0.18 | |
| 2026-03-06 06:57 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 06:50 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-06 06:25 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 06:19 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) +0.18 | |
| 2026-03-06 05:53 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 05:36 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-06 04:50 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 04:45 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-06 04:18 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-05 18:40 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-05 18:35 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-05 15:18 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-05 15:12 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) 0.00 | |
| 2026-03-05 07:49 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-05 06:53 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) +0.18 | |
| 2026-03-05 06:48 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) -0.18 | |
| 2026-03-05 04:43 | eval | Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) | |
| 2026-03-05 04:27 | eval | Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) | |
| 2026-03-05 04:02 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +1.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 03:59 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 03:16 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 03:15 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 03:11 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 02:43 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 02:30 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 02:03 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 01:57 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 01:21 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 01:17 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 01:16 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 00:42 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 00:39 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-05 00:37 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-05 00:00 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 23:57 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 23:23 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 23:22 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 22:42 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -1.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 22:41 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 22:36 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 22:03 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) +1.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 21:59 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 21:54 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 21:26 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 21:19 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 20:43 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 20:38 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 20:37 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 20:03 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 20:03 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 19:57 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 19:54 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) +0.42 | reasoning: Technical content, no rights discussion |
| 2026-03-04 19:10 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 19:08 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 18:08 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 18:07 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 18:02 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 16:40 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 16:32 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) +0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 16:02 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 15:49 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 15:44 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 15:21 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 15:06 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) +0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 14:44 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 14:24 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 14:17 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) +0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 14:06 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 14:02 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 13:37 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 13:25 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 13:04 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 12:52 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 12:22 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 12:12 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 11:44 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 11:40 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 11:34 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 10:55 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) +0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 10:46 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 10:15 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 10:09 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 09:37 | eval | Evaluated by llama-4-scout-wai: -1.00 (Strong negative) -0.20 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 09:32 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 09:30 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 08:55 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 08:55 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 08:50 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) -0.28 | reasoning: Technical content, no rights discussion |
| 2026-03-04 08:17 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 08:15 | eval | Evaluated by llama-3.3-70b-wai: -0.14 (Mild negative) +0.28 | reasoning: Technical content, no rights discussion |
| 2026-03-04 07:50 | eval | Evaluated by llama-4-scout-wai: -0.80 (Strong negative) -0.88 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 07:47 | eval | Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) -0.47 | reasoning: Technical content, no rights discussion |
| 2026-03-04 07:09 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 07:08 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 06:00 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 05:59 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 05:51 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 05:17 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 05:09 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 04:29 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 04:28 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 03:58 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00 | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 03:58 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) 0.00 | reasoning: Technical content, no rights discussion |
| 2026-03-04 03:20 | eval | Evaluated by llama-4-scout-wai: +0.08 (Neutral) | reasoning: Technical GitHub page, no human rights discussion |
| 2026-03-04 03:20 | eval | Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) | reasoning: Technical content, no rights discussion |