| 2026-03-02 14:08 | eval_success | Evaluated: Mild positive (0.19) | - - |
| 2026-03-02 14:08 |
eval
|
Evaluated by deepseek-v3.2: +0.19 (Mild positive) 11,476 tokens -0.17 | |
| 2026-03-02 13:59 | eval_success | Evaluated: Moderate positive (0.36) | - - |
| 2026-03-02 13:59 |
eval
|
Evaluated by deepseek-v3.2: +0.36 (Moderate positive) 11,465 tokens +0.24 | |
| 2026-03-02 13:45 | eval_success | Evaluated: Mild positive (0.12) | - - |
| 2026-03-02 13:45 |
eval
|
Evaluated by deepseek-v3.2: +0.12 (Mild positive) 14,779 tokens -0.11 | |
| 2026-03-02 13:42 | eval_success | Evaluated: Mild positive (0.23) | - - |
| 2026-03-02 13:42 |
eval
|
Evaluated by deepseek-v3.2: +0.23 (Mild positive) 11,753 tokens -0.13 | |
| 2026-03-02 13:26 | eval_success | Evaluated: Moderate positive (0.36) | - - |
| 2026-03-02 13:26 |
eval
|
Evaluated by deepseek-v3.2: +0.36 (Moderate positive) 12,088 tokens +0.36 | |
| 2026-03-02 13:19 | eval_success | Evaluated: Neutral (0.01) | - - |
| 2026-03-02 13:19 |
eval
|
Evaluated by deepseek-v3.2: +0.01 (Neutral) 12,616 tokens -0.27 | |
| 2026-03-02 13:12 | eval_success | Evaluated: Mild positive (0.27) | - - |
| 2026-03-02 13:12 |
eval
|
Evaluated by deepseek-v3.2: +0.27 (Mild positive) 11,404 tokens +0.13 | |
| 2026-03-02 12:35 | eval_success | Evaluated: Mild positive (0.14) | - - |
| 2026-03-02 12:35 |
eval
|
Evaluated by deepseek-v3.2: +0.14 (Mild positive) 11,134 tokens +0.12 | |
| 2026-03-02 12:23 | eval_success | Evaluated: Neutral (0.02) | - - |
| 2026-03-02 12:23 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 11,256 tokens +0.02 | |
| 2026-03-02 12:06 | eval_success | Evaluated: Neutral (0.00) | - - |
| 2026-03-02 12:06 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 12,074 tokens -0.04 | |
| 2026-03-02 12:03 | eval_success | Evaluated: Neutral (0.04) | - - |
| 2026-03-02 12:03 |
eval
|
Evaluated by deepseek-v3.2: +0.04 (Neutral) 10,976 tokens -0.26 | |
| 2026-03-02 11:52 | eval_success | Evaluated: Moderate positive (0.31) | - - |
| 2026-03-02 11:52 |
eval
|
Evaluated by deepseek-v3.2: +0.31 (Moderate positive) 11,585 tokens +0.14 | |
| 2026-03-02 11:52 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 1R | - - |
| 2026-03-02 11:41 | eval_success | Evaluated: Mild positive (0.16) | - - |
| 2026-03-02 11:41 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 10,682 tokens +0.03 | |
| 2026-03-02 11:41 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 60R | - - |
| 2026-03-02 11:19 | eval_success | Evaluated: Mild positive (0.13) | - - |
| 2026-03-02 11:19 |
eval
|
Evaluated by deepseek-v3.2: +0.13 (Mild positive) 11,295 tokens -0.28 | |
| 2026-03-02 11:05 | eval_success | Evaluated: Moderate positive (0.41) | - - |
| 2026-03-02 11:05 |
eval
|
Evaluated by deepseek-v3.2: +0.41 (Moderate positive) 11,226 tokens +0.89 | |
| 2026-03-02 11:05 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 1R | - - |
| 2026-03-02 10:54 | eval_failure | Evaluation failed: Error: Network connection lost. | - - |
| 2026-03-02 10:46 | rater_validation_fail | Parse failure for model deepseek-v3.2: Error: Failed to parse OpenRouter JSON: SyntaxError: Expected ',' or ']' after array element in JSON at position 6136 (line 176 column 6). Extracted text starts with: {
"schema_version": "3.7",
"e | - - |
| 2026-03-02 10:35 |
eval
|
Evaluated by deepseek-v3.2: -0.47 (Moderate negative) 11,311 tokens -0.47 | |
| 2026-03-02 10:26 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 10,636 tokens -0.31 | |
| 2026-03-02 10:18 |
eval
|
Evaluated by deepseek-v3.2: +0.31 (Moderate positive) 11,148 tokens -0.12 | |
| 2026-03-02 09:58 |
eval
|
Evaluated by deepseek-v3.2: +0.43 (Moderate positive) 11,329 tokens +0.23 | |
| 2026-03-02 09:33 |
eval
|
Evaluated by deepseek-v3.2: +0.21 (Mild positive) 10,916 tokens +0.21 | |
| 2026-03-02 08:56 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 12,814 tokens -0.28 | |
| 2026-03-02 08:41 |
eval
|
Evaluated by deepseek-v3.2: +0.28 (Mild positive) 15,068 tokens +0.28 | |
| 2026-03-02 08:30 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 11,874 tokens -0.20 | |
| 2026-03-02 07:54 |
eval
|
Evaluated by deepseek-v3.2: +0.20 (Mild positive) 10,660 tokens -0.20 | |
| 2026-03-02 07:44 |
eval
|
Evaluated by deepseek-v3.2: +0.40 (Moderate positive) 11,053 tokens +0.04 | |
| 2026-03-02 07:04 |
eval
|
Evaluated by deepseek-v3.2: +0.36 (Moderate positive) 10,771 tokens +0.23 | |
| 2026-03-02 06:52 |
eval
|
Evaluated by deepseek-v3.2: +0.13 (Mild positive) 10,935 tokens -0.00 | |
| 2026-03-02 06:38 |
eval
|
Evaluated by deepseek-v3.2: +0.13 (Mild positive) 11,698 tokens +0.11 | |
| 2026-03-02 06:36 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 12,239 tokens -0.60 | |
| 2026-03-02 06:15 |
eval
|
Evaluated by deepseek-v3.2: +0.62 (Strong positive) 11,108 tokens +0.39 | |
| 2026-03-02 06:03 |
eval
|
Evaluated by deepseek-v3.2: +0.22 (Mild positive) 11,218 tokens +0.07 | |
| 2026-03-02 05:49 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 11,444 tokens -0.08 | |
| 2026-03-02 05:24 |
eval
|
Evaluated by deepseek-v3.2: +0.24 (Mild positive) 11,383 tokens -0.03 | |
| 2026-03-02 05:15 |
eval
|
Evaluated by deepseek-v3.2: +0.27 (Mild positive) 11,698 tokens +0.16 | |
| 2026-03-02 05:00 |
eval
|
Evaluated by deepseek-v3.2: +0.12 (Mild positive) 11,400 tokens +0.10 | |
| 2026-03-02 04:56 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 11,742 tokens +0.01 | |
| 2026-03-02 04:33 |
eval
|
Evaluated by deepseek-v3.2: +0.01 (Neutral) 11,963 tokens -0.26 | |
| 2026-03-02 04:31 |
eval
|
Evaluated by deepseek-v3.2: +0.27 (Mild positive) 11,443 tokens -0.12 | |
| 2026-03-02 04:08 |
eval
|
Evaluated by deepseek-v3.2: +0.40 (Moderate positive) 11,721 tokens +0.38 | |
| 2026-03-02 04:02 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 11,416 tokens -0.14 | |
| 2026-03-02 03:53 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 10,667 tokens -0.00 | |
| 2026-03-02 03:48 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 11,339 tokens -0.00 | |
| 2026-03-02 03:41 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 11,225 tokens +0.05 | |
| 2026-03-02 03:16 |
eval
|
Evaluated by deepseek-v3.2: +0.12 (Mild positive) 11,811 tokens +0.04 | |
| 2026-03-02 03:03 |
eval
|
Evaluated by deepseek-v3.2: +0.07 (Neutral) 11,419 tokens -0.08 | |
| 2026-03-02 02:24 |
eval
|
Evaluated by deepseek-v3.2: +0.15 (Mild positive) 11,143 tokens -0.01 | |
| 2026-03-02 02:18 |
eval
|
Evaluated by deepseek-v3.2: +0.16 (Mild positive) 10,871 tokens +0.20 | |
| 2026-03-02 02:13 |
eval
|
Evaluated by deepseek-v3.2: -0.05 (Neutral) 11,566 tokens -0.05 | |
| 2026-03-02 01:48 |
eval
|
Evaluated by deepseek-v3.2: +0.01 (Neutral) 11,775 tokens -0.30 | |
| 2026-03-02 01:40 |
eval
|
Evaluated by deepseek-v3.2: +0.31 (Moderate positive) 11,416 tokens +0.30 | |
| 2026-03-02 01:35 |
eval
|
Evaluated by deepseek-v3.2: +0.00 (Neutral) 12,321 tokens +0.00 | |
| 2026-03-02 01:28 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 11,286 tokens -0.15 | |
| 2026-03-02 01:09 |
eval
|
Evaluated by deepseek-v3.2: +0.15 (Mild positive) 11,895 tokens -0.19 | |
| 2026-03-02 01:04 |
eval
|
Evaluated by deepseek-v3.2: +0.33 (Moderate positive) 10,897 tokens -0.10 | |
| 2026-03-02 00:53 |
eval
|
Evaluated by deepseek-v3.2: +0.43 (Moderate positive) 11,120 tokens +0.39 | |
| 2026-03-02 00:46 |
eval
|
Evaluated by deepseek-v3.2: +0.04 (Neutral) 11,836 tokens +0.03 | |
| 2026-03-02 00:45 |
eval
|
Evaluated by deepseek-v3.2: +0.00 (Neutral) 12,081 tokens -0.25 | |
| 2026-03-02 00:15 |
eval
|
Evaluated by deepseek-v3.2: +0.25 (Mild positive) 10,974 tokens +0.05 | |
| 2026-03-01 23:52 |
eval
|
Evaluated by deepseek-v3.2: +0.20 (Mild positive) 11,065 tokens +0.18 | |
| 2026-03-01 23:47 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 11,254 tokens -0.17 | |
| 2026-03-01 23:35 |
eval
|
Evaluated by deepseek-v3.2: +0.19 (Mild positive) 11,271 tokens -0.05 | |
| 2026-03-01 23:14 |
eval
|
Evaluated by deepseek-v3.2: +0.24 (Mild positive) 11,474 tokens -0.16 | |
| 2026-03-01 23:07 |
eval
|
Evaluated by deepseek-v3.2: +0.40 (Moderate positive) 11,025 tokens +0.40 | |
| 2026-03-01 23:02 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 10,813 tokens -0.05 | |
| 2026-03-01 22:50 |
eval
|
Evaluated by deepseek-v3.2: +0.05 (Neutral) 12,024 tokens +0.03 | |
| 2026-03-01 22:39 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 11,158 tokens -0.23 | |
| 2026-03-01 22:24 |
eval
|
Evaluated by deepseek-v3.2: +0.25 (Mild positive) 11,487 tokens -0.22 | |
| 2026-03-01 22:06 |
eval
|
Evaluated by deepseek-v3.2: +0.47 (Moderate positive) 11,387 tokens +0.44 | |
| 2026-03-01 21:50 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 12,118 tokens +0.03 | |
| 2026-03-01 21:42 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 10,857 tokens -0.07 | |
| 2026-03-01 21:32 |
eval
|
Evaluated by deepseek-v3.2: +0.07 (Neutral) 11,341 tokens +0.05 | |
| 2026-03-01 21:16 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 12,335 tokens -0.63 | |
| 2026-03-01 21:08 |
eval
|
Evaluated by deepseek-v3.2: +0.65 (Strong positive) 11,130 tokens +0.57 | |
| 2026-03-01 20:54 |
eval
|
Evaluated by deepseek-v3.2: +0.08 (Neutral) 11,120 tokens -0.21 | |
| 2026-03-01 20:47 |
eval
|
Evaluated by deepseek-v3.2: +0.29 (Mild positive) 11,934 tokens +0.25 | |
| 2026-03-01 20:30 |
eval
|
Evaluated by deepseek-v3.2: +0.04 (Neutral) 12,376 tokens -0.29 | |
| 2026-03-01 20:24 |
eval
|
Evaluated by deepseek-v3.2: +0.33 (Moderate positive) 10,999 tokens +0.30 | |
| 2026-03-01 20:17 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 11,911 tokens -0.21 | |
| 2026-03-01 19:53 |
eval
|
Evaluated by deepseek-v3.2: +0.24 (Mild positive) 11,253 tokens -0.28 | |
| 2026-03-01 19:48 |
eval
|
Evaluated by deepseek-v3.2: +0.51 (Moderate positive) 11,352 tokens +0.49 | |
| 2026-03-01 19:41 |
eval
|
Evaluated by deepseek-v3.2: +0.03 (Neutral) 11,816 tokens +0.03 | |
| 2026-03-01 19:35 |
eval
|
Evaluated by deepseek-v3.2: 0.00 (Neutral) 10,734 tokens -0.18 | |
| 2026-03-01 19:12 |
eval
|
Evaluated by deepseek-v3.2: +0.18 (Mild positive) 11,546 tokens +0.17 | |
| 2026-03-01 19:03 |
eval
|
Evaluated by deepseek-v3.2: +0.02 (Neutral) 11,934 tokens -0.17 | |
| 2026-03-01 18:54 |
eval
|
Evaluated by deepseek-v3.2: +0.19 (Mild positive) 11,078 tokens -0.20 | |
| 2026-03-01 18:47 |
eval
|
Evaluated by deepseek-v3.2: +0.39 (Moderate positive) 11,227 tokens +0.10 | |
| 2026-03-01 18:42 |
eval
|
Evaluated by deepseek-v3.2: +0.29 (Mild positive) 11,543 tokens +0.03 | |
| 2026-03-01 18:33 |
eval
|
Evaluated by deepseek-v3.2: +0.26 (Mild positive) 12,373 tokens +0.13 | |
| 2026-03-01 18:30 |
eval
|
Evaluated by deepseek-v3.2: +0.12 (Mild positive) 11,388 tokens +0.09 | |
| 2026-03-01 18:13 |
eval
|
Evaluated by deepseek-v3.2: +0.04 (Neutral) 12,204 tokens -0.17 | |
| 2026-03-01 18:04 |
eval
|
Evaluated by deepseek-v3.2: +0.21 (Mild positive) 11,387 tokens -0.07 | |
| 2026-03-01 17:55 |
eval
|
Evaluated by deepseek-v3.2: +0.28 (Mild positive) 11,757 tokens +0.16 | |
| 2026-03-01 17:48 |
eval
|
Evaluated by deepseek-v3.2: +0.12 (Mild positive) 12,500 tokens +0.01 | |
| 2026-03-01 17:38 |
eval
|
Evaluated by deepseek-v3.2: +0.11 (Mild positive) 12,125 tokens -0.02 | |
| 2026-03-01 17:30 |
eval
|
Evaluated by deepseek-v3.2: +0.13 (Mild positive) 10,828 tokens +0.05 | |
| 2026-03-01 17:25 |
eval
|
Evaluated by deepseek-v3.2: +0.08 (Neutral) 11,615 tokens -0.14 | |
| 2026-03-01 17:19 |
eval
|
Evaluated by deepseek-v3.2: +0.23 (Mild positive) 11,523 tokens +0.03 | |
| 2026-03-01 17:03 |
eval
|
Evaluated by deepseek-v3.2: +0.20 (Mild positive) 11,429 tokens +0.07 | |
| 2026-03-01 17:02 |
eval
|
Evaluated by deepseek-v3.2: +0.13 (Mild positive) 11,390 tokens -0.36 | |
| 2026-03-01 16:55 |
eval
|
Evaluated by deepseek-v3.2: +0.49 (Moderate positive) 11,350 tokens +0.28 | |
| 2026-03-01 16:45 |
eval
|
Evaluated by deepseek-v3.2: +0.21 (Mild positive) 11,126 tokens | |
| 2026-02-28 20:14 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial on Trump's actions |
| 2026-02-28 20:13 |
eval
|
Evaluated by llama-4-scout-wai: +0.20 (Mild positive) 0.00 | |
| reasoning Editorial discussing Trump and Silicon Valley, slight positive lean |
| 2026-02-28 19:28 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial on Trump's actions |
| 2026-02-28 19:24 |
eval
|
Evaluated by llama-4-scout-wai: +0.20 (Mild positive) | |
| reasoning Editorial discussing Trump and Silicon Valley, slight positive lean |
| 2026-02-28 19:23 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.10 (Mild positive) | |
| reasoning Editorial on Trump's actions |