| 2026-03-02 04:46 | eval_success | Evaluated: Mild positive (0.25) | - - |
| 2026-03-02 04:46 | model_divergence | Cross-model spread 0.40 exceeds threshold (3 models) | - - |
| 2026-03-02 04:46 | eval | Evaluated by deepseek-v3.2: +0.25 (Mild positive) 13,372 tokens +0.02 | |
| 2026-03-02 04:46 | rater_validation_warn | Validation warnings for model deepseek-v3.2: 0W 52R | - - |
| 2026-03-01 17:41 | eval_success | Evaluated: Mild positive (0.23) | - - |
| 2026-03-01 17:41 | model_divergence | Cross-model spread 0.40 exceeds threshold (3 models) | - - |
| 2026-03-01 17:41 | eval | Evaluated by deepseek-v3.2: +0.23 (Mild positive) 13,716 tokens -0.07 | |
| 2026-03-01 16:49 | eval_success | Evaluated: Moderate positive (0.30) | - - |
| 2026-03-01 16:48 | model_divergence | Cross-model spread 0.40 exceeds threshold (3 models) | - - |
| 2026-03-01 16:48 | eval | Evaluated by deepseek-v3.2: +0.30 (Moderate positive) 13,248 tokens +0.07 | |
| 2026-03-01 02:07 | eval_success | Evaluated: Mild positive (0.23) | - - |
| 2026-03-01 02:07 | model_divergence | Cross-model spread 0.40 exceeds threshold (4 models) | - - |
| 2026-03-01 02:07 | eval | Evaluated by deepseek-v3.2: +0.23 (Mild positive) 13,120 tokens | |
| 2026-03-01 01:59 | eval_success | Lite evaluated: Moderate positive (0.40) | - - |
| 2026-03-01 01:59 | model_divergence | Cross-model spread 0.40 exceeds threshold (3 models) | - - |
| 2026-03-01 01:59 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-03-01 01:55 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 01:55 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-01 01:55 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-03-01 01:20 | model_divergence | Cross-model spread 0.32 exceeds threshold (3 models) | - - |
| 2026-03-01 01:20 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 01:20 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-03-01 00:52 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 00:52 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-03-01 00:46 | eval_success | Lite evaluated: Mild positive (0.10) | - - |
| 2026-03-01 00:46 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-03-01 00:25 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-01 00:25 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-01 00:25 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-03-01 00:20 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-01 00:20 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-03-01 00:05 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 23:33 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 23:09 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 22:36 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 22:31 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 22:14 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 21:48 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 21:43 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 21:30 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 21:25 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 21:00 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 20:35 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 20:10 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 19:48 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 19:23 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 19:05 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 18:57 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 18:42 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 18:26 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 18:17 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 18:00 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 17:55 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 17:53 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 17:29 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 17:28 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 17:24 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 17:01 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 16:56 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 16:35 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 16:31 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 15:39 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 15:28 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 13:41 | eval | Evaluated by claude-haiku-4-5-20251001: +0.32 (Moderate positive) | |
| 2026-02-28 13:22 | eval | Evaluated by llama-4-scout-wai: +0.10 (Mild positive) -0.30 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 12:58 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.02 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 08:57 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 06:05 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 05:49 | eval | Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) -0.02 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 05:45 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 05:43 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 05:02 | eval | Evaluated by llama-4-scout-wai: +0.40 (Moderate positive) +0.40 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 04:40 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 04:26 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 03:13 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 02:55 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 02:49 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 02:38 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 02:25 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 02:01 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 01:54 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 01:40 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 01:29 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00 | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 01:27 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |
| 2026-02-28 01:18 | eval | Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Tech blog warns of security risks |
| 2026-02-28 01:05 | eval | Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Editorial discusses security concerns with OpenClaw, advocates isolated VM |