Longitudinal 993 HN snapshots · 103 evals
Audit Trail 123 entries
2026-03-16 01:56 eval_success Evaluated: Mild positive (0.28) - -
2026-03-16 01:56 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-16 01:56 eval Evaluated by claude-haiku-4-5-20251001: +0.28 (Mild positive) 13,222 tokens +0.05
2026-03-16 01:25 eval_success Evaluated: Mild positive (0.23) - -
2026-03-16 01:25 model_divergence Cross-model spread 0.30 exceeds threshold (2 models) - -
2026-03-16 01:25 eval Evaluated by claude-haiku-4-5-20251001: +0.23 (Mild positive) 13,248 tokens -0.13
2026-03-16 00:52 eval_success Evaluated: Moderate positive (0.36) - -
2026-03-16 00:52 model_divergence Cross-model spread 0.44 exceeds threshold (2 models) - -
2026-03-16 00:52 eval Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,120 tokens -0.06
2026-03-16 00:42 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-16 00:42 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 00:38 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-16 00:38 model_divergence Cross-model spread 0.50 exceeds threshold (2 models) - -
2026-03-16 00:38 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-16 00:38 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-16 00:22 eval_success Evaluated: Moderate positive (0.42) - -
2026-03-16 00:22 eval Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 14,447 tokens -0.00
2026-03-15 23:47 eval_success Evaluated: Moderate positive (0.42) - -
2026-03-15 23:47 eval Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 13,324 tokens +0.06
2026-03-15 23:09 eval_success Evaluated: Moderate positive (0.36) - -
2026-03-15 23:09 eval Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,698 tokens +0.21
2026-03-15 23:06 eval_success Evaluated: Mild positive (0.15) - -
2026-03-15 23:06 eval Evaluated by claude-haiku-4-5-20251001: +0.15 (Mild positive) 13,406 tokens -0.25
2026-03-15 22:30 eval_success Evaluated: Moderate positive (0.40) - -
2026-03-15 22:30 eval Evaluated by claude-haiku-4-5-20251001: +0.40 (Moderate positive) 12,818 tokens
2026-03-15 21:50 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-15 21:50 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 21:43 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-15 21:43 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 21:43 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-15 21:08 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-15 21:08 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 21:04 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-15 21:04 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 20:31 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 20:26 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 19:53 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 19:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 19:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 19:13 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 18:31 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 18:27 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 17:18 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 17:17 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 16:05 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 16:05 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 15:28 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 15:26 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 14:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 14:46 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 14:16 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 14:08 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 13:40 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 13:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 13:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 12:50 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 12:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 12:10 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 11:43 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 11:32 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 11:03 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 10:49 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 10:24 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 10:10 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 09:42 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 09:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 09:02 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 08:50 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 08:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 08:07 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 07:36 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 07:25 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 06:57 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 06:47 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 06:22 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 06:13 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 05:47 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 05:37 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 05:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 05:00 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 04:37 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 04:25 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 04:03 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 03:49 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 03:25 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 03:09 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 02:50 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 02:32 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 02:15 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 01:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 01:40 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 01:22 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 01:11 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 00:52 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 00:44 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-15 00:13 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive)
2026-03-15 00:10 eval Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative)
reasoning
Technical blog post on XML usage
2026-03-14 23:59 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-14 23:39 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 23:21 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 23:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 22:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 22:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 21:16 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 21:00 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 20:06 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-14 19:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 19:23 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 19:11 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 18:21 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 18:08 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 16:44 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 16:35 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 15:35 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 15:24 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 14:52 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 14:46 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 14:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 14:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 13:40 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 13:35 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
2026-03-14 13:01 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive)
2026-03-14 13:00 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral)
reasoning
Technical blog post on XML usage in software development, no explicit human rights discussion
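The trailing signed number on each eval entry and the "Cross-model spread" figure can be reproduced from the displayed scores. The following is a minimal sketch, assuming the trailing number is the change from the same model's previous score and the spread is the highest minus the lowest of the models' latest scores. The function names and the model pairing are illustrative and not taken from the tool itself. The log appears to compute spread from unrounded scores, so some logged values (for example 0.35 and 0.30) differ by 0.01 from what the rounded display gives.

```python
from typing import Dict, List, Optional


def score_deltas(scores: List[float]) -> List[Optional[float]]:
    """Change from the previous score, oldest first; the first entry has none."""
    out: List[Optional[float]] = [None]
    for prev, cur in zip(scores, scores[1:]):
        out.append(round(cur - prev, 2))
    return out


def cross_model_spread(latest: Dict[str, float]) -> float:
    """Highest minus lowest of the most recent score per model (assumed definition)."""
    values = list(latest.values())
    return round(max(values) - min(values), 2)


# claude-haiku-4-5-20251001 scores from 2026-03-15 22:30 to 2026-03-16 01:56
haiku = [0.40, 0.15, 0.36, 0.42, 0.42, 0.36, 0.23, 0.28]
print(score_deltas(haiku))

# Displayed scores at 2026-03-16 00:52 (assumed to be the two models compared)
print(cross_model_spread({
    "claude-haiku-4-5-20251001": 0.36,
    "llama-4-scout-wai": -0.08,
}))
```

The threshold that triggers a model_divergence entry is not stated in the log, so it is left out of the sketch.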