0.00 GPT-5.4 Thinking System Card (openai.com)
1019 points by mudkipdev 10 days ago | 806 comments on HN | Neutral ~lite vlite-1.6
Summary ~lite AI Technology Neutral
Technical description of GPT-5.4 Thinking System
EQ 0.50
SO 0.00
TD 0.00
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal 1805 HN snapshots · 237 evals
+1 0 −1 HN
Audit Trail 257 entries
2026-03-08 18:29 eval_skip Skipped: no readable text (pre-fetch) - -
2026-03-08 18:06 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 18:06 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 18:06 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-08 17:31 eval_success PSQ evaluated: g-PSQ=0.450 (3 dims) - -
2026-03-08 17:31 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 17:14 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 17:14 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 17:14 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-08 16:53 eval_success PSQ evaluated: g-PSQ=0.267 (3 dims) - -
2026-03-08 16:53 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-08 16:01 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 16:01 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-08 16:01 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 15:08 eval_success PSQ evaluated: g-PSQ=0.450 (3 dims) - -
2026-03-08 15:08 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 14:54 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 14:54 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 14:54 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-08 14:30 eval_success PSQ evaluated: g-PSQ=0.474 (3 dims) - -
2026-03-08 14:30 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.21
2026-03-08 14:07 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 14:07 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 14:07 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-08 13:52 eval_success PSQ evaluated: g-PSQ=0.450 (3 dims) - -
2026-03-08 13:52 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 13:39 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-08 13:39 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 13:39 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-08 13:19 eval_success PSQ evaluated: g-PSQ=0.267 (3 dims) - -
2026-03-08 13:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-08 13:13 eval_success PSQ evaluated: g-PSQ=0.474 (3 dims) - -
2026-03-08 13:13 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.21
2026-03-08 12:57 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 12:41 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 12:36 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 12:34 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 12:01 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-08 11:56 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.21
2026-03-08 11:44 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 11:22 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 11:21 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 10:43 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 10:32 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 10:27 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 10:10 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 10:07 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 09:40 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 09:35 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-08 09:23 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 09:06 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 09:02 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 09:01 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 08:56 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 08:33 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.21
2026-03-08 08:22 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 07:58 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 07:50 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 07:28 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 07:20 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 06:56 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 06:49 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 06:30 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 06:21 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 05:58 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 05:48 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 05:30 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-08 05:18 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 05:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 04:57 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 04:44 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 04:27 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) +0.21
2026-03-08 04:11 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 03:58 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 03:44 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 03:26 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 03:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 03:02 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 02:52 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 02:36 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 02:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 02:01 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 01:49 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 01:32 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 01:27 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 01:14 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-08 00:57 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-08 00:48 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-08 00:28 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-08 00:13 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-07 23:55 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-07 23:45 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-07 23:21 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-07 23:04 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-07 22:47 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-07 22:38 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-07 22:33 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-07 21:53 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-07 21:48 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-07 20:37 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-07 20:33 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-07 19:53 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no explicit human rights discussion
2026-03-07 19:39 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) 0.00
reasoning
Technical content, no rights discussion
2026-03-07 19:01 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) 0.00
2026-03-07 18:56 eval Evaluated by llama-4-scout-wai-psq: +0.45 (Moderate positive) -0.03
2026-03-07 18:43 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) 0.00
2026-03-07 18:38 eval Evaluated by llama-3.3-70b-wai-psq: +0.27 (Mild positive) -0.21
2026-03-07 18:05 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 17:45 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 16:58 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 16:42 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-07 16:23 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 16:18 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 16:10 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 16:05 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 15:45 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 15:31 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 15:13 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 15:00 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 14:37 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 14:27 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 14:03 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 13:54 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 13:29 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 13:24 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 13:23 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 12:52 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 12:51 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 12:22 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 12:18 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 11:52 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 11:48 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 11:21 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-07 11:17 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 11:11 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 10:53 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 10:48 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 10:40 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 10:18 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 10:07 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 09:48 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 09:35 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 09:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-07 09:04 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 08:49 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-07 08:33 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 08:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 08:03 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 07:48 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 07:29 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 07:17 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 06:58 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 06:46 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 06:26 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 06:16 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 05:55 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 05:46 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 05:24 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 05:17 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 05:12 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 04:53 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 04:42 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 04:22 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 04:12 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 03:50 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 03:41 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 03:17 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 03:06 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 02:41 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 02:36 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 02:32 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 02:27 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 02:05 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 01:56 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 01:51 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-07 01:26 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 01:15 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-07 00:51 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-07 00:29 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 23:50 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 23:42 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.02
2026-03-06 23:37 eval Evaluated by llama-3.3-70b-wai-psq: +0.46 (Moderate positive) -0.01
2026-03-06 23:17 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 23:05 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 22:15 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 22:04 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 21:31 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 21:14 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 20:52 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 20:32 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-06 20:13 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 19:57 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-06 19:36 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 19:24 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 19:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 19:00 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 18:41 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 18:18 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 18:13 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 17:57 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 17:04 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 16:51 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 16:27 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 16:18 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 15:53 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 15:48 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 15:44 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 15:11 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 15:09 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 14:28 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 14:26 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 13:53 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 13:52 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 13:48 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 13:47 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 13:09 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 13:08 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 13:03 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 12:38 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 12:30 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 12:06 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 11:56 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 11:35 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 11:30 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-06 11:23 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 10:59 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-06 10:54 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 10:53 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 10:21 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 10:20 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 09:49 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 09:48 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 09:16 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 09:12 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 08:45 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 08:40 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 08:40 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 08:12 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 08:11 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 08:07 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 07:39 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 07:38 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 07:08 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 07:06 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 07:03 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 07:02 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 06:33 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) 0.00
2026-03-06 06:29 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 06:28 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive) -0.01
2026-03-06 06:24 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 05:56 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 05:51 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-06 05:51 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.01
2026-03-06 04:41 eval Evaluated by llama-4-scout-wai-psq: +0.48 (Moderate positive)
2026-03-06 04:41 eval Evaluated by llama-3.3-70b-wai-psq: +0.47 (Moderate positive)
2026-03-05 19:12 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
reasoning
Technical content, no explicit human rights discussion
2026-03-05 19:09 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
reasoning
Technical content, no rights discussion