| 2026-03-07 19:16 | eval_success | PSQ evaluated: g-PSQ=0.120 (3 dims) | - - |
| 2026-03-07 19:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16 | |
| 2026-03-07 19:10 | eval_success | PSQ evaluated: g-PSQ=0.055 (3 dims) | - - |
| 2026-03-07 19:10 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-07 19:06 | eval_success | PSQ evaluated: g-PSQ=0.055 (3 dims) | - - |
| 2026-03-07 19:06 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-07 17:41 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) +0.16 | |
| 2026-03-07 17:34 | eval_success | PSQ evaluated: g-PSQ=0.055 (3 dims) | - - |
| 2026-03-07 17:34 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-07 17:29 | eval_success | PSQ evaluated: g-PSQ=0.055 (3 dims) | - - |
| 2026-03-07 17:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-07 17:13 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 16:50 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 16:17 | rate_limit | Rate limited (429), retrying in 52s | - - |
| 2026-03-07 15:20 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 15:15 | rate_limit | Rate limited (429), retrying in 70s | - - |
| 2026-03-07 15:10 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 15:05 | rate_limit | Rate limited (429), retrying in 49s | - - |
| 2026-03-07 14:59 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 14:54 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 14:49 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 14:49 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 14:49 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 04:49 | self_throttle | Self-throttle: circuit-breaker: 3 consecutive 429s | - - |
| 2026-03-07 04:44 | self_throttle | Self-throttle: circuit-breaker: 4 consecutive 429s | - - |
| 2026-03-06 22:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 22:33 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-06 17:54 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-06 17:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-06 04:30 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-05 21:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00 | |
| 2026-03-05 17:21 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-05 08:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16 | |
| 2026-03-05 07:28 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-05 04:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-05 04:45 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-05 04:39 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) 0.00 | |
| 2026-03-05 04:34 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.06 (Neutral) | |
| 2026-03-05 04:22 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.02 (Neutral) +0.04 | |
| reasoning Medical journal correction case |
| 2026-03-05 04:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 04:17 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.06 (Neutral) -0.04 | |
| reasoning Medical journal correction case |
| 2026-03-05 04:14 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 03:44 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.02 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-05 03:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 03:05 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.02 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-05 02:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 02:22 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.02 (Neutral) +0.04 | |
| reasoning Medical journal correction case |
| 2026-03-05 02:18 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 01:49 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.06 (Neutral) -0.04 | |
| reasoning Medical journal correction case |
| 2026-03-05 01:42 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 01:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 01:09 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.02 (Neutral) +0.09 | |
| reasoning Medical journal correction case |
| 2026-03-05 01:00 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 00:55 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 00:35 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) -0.05 | |
| reasoning Medical journal correction case |
| 2026-03-05 00:11 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-05 00:06 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 23:54 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.06 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 23:49 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.06 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 23:29 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 23:23 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 23:11 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.06 (Neutral) +0.05 | |
| reasoning Medical journal correction case |
| 2026-03-04 22:41 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 22:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 22:29 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) -0.03 | |
| reasoning Medical journal correction case |
| 2026-03-04 21:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 21:54 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 21:22 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 21:19 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) +0.03 | |
| reasoning Medical journal correction case |
| 2026-03-04 20:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 20:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 20:06 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 20:05 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 19:14 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 19:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) -0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 19:06 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.16 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 18:13 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) -0.03 | |
| reasoning Medical journal correction case |
| 2026-03-04 18:06 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 16:46 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 16:39 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.08 (Neutral) +0.03 | |
| reasoning Medical journal correction case |
| 2026-03-04 16:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) 0.00 | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 15:59 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) 0.00 | |
| reasoning Medical journal correction case |
| 2026-03-04 15:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.16 (Mild negative) | |
| reasoning The content discusses a medical journal's admission that its case reports for 25 years were fictional, which relates to |
| 2026-03-04 15:55 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.11 (Mild negative) | |
| reasoning Medical journal correction case |