Model Comparison
Model Editorial Structural Class Conf SETL Theme
@cf/meta/llama-4-scout-17b-16e-instruct lite ND ND 0.77
@cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 ND Neutral 1.00 0.00 Software Engineering
claude-haiku-4-5-20251001 +0.29 +0.36 Moderate positive 0.40 -0.15 Free Expression & Education
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 +0.23 Neutral 0.90 -0.23 Software Engineering
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite ND ND 0.78
Section @cf/meta/llama-4-scout-17b-16e-instruct lite @cf/meta/llama-4-scout-17b-16e-instruct lite claude-haiku-4-5-20251001 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite @cf/meta/llama-3.3-70b-instruct-fp8-fast lite
Preamble ND ND 0.29 ND ND
Article 1 ND ND 0.24 ND ND
Article 2 ND ND ND ND ND
Article 3 ND ND 0.19 ND ND
Article 4 ND ND ND ND ND
Article 5 ND ND ND ND ND
Article 6 ND ND 0.22 ND ND
Article 7 ND ND 0.32 ND ND
Article 8 ND ND 0.27 ND ND
Article 9 ND ND ND ND ND
Article 10 ND ND 0.22 ND ND
Article 11 ND ND ND ND ND
Article 12 ND ND ND ND ND
Article 13 ND ND 0.42 ND ND
Article 14 ND ND 0.32 ND ND
Article 15 ND ND 0.24 ND ND
Article 16 ND ND 0.37 ND ND
Article 17 ND ND 0.29 ND ND
Article 18 ND ND 0.37 ND ND
Article 19 ND ND 0.84 ND ND
Article 20 ND ND 0.27 ND ND
Article 21 ND ND ND ND ND
Article 22 ND ND 0.32 ND ND
Article 23 ND ND 0.37 ND ND
Article 24 ND ND 0.27 ND ND
Article 25 ND ND 0.32 ND ND
Article 26 ND ND 0.84 ND ND
Article 27 ND ND 0.82 ND ND
Article 28 ND ND 0.32 ND ND
Article 29 ND ND 0.22 ND ND
Article 30 ND ND 0.17 ND ND
+0.29 SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI (arxiv.org) S: +0.36
122 points by mpweiher 7 days ago | 42 comments on HN | Moderate positive Contested Low agreement (3 models) Editorial · v3.7 · 2026-03-16 00:47:36 0
Summary Free Expression & Education Champions
This arXiv abstract page presents a research paper on a software engineering benchmark (SWE-CI), hosted on arXiv's open-access platform. The content champions human rights through Articles 19 (free expression) and 26 (education) by enabling unrestricted global access to scientific knowledge in multiple formats. The structural model (free, non-profit, commons-based publishing without gatekeeping) directly instantiates the UDHR's vision of universal knowledge participation.
Rights Tensions 1 pair
Art 19 / Art 25: Scientific freedom to publish technical research on AI agents may raise concerns about health/safety oversight, but the content resolves this through rigorous evaluation methodology rather than suppression.
Article Heatmap
Preamble: +0.29 (Preamble)
Article 1: +0.24 (Freedom, Equality, Brotherhood)
Article 2: ND (Non-Discrimination)
Article 3: +0.19 (Life, Liberty, Security)
Article 4: ND (No Slavery)
Article 5: ND (No Torture)
Article 6: +0.22 (Legal Personhood)
Article 7: +0.32 (Equality Before Law)
Article 8: +0.27 (Right to Remedy)
Article 9: ND (No Arbitrary Detention)
Article 10: +0.22 (Fair Hearing)
Article 11: ND (Presumption of Innocence)
Article 12: ND (Privacy)
Article 13: +0.42 (Freedom of Movement)
Article 14: +0.32 (Asylum)
Article 15: +0.24 (Nationality)
Article 16: +0.37 (Marriage & Family)
Article 17: +0.29 (Property)
Article 18: +0.37 (Freedom of Thought)
Article 19: +0.84 (Freedom of Expression)
Article 20: +0.27 (Assembly & Association)
Article 21: ND (Political Participation)
Article 22: +0.32 (Social Security)
Article 23: +0.37 (Work & Equal Pay)
Article 24: +0.27 (Rest & Leisure)
Article 25: +0.32 (Standard of Living)
Article 26: +0.84 (Education)
Article 27: +0.82 (Cultural Participation)
Article 28: +0.32 (Social & International Order)
Article 29: +0.22 (Duties to Community)
Article 30: +0.17 (No Destruction of Rights)
Negative Neutral Positive No Data
Aggregates
E +0.29
S +0.36
Weighted Mean +0.41 Unweighted Mean +0.35
Max +0.84 Article 19 Min +0.17 Article 30
Signal 24 No Data 7
Volatility 0.19 (Medium)
Negative 0 Channels E: 0.6 S: 0.4
SETL -0.15 Structural-dominant
FW Ratio 53% 61 facts · 54 inferences
Agreement Low 3 models · spread ±0.203
Evidence 38% coverage
3H 12M 11L 7 ND
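Several of the aggregates above can be cross-checked from the per-section scores in the Model Comparison table. The sketch below is a reader's sanity check, not the site's implementation: it assumes the unweighted mean is a plain average over the 24 sections with signal, that Volatility is a population standard deviation, and that FW Ratio is facts divided by facts plus inferences. Under those assumptions the results line up with the displayed +0.35, 0.19, and 53%. The weighted mean is not reproduced because the page does not state its weights.

```python
from statistics import mean, pstdev

# Scores for the 24 sections with signal, copied from the
# claude-haiku-4-5-20251001 column of the Model Comparison table.
scores = {
    "Preamble": 0.29, "Article 1": 0.24, "Article 3": 0.19,
    "Article 6": 0.22, "Article 7": 0.32, "Article 8": 0.27,
    "Article 10": 0.22, "Article 13": 0.42, "Article 14": 0.32,
    "Article 15": 0.24, "Article 16": 0.37, "Article 17": 0.29,
    "Article 18": 0.37, "Article 19": 0.84, "Article 20": 0.27,
    "Article 22": 0.32, "Article 23": 0.37, "Article 24": 0.27,
    "Article 25": 0.32, "Article 26": 0.84, "Article 27": 0.82,
    "Article 28": 0.32, "Article 29": 0.22, "Article 30": 0.17,
}
values = list(scores.values())

# Assumption: "Unweighted Mean" is the arithmetic mean over sections with signal.
unweighted_mean = mean(values)

# Assumption: "Volatility" is a population standard deviation of those scores.
volatility = pstdev(values)

# Highest and lowest scoring sections (ties resolve to the first one listed).
top = max(scores, key=scores.get)
bottom = min(scores, key=scores.get)

# Assumption: "FW Ratio" is the share of facts among facts plus inferences.
facts, inferences = 61, 54
fw_ratio = facts / (facts + inferences)

print(f"sections with signal: {len(values)}")
print(f"unweighted mean: {unweighted_mean:.3f}")
print(f"volatility: {volatility:.3f}")
print(f"max: {top}, min: {bottom}")
print(f"FW ratio: {fw_ratio:.1%}")
```

The same dictionary can be reused to test alternative definitions, for example a sample standard deviation, if the site's documentation turns out to differ from these assumptions.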
Theme Radar
Foundation: 0.27 (2 articles)
Security: 0.19 (1 article)
Legal: 0.26 (4 articles)
Privacy & Movement: 0.33 (3 articles)
Personal: 0.34 (3 articles)
Expression: 0.55 (2 articles)
Economic & Social: 0.32 (4 articles)
Cultural: 0.83 (2 articles)
Order & Duties: 0.24 (3 articles)
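The theme values appear to be averages of the per-section scores within each group, skipping sections with no data. The sketch below illustrates that reading. The mapping of articles to themes is an assumption inferred from the article counts shown in the radar (the page does not list it), and the inputs are the rounded scores from the Model Comparison table, so results can differ from the displayed values in the second decimal.

```python
from statistics import mean

ND = None  # marker for "No Data"

# Index 0 is the Preamble, index n is Article n (claude-haiku-4-5-20251001 column).
scores = [
    0.29, 0.24, ND, 0.19, ND, ND, 0.22, 0.32, 0.27, ND, 0.22,
    ND, ND, 0.42, 0.32, 0.24, 0.37, 0.29, 0.37, 0.84, 0.27,
    ND, 0.32, 0.37, 0.27, 0.32, 0.84, 0.82, 0.32, 0.22, 0.17,
]

# Hypothetical grouping: contiguous article ranges chosen so that the number
# of sections with signal matches the "(n articles)" counts in the radar.
themes = {
    "Foundation": range(0, 3),
    "Security": range(3, 6),
    "Legal": range(6, 12),
    "Privacy & Movement": range(12, 16),
    "Personal": range(16, 19),
    "Expression": range(19, 22),
    "Economic & Social": range(22, 26),
    "Cultural": range(26, 28),
    "Order & Duties": range(28, 31),
}

# Values displayed in the Theme Radar, kept here only for comparison.
displayed = {
    "Foundation": (0.27, 2), "Security": (0.19, 1), "Legal": (0.26, 4),
    "Privacy & Movement": (0.33, 3), "Personal": (0.34, 3),
    "Expression": (0.55, 2), "Economic & Social": (0.32, 4),
    "Cultural": (0.83, 2), "Order & Duties": (0.24, 3),
}

theme_stats = {}
for name, members in themes.items():
    signal = [scores[i] for i in members if scores[i] is not ND]
    theme_stats[name] = (mean(signal), len(signal))

for name, (avg, count) in theme_stats.items():
    shown, shown_count = displayed[name]
    print(f"{name}: computed {avg:.3f} over {count}, displayed {shown} over {shown_count}")
```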
Editorial Channel
What the content says
+0.50
Article 19 Freedom of Expression
High Advocacy Coverage
Editorial
+0.50
SETL
-0.24

The research directly addresses freedom of expression through publication of novel scientific methodology, ideas, and findings. Abstract demonstrates unrestricted expression of technical concepts.

+0.50
Article 26 Education
High Advocacy Coverage
Editorial
+0.50
SETL
-0.24

Scientific research and publication constitute education; the paper contributes to knowledge dissemination and technical education globally.

+0.50
Article 27 Cultural Participation
High Advocacy Coverage
Editorial
+0.50
SETL
-0.17

The paper contributes to scientific and cultural life of the community through novel benchmarking methodology and intellectual participation.

+0.40
Article 13 Freedom of Movement
Medium Advocacy
Editorial
+0.40
SETL
-0.15

Research and publication enable freedom of movement in the knowledge commons; international authorship demonstrates cross-border intellectual participation.

+0.35
Article 16 Marriage & Family
Medium Advocacy
Editorial
+0.35
SETL
-0.14

Scientific publication and open collaboration constitute the right to marry and family life through intellectual partnership and collective knowledge creation.

+0.35
Article 18 Freedom of Thought
Medium Advocacy
Editorial
+0.35
SETL
-0.14

Scientific research and publication constitute freedom of thought and conscience through unrestricted inquiry and peer evaluation of ideas.

+0.35
Article 23 Work & Equal Pay
Medium Advocacy
Editorial
+0.35
SETL
-0.14

Scientific research constitutes right to work; authorship and publication enable meaningful employment and intellectual contribution.

+0.30
Article 7 Equality Before Law
Medium Advocacy
Editorial
+0.30
SETL
-0.13

Scientific research and benchmarking operate under principles of equal treatment before objective standards and fair evaluation metrics.

+0.30
Article 14 Asylum
Medium Advocacy
Editorial
+0.30
SETL
-0.13

Open scientific research and publication constitute asylum and refuge from suppressive intellectual regimes; evidence-based methodology offers protection against persecution based on opinion.

+0.30
Article 22 Social Security
Medium Advocacy
Editorial
+0.30
SETL
-0.13

Scientific research and institutional participation constitute social security through access to the knowledge commons and intellectual community.

+0.30
Article 25 Standard of Living
Medium Advocacy
Editorial
+0.30
SETL
-0.13

Scientific research methodology and benchmarking address health and well-being through evidence-based evaluation and systematic improvement.

+0.30
Article 28 Social & International Order
Medium Advocacy
Editorial
+0.30
SETL
-0.13

Scientific research and peer evaluation constitute social and international order supporting the rights described in the UDHR.

+0.25
Preamble Preamble
Medium Advocacy
Editorial
+0.25
SETL
-0.19

Abstract discusses scientific research advancement and systematic evaluation methodologies, implying commitment to human dignity through knowledge progress and rational discourse.

+0.25
Article 8 Right to Remedy
Medium Advocacy
Editorial
+0.25
SETL
-0.12

Scholarly research and peer evaluation constitute institutional remedy against unfair treatment through transparent methodology and published results.

+0.25
Article 17 Property
Medium Advocacy
Editorial
+0.25
SETL
-0.19

Scientific research and publication constitute protection of intellectual property through attribution, citation, and archival preservation.

+0.25
Article 20 Assembly & Association
Low Advocacy
Editorial
+0.25
SETL
-0.12

Scientific collaboration and research communities constitute peaceful assembly and association through intellectual partnership.

+0.25
Article 24 Rest & Leisure
Low Advocacy
Editorial
+0.25
SETL
-0.12

Scientific research and publication enable rest and leisure through intellectual freedom and participation in knowledge communities.

+0.20
Article 1 Freedom, Equality, Brotherhood
Low Advocacy
Editorial
+0.20
SETL
-0.17

Paper concerns scientific and technical equality through standardized benchmarking that treats all agents fairly according to objective metrics.

+0.20
Article 6 Legal Personhood
Low Advocacy
Editorial
+0.20
SETL
-0.11

Scientific research recognizes the right to personhood through individual attribution of work and acknowledgment in scholarship.

+0.20
Article 10 Fair Hearing
Low Advocacy
Editorial
+0.20
SETL
-0.11

Peer review and scientific evaluation ideally constitute fair and public hearing of claims, though this abstract does not detail review procedures.

+0.20
Article 15 Nationality
Low Advocacy
Editorial
+0.20
SETL
-0.17

Scientific research affirms the right to a nationality through institutional attribution and community membership in scholarship.

+0.20
Article 29 Duties to Community
Low Advocacy
Editorial
+0.20
SETL
-0.11

Scientific research and community participation entail responsibilities; peer evaluation and reproducibility constitute collective duty.

+0.15
Article 3 Life, Liberty, Security
Low Advocacy
Editorial
+0.15
SETL
-0.16

Scientific research and benchmarking fundamentally assert the right to existence and safety through evidence-based evaluation.

+0.15
Article 30 No Destruction of Rights
Low
Editorial
+0.15
SETL
-0.10

Scientific research and open publication oppose restriction of UDHR rights; the paper's methodology does not appear to restrict others' rights.

ND
Article 2 Non-Discrimination
Low

Content does not directly address discrimination or protected characteristics.

ND
Article 4 No Slavery

Content does not engage with slavery or servitude.

ND
Article 5 No Torture

Content does not address torture or cruel treatment.

ND
Article 9 No Arbitrary Detention

Content does not address arbitrary arrest or detention.

ND
Article 11 Presumption of Innocence

Content does not address criminal liability or presumption of innocence.

ND
Article 12 Privacy
Low

Content does not address privacy and correspondence.

ND
Article 21 Political Participation

Content does not address voting or political participation.

Structural Channel
What the site does
Element Modifier Affects Note
Legal & Terms
Privacy
arXiv does not employ invasive tracking; email submission history visible only to author.
Terms of Service
arXiv permits open access and redistribution under CC licenses; terms support knowledge dissemination.
Identity & Mission
Mission +0.20
Article 19 Article 27
arXiv's mission aligns with free dissemination of scientific knowledge and open access to research.
Editorial Code
arXiv operates a moderation system; no evidence of censorship or editorial bias on this abstract page.
Ownership
arXiv operated by Cornell University; non-profit stewardship supports research commons.
Access & Distribution
Access Model +0.20
Article 19 Article 26 Article 27
Free, unrestricted access to preprints removes financial barriers to knowledge access.
Ad/Tracking
No advertisements or tracking systems observed on arXiv.
Accessibility +0.15
Article 26
arXiv provides HTML and PDF formats, LaTeX source, and multiple citation export formats. Supports broad accessibility for researchers globally.
+0.60
Article 19 Freedom of Expression
High Advocacy Coverage
Structural
+0.60
Context Modifier
+0.30
SETL
-0.24

arXiv's core mission is free dissemination of scientific knowledge; unrestricted global access to the paper exemplifies Article 19 protection at scale.

+0.60
Article 26 Education
High Advocacy Coverage
Structural
+0.60
Context Modifier
+0.30
SETL
-0.24

arXiv's open-access model and multiple format support (HTML, PDF, TeX) exemplify unrestricted access to education and learning resources.

+0.55
Article 27 Cultural Participation
High Advocacy Coverage
Structural
+0.55
Context Modifier
+0.30
SETL
-0.17

arXiv's mission and access model directly support Article 27 by enabling free participation in scientific and cultural knowledge.

+0.45
Article 13 Freedom of Movement
Medium Advocacy
Structural
+0.45
Context Modifier
0.00
SETL
-0.15

arXiv's global reach and unrestricted access enable researchers of all nationalities to participate equally.

+0.40
Article 16 Marriage & Family
Medium Advocacy
Structural
+0.40
Context Modifier
0.00
SETL
-0.14

arXiv enables collaborative authorship and institutional partnerships that form communities.

+0.40
Article 18 Freedom of Thought
Medium Advocacy
Structural
+0.40
Context Modifier
0.00
SETL
-0.14

arXiv's open-access model and non-censorious moderation protect freedom to publish and access scientific ideas.

+0.40
Article 23 Work & Equal Pay
Medium Advocacy
Structural
+0.40
Context Modifier
0.00
SETL
-0.14

arXiv's publishing infrastructure supports researchers' work and career development without discrimination.

+0.35
Preamble Preamble
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.19

arXiv's open-access infrastructure provides free dissemination of research without barriers, enabling universal participation in scientific knowledge creation.

+0.35
Article 7 Equality Before Law
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.13

arXiv applies uniform submission and archival standards; no gatekeeping based on individual status or affiliation.

+0.35
Article 14 Asylum
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.13

arXiv's non-censorious, open-access model provides refuge for scientific knowledge regardless of political context.

+0.35
Article 17 Property
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.19

arXiv provides DOI-based persistent identification, license-based protection, and citation infrastructure.

+0.35
Article 22 Social Security
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.13

Free access to arXiv supports participation in global knowledge networks without economic barriers.

+0.35
Article 25 Standard of Living
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.13

arXiv's infrastructure supports research on health-related topics and technological improvements that benefit well-being.

+0.35
Article 28 Social & International Order
Medium Advocacy
Structural
+0.35
Context Modifier
0.00
SETL
-0.13

arXiv operates as part of international scientific infrastructure; Cornell University stewardship supports institutional order aligned with rights.

+0.30
Article 1 Freedom, Equality, Brotherhood
Low Advocacy
Structural
+0.30
Context Modifier
0.00
SETL
-0.17

arXiv's non-profit, university-operated model supports equal access and treatment of all researchers globally.

+0.30
Article 8 Right to Remedy
Medium Advocacy
Structural
+0.30
Context Modifier
0.00
SETL
-0.12

arXiv provides accessible submission history and DOI-based citation systems that enable verification and recourse.

+0.30
Article 15 Nationality
Low Advocacy
Structural
+0.30
Context Modifier
0.00
SETL
-0.17

arXiv preserves author and institutional identity, supporting national and scholarly affiliation.

+0.30
Article 20 Assembly & Association
Low Advocacy
Structural
+0.30
Context Modifier
0.00
SETL
-0.12

arXiv supports community formation around shared research interests and citation networks.

+0.30
Article 24 Rest & Leisure
Low Advocacy
Structural
+0.30
Context Modifier
0.00
SETL
-0.12

arXiv supports participation without time-based paywalls or forced labor models.

+0.25
Article 3 Life, Liberty, Security
Low Advocacy
Structural
+0.25
Context Modifier
0.00
SETL
-0.16

Free access and transparent metadata protect researcher and user rights to participate in knowledge generation safely.

+0.25
Article 6 Legal Personhood
Low Advocacy
Structural
+0.25
Context Modifier
0.00
SETL
-0.11

arXiv attributes papers to named individual authors and preserves authorship records.

+0.25
Article 10 Fair Hearing
Low Advocacy
Structural
+0.25
Context Modifier
0.00
SETL
-0.11

arXiv's moderation system provides consistent procedural standards for all submissions.

+0.25
Article 29 Duties to Community
Low Advocacy
Structural
+0.25
Context Modifier
0.00
SETL
-0.11

arXiv's moderation system enforces standards; researcher responsibility is implicit in submission requirements.

+0.20
Article 30 No Destruction of Rights
Low
Structural
+0.20
Context Modifier
0.00
SETL
-0.10

arXiv's CC-based license explicitly prevents rights restriction; the model opposes Article 30 violations.

ND
Article 2 Non-Discrimination
Low

arXiv's open platform structure and uniform moderation standards minimize institutional discrimination in knowledge dissemination.

ND
Article 4 No Slavery

No relevant structural signals.

ND
Article 5 No Torture

No relevant structural signals.

ND
Article 9 No Arbitrary Detention

No relevant structural signals.

ND
Article 11 Presumption of Innocence

No relevant structural signals.

ND
Article 12 Privacy
Low

arXiv limits email visibility to authors; submission history does not expose private information.

ND
Article 21 Political Participation

No relevant structural signals.
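The per-section scores in the heatmap can be related to the editorial and structural values listed in the two channel sections. The sketch below shows one reading that reproduces the displayed numbers for the sampled rows: a weighted sum using the channel weights E: 0.6 and S: 0.4 from the Aggregates panel, plus the context modifier. The SETL function is a hypothetical reconstruction fitted to the displayed values (a signed geometric mean of the channel gap and the larger channel magnitude); the page does not define SETL, so treat it as an assumption rather than the site's formula.

```python
import math

W_E, W_S = 0.6, 0.4  # channel weights shown in the Aggregates panel


def combined_score(editorial, structural, modifier=0.0):
    """Assumed combination: weighted channel sum plus context modifier."""
    return W_E * editorial + W_S * structural + modifier


def setl(editorial, structural):
    """Hypothetical SETL: negative when the structural channel dominates."""
    gap = editorial - structural
    magnitude = math.sqrt(abs(gap) * max(abs(editorial), abs(structural)))
    return math.copysign(magnitude, gap)


# section: (editorial, structural, context modifier, displayed score, displayed SETL)
rows = {
    "Article 19": (0.50, 0.60, 0.30, 0.84, -0.24),
    "Article 27": (0.50, 0.55, 0.30, 0.82, -0.17),
    "Article 13": (0.40, 0.45, 0.00, 0.42, -0.15),
    "Preamble": (0.25, 0.35, 0.00, 0.29, -0.19),
    "Article 30": (0.15, 0.20, 0.00, 0.17, -0.10),
}

results = {}
for section, (e, s, mod, shown_score, shown_setl) in rows.items():
    results[section] = (combined_score(e, s, mod), setl(e, s))
    score, gap = results[section]
    print(f"{section}: score {score:.2f} (displayed {shown_score:+.2f}), "
          f"SETL {gap:.3f} (displayed {shown_setl:+.2f})")
```

For Article 19, for example, 0.6 × 0.50 + 0.4 × 0.60 + 0.30 gives 0.84, which matches the heatmap entry.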

Psychological Safety
experimental
How safe this content is to read, independent of rights stance. Scores are ordinal (rank-order only).
PSQ
+0.4
Per-model PSQ
L4P +0.2 L3P +0.4
Supplementary Signals
How this content communicates, beyond directional lean.
Epistemic Quality
How well-sourced and evidence-based is this content?
0.82 medium claims
Sources
0.8
Evidence
0.8
Uncertainty
0.8
Purpose
0.9
Propaganda Flags
No manipulative rhetoric detected
0 techniques detected
Emotional Tone
Emotional character: positive/negative, intensity, authority
measured
Valence
+0.3
Arousal
0.4
Dominance
0.5
Transparency
Does the content identify its author and disclose interests?
0.67
✓ Author
Solution Orientation
Does this content offer solutions or only describe problems?
0.70 solution oriented
Reader Agency
0.8
Stakeholder Voice
Whose perspectives are represented in this content?
0.60 3 perspectives
Speaks: researchers, institution
About: software_agents, broader_scientific_community
Temporal Framing
Is this content looking backward, at the present, or forward?
prospective long term
Geographic Scope
What geographic area does this content cover?
global
Complexity
How accessible is this content to a general audience?
technical high jargon domain specific
Longitudinal 797 HN snapshots · 305 evals
Audit Trail 325 entries
2026-03-16 03:29 ap_publish AP publish failed: 401 - -
2026-03-16 03:27 eval_success PSQ evaluated: g-PSQ=0.202 (3 dims) - -
2026-03-16 03:27 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-16 03:27 ap_publish AP publish failed: 401 - -
2026-03-16 03:26 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-16 03:26 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-16 03:26 model_divergence Cross-model spread 0.41 exceeds threshold (2 models) - -
2026-03-16 03:26 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-16 03:24 ap_publish AP publish failed: 401 - -
2026-03-16 03:21 ap_publish AP publish failed: 401 - -
2026-03-16 03:19 ap_publish AP publish failed: 401 - -
2026-03-16 03:17 ap_publish AP publish failed: 401 - -
2026-03-16 03:14 ap_publish AP publish failed: 401 - -
2026-03-16 03:11 ap_publish AP publish failed: 401 - -
2026-03-16 03:10 ap_publish AP publish failed: 401 - -
2026-03-16 03:06 ap_publish AP publish failed: 401 - -
2026-03-16 03:04 ap_publish AP publish failed: 401 - -
2026-03-16 03:02 ap_publish AP publish failed: 401 - -
2026-03-16 02:59 ap_publish AP publish failed: 401 - -
2026-03-16 02:57 ap_publish AP publish failed: 401 - -
2026-03-16 02:55 ap_publish AP publish failed: 401 - -
2026-03-16 02:52 ap_publish AP publish failed: 401 - -
2026-03-16 00:47 eval Evaluated by claude-haiku-4-5-20251001: +0.41 (Moderate positive) 13,978 tokens
2026-03-11 17:42 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 17:26 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 16:27 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 16:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 14:59 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 14:42 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 13:46 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 13:28 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 04:16 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 04:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 03:01 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 02:45 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-11 01:19 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-11 01:10 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 23:48 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 23:31 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 22:06 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 21:57 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 21:03 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 20:35 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 20:27 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 18:35 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 18:26 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 17:27 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 17:17 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 16:48 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 16:36 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 16:09 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 15:59 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 15:36 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 15:25 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 14:57 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 14:46 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 14:28 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 14:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 13:50 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 13:38 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 13:15 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 13:03 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 12:35 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 12:26 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 12:17 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 12:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 11:57 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 11:49 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 11:41 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 11:32 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 11:22 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 11:14 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 11:02 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 10:57 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 10:44 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 10:36 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 10:24 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 10:20 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 10:07 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 10:04 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 09:50 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 09:47 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 09:33 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 09:30 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 09:16 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 09:12 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 08:59 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 08:55 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 08:41 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 08:38 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 08:25 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 08:22 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 08:06 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 08:05 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 07:48 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 07:47 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 07:32 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 07:27 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 07:14 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 07:07 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 06:55 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 06:47 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 06:38 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 06:27 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 06:16 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 06:08 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 05:59 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 05:50 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 05:41 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 05:34 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 05:25 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 05:17 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 05:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 05:00 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 04:49 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 04:40 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 04:16 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 04:07 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 04:01 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 03:50 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 03:44 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 03:33 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 03:25 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 03:16 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 03:07 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 02:58 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 02:50 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 02:42 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 02:34 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 02:22 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 02:16 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 02:04 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 01:59 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 01:47 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 01:43 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 01:28 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 01:27 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 01:10 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 01:09 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 00:53 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 00:51 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-10 00:31 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-10 00:29 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 23:49 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 23:48 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 23:34 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 23:30 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 23:17 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 23:12 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 23:01 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 22:55 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 22:44 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 22:38 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 22:28 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 22:19 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 22:09 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 22:01 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 21:52 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 21:43 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 21:36 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 21:25 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 21:21 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 21:07 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 21:02 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 20:44 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 20:40 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 20:28 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 20:24 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 20:10 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 20:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 19:51 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 19:51 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 19:34 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 19:32 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 19:16 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 19:12 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 18:58 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 18:54 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 18:40 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 18:35 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 18:22 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 18:15 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 18:04 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 17:58 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 17:46 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 17:38 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 17:28 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 17:17 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 17:11 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 16:59 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 16:54 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 16:41 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 16:37 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 16:21 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 16:20 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 16:03 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 16:02 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 15:44 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 15:44 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 15:28 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 15:26 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 15:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 15:09 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 14:56 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 14:50 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 14:38 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 14:32 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 14:21 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 14:15 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 14:04 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 13:57 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 13:48 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 13:38 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 13:29 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 13:20 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 13:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 13:03 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 12:58 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 12:40 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 12:36 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 12:21 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 11:53 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 11:41 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 11:39 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 11:20 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 11:14 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 10:46 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 10:34 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 10:25 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 10:20 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 10:08 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 09:38 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 09:33 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 09:29 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 09:23 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 09:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 08:58 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 08:53 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 08:20 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 08:11 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 08:05 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-09 08:02 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 07:57 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 07:49 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 07:44 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 07:15 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 07:02 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-09 06:54 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 06:50 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 06:43 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 06:09 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 05:57 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-09 05:43 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 05:36 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 04:59 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 04:54 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 04:47 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-09 04:33 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 04:28 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 04:27 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 04:22 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 03:47 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 03:42 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.07
2026-03-09 03:21 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 03:17 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 02:39 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 02:34 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) 0.00
2026-03-09 02:13 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 02:12 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 02:07 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 01:32 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 01:28 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-09 01:08 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-09 01:01 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-09 00:22 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-09 00:20 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.07
2026-03-09 00:00 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 23:53 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 22:59 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 22:57 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-08 22:41 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 22:30 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 21:45 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 21:43 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.07
2026-03-08 21:19 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 21:10 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 20:15 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 20:12 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-08 19:55 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 19:50 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 19:47 eval Evaluated by llama-3.3-70b-wai: +0.09 (Neutral) +0.01
reasoning
Technical paper on software engineering
2026-03-08 18:23 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 18:22 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-08 18:19 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 18:17 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.07
2026-03-08 17:34 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 17:32 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 16:37 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 16:25 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-08 15:12 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 15:10 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 14:08 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 14:07 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-08 14:03 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 14:02 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-08 13:57 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 13:56 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 12:53 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-08 12:51 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 12:49 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 12:48 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 11:40 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) +0.07
2026-03-08 11:37 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 11:36 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 11:35 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 10:28 eval Evaluated by llama-3.3-70b-wai-psq: +0.41 (Moderate positive) -0.07
2026-03-08 10:23 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 10:23 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive) 0.00
2026-03-08 10:22 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral) +0.08
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 09:24 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive) 0.00
2026-03-08 09:22 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
Technical paper on software engineering
2026-03-08 09:19 eval Evaluated by llama-4-scout-wai-psq: +0.20 (Mild positive)
2026-03-08 09:19 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive)
2026-03-08 09:19 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral)
reasoning
Technical paper on software engineering, no human rights discussion
2026-03-08 09:18 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral)
reasoning
Technical paper on software engineering