-0.25 Show HN: Open-source playground to red-team AI agents with exploits published (github.com S:+0.05 )
20 points by zachdotai 4 hours ago | 2 comments on HN | Neutral High agreement (3 models) Mixed · v3.7 · 2026-03-16 00:01:17 0
Summary Digital Access & Safety Neutral
This GitHub repository hosts a software project for adversarial testing of AI agent defenses. The page presents minimal editorial content focused on technical capability rather than human rights principles, though the project engages indirectly with AI safety concerns. GitHub's structural infrastructure (encryption, no tracking, accessibility features) provides baseline protections for privacy and expression (Articles 12, 19), while the repository itself functions as a neutral platform for shared technical knowledge creation.
Article Heatmap
Preamble: ND — Preamble Preamble: No Data — Preamble P Article 1: ND — Freedom, Equality, Brotherhood Article 1: No Data — Freedom, Equality, Brotherhood 1 Article 2: ND — Non-Discrimination Article 2: No Data — Non-Discrimination 2 Article 3: ND — Life, Liberty, Security Article 3: No Data — Life, Liberty, Security 3 Article 4: ND — No Slavery Article 4: No Data — No Slavery 4 Article 5: ND — No Torture Article 5: No Data — No Torture 5 Article 6: ND — Legal Personhood Article 6: No Data — Legal Personhood 6 Article 7: ND — Equality Before Law Article 7: No Data — Equality Before Law 7 Article 8: ND — Right to Remedy Article 8: No Data — Right to Remedy 8 Article 9: ND — No Arbitrary Detention Article 9: No Data — No Arbitrary Detention 9 Article 10: ND — Fair Hearing Article 10: No Data — Fair Hearing 10 Article 11: ND — Presumption of Innocence Article 11: No Data — Presumption of Innocence 11 Article 12: ND — Privacy Article 12: No Data — Privacy 12 Article 13: ND — Freedom of Movement Article 13: No Data — Freedom of Movement 13 Article 14: ND — Asylum Article 14: No Data — Asylum 14 Article 15: ND — Nationality Article 15: No Data — Nationality 15 Article 16: ND — Marriage & Family Article 16: No Data — Marriage & Family 16 Article 17: ND — Property Article 17: No Data — Property 17 Article 18: ND — Freedom of Thought Article 18: No Data — Freedom of Thought 18 Article 19: -0.05 — Freedom of Expression 19 Article 20: ND — Assembly & Association Article 20: No Data — Assembly & Association 20 Article 21: ND — Political Participation Article 21: No Data — Political Participation 21 Article 22: ND — Social Security Article 22: No Data — Social Security 22 Article 23: ND — Work & Equal Pay Article 23: No Data — Work & Equal Pay 23 Article 24: ND — Rest & Leisure Article 24: No Data — Rest & Leisure 24 Article 25: ND — Standard of Living Article 25: No Data — Standard of Living 25 Article 26: ND — Education Article 26: No Data — Education 26 Article 27: ND — Cultural Participation Article 27: No Data — Cultural Participation 27 Article 28: ND — Social & International Order Article 28: No Data — Social & International Order 28 Article 29: ND — Duties to Community Article 29: No Data — Duties to Community 29 Article 30: ND — No Destruction of Rights Article 30: No Data — No Destruction of Rights 30
Negative Neutral Positive No Data
Aggregates
E
-0.25
S
+0.05
Weighted Mean -0.05 Unweighted Mean -0.05
Max -0.05 Article 19 Min -0.05 Article 19
Signal 1 No Data 30
Volatility 0.00 (Low)
Negative 1 Channels E: 0.6 S: 0.4
SETL -0.27 Structural-dominant
FW Ratio 58% 19 facts · 14 inferences
Agreement High 3 models · spread ±0.025
Evidence 12% coverage
1H 5M 5L 30 ND
Theme Radar
Foundation Security Legal Privacy & Movement Personal Expression Economic & Social Cultural Order & Duties Foundation: 0.00 (0 articles) Security: 0.00 (0 articles) Legal: 0.00 (0 articles) Privacy & Movement: 0.00 (0 articles) Personal: 0.00 (0 articles) Expression: -0.05 (1 articles) Economic & Social: 0.00 (0 articles) Cultural: 0.00 (0 articles) Order & Duties: 0.00 (0 articles)
Editorial Channel
What the content says
-0.25
Article 19 Freedom of Expression
High Framing Practice
Editorial
-0.25
SETL
-0.27

Repository description frames AI agent adversarial testing ('stress-test AI agent defenses through adversarial play'). While this engages with AI safety, the framing emphasizes testing/attack rather than freedom of expression itself. The content does not discuss free speech principles.

ND
Preamble Preamble
Medium Practice

No editorial content addressing human dignity or universal human rights principles.

ND
Article 1 Freedom, Equality, Brotherhood
Low Practice

Repository does not discuss human equality or rights.

ND
Article 2 Non-Discrimination
Low Practice

No editorial content addressing discrimination or protected characteristics.

ND
Article 3 Life, Liberty, Security
Medium Practice

No editorial content addressing right to life or security.

ND
Article 4 No Slavery

No evidence of slavery or servitude discussion or practice.

ND
Article 5 No Torture

No content addressing torture or cruel treatment.

ND
Article 6 Legal Personhood

No content on right to legal personality.

ND
Article 7 Equality Before Law

No content on equality before law.

ND
Article 8 Right to Remedy

No content on remedies for rights violations.

ND
Article 9 No Arbitrary Detention

No content on arbitrary arrest or detention.

ND
Article 10 Fair Hearing

No content on fair trial or due process.

ND
Article 11 Presumption of Innocence

No content on criminal liability or presumption of innocence.

ND
Article 12 Privacy
Medium Practice

No editorial content on privacy.

ND
Article 13 Freedom of Movement
Low

No content on freedom of movement.

ND
Article 14 Asylum

No content on asylum or refuge.

ND
Article 15 Nationality

No content on nationality.

ND
Article 16 Marriage & Family

No content on marriage or family.

ND
Article 17 Property

No content on property rights.

ND
Article 18 Freedom of Thought
Low

No content on freedom of conscience or religion.

ND
Article 20 Assembly & Association
Low

No content on freedom of assembly or association.

ND
Article 21 Political Participation

No content on political participation.

ND
Article 22 Social Security

No content on social security or economic rights.

ND
Article 23 Work & Equal Pay

No content on work or employment rights.

ND
Article 24 Rest & Leisure

No content on rest, leisure, or limited working hours.

ND
Article 25 Standard of Living

No content on adequate standard of living or health.

ND
Article 26 Education
Medium

No editorial content on education.

ND
Article 27 Cultural Participation
Medium

No content on cultural participation or scientific advancement.

ND
Article 28 Social & International Order

No content on social and international order.

ND
Article 29 Duties to Community

No content on duties or limitations of rights.

ND
Article 30 No Destruction of Rights

No content on interpretation or misuse of rights.

Structural Channel
What the site does
Element Modifier Affects Note
br_tracking +0.05
Preamble ¶5 Article 12 Article 19
No third-party trackers detected
br_security +0.05
Article 3 Article 12
Security headers: HTTPS, HSTS, CSP
br_accessibility 0.00
Article 26 Article 27 ¶1
Accessibility: lang attr, 100% alt text
br_consent 0.00
Article 12 Article 19 Article 20 ¶2
No cookie consent banner detected
+0.05
Article 19 Freedom of Expression
High Framing Practice
Structural
+0.05
Context Modifier
+0.05
SETL
-0.27

GitHub's infrastructure (no trackers per DCP, HTTPS) supports freedom of expression; repository can be publicly forked and modified.

ND
Preamble Preamble
Medium Practice

GitHub's infrastructure implements HTTPS, HSTS, and CSP headers supporting security and privacy foundations underlying rights protection (from DCP).

ND
Article 1 Freedom, Equality, Brotherhood
Low Practice

GitHub's platform-level terms of service and anti-discrimination policies apply universally to all users regardless of status.

ND
Article 2 Non-Discrimination
Low Practice

GitHub's access controls and terms apply equally across protected categories; no visible discriminatory architecture on this repository page.

ND
Article 3 Life, Liberty, Security
Medium Practice

HTTPS and security headers (from DCP) provide technical protection for user account security and data integrity.

ND
Article 4 No Slavery

Not applicable to a software repository.

ND
Article 5 No Torture

Not applicable to a software repository.

ND
Article 6 Legal Personhood

Not applicable to a software repository.

ND
Article 7 Equality Before Law

Not applicable to a software repository.

ND
Article 8 Right to Remedy

Not applicable to a software repository.

ND
Article 9 No Arbitrary Detention

Not applicable to a software repository.

ND
Article 10 Fair Hearing

Not applicable to a software repository.

ND
Article 11 Presumption of Innocence

Not applicable to a software repository.

ND
Article 12 Privacy
Medium Practice

GitHub's infrastructure (HTTPS, no third-party trackers per DCP) protects user privacy from arbitrary interference.

ND
Article 13 Freedom of Movement
Low

Repository page does not restrict or enable movement in physical or digital sense relevant to Article 13.

ND
Article 14 Asylum

Not applicable to a software repository.

ND
Article 15 Nationality

Not applicable to a software repository.

ND
Article 16 Marriage & Family

Not applicable to a software repository.

ND
Article 17 Property

Repository itself exists within GitHub's terms; no relevant structural signals.

ND
Article 18 Freedom of Thought
Low

Repository hosting does not restrict or enable freedom of conscience.

ND
Article 20 Assembly & Association
Low

Repository does not restrict or enable assembly; GitHub's community features (discussions, issues) provide neutral infrastructure.

ND
Article 21 Political Participation

Not applicable to a software repository.

ND
Article 22 Social Security

Not applicable to a software repository.

ND
Article 23 Work & Equal Pay

Repository is not a labor context; not applicable.

ND
Article 24 Rest & Leisure

Not applicable to a software repository.

ND
Article 25 Standard of Living

Not applicable to a software repository.

ND
Article 26 Education
Medium

Repository code and documentation accessible; GitHub platform includes full alt text and accessibility features per DCP (100% alt text).

ND
Article 27 Cultural Participation
Medium

Repository participates in scientific/technical culture; GitHub's open-source framework enables shared creation and cultural participation.

ND
Article 28 Social & International Order

Not applicable to a software repository.

ND
Article 29 Duties to Community

Not applicable to a software repository.

ND
Article 30 No Destruction of Rights

Not applicable to a software repository.

Supplementary Signals
How this content communicates, beyond directional lean. Learn more
Epistemic Quality
How well-sourced and evidence-based is this content?
0.65 low claims
Sources
0.6
Evidence
0.5
Uncertainty
0.7
Purpose
0.8
Propaganda Flags
No manipulative rhetoric detected
0 techniques detected
Emotional Tone
Emotional character: positive/negative, intensity, authority
measured
Valence
+0.1
Arousal
0.3
Dominance
0.4
Transparency
Does the content identify its author and disclose interests?
0.50
✓ Author
More signals: context, framing & audience
Solution Orientation
Does this content offer solutions or only describe problems?
0.65 solution oriented
Reader Agency
0.8
Stakeholder Voice
Whose perspectives are represented in this content?
0.20 1 perspective
Speaks: institution
Temporal Framing
Is this content looking backward, at the present, or forward?
present unspecified
Geographic Scope
What geographic area does this content cover?
global
Complexity
How accessible is this content to a general audience?
technical medium jargon domain specific
Longitudinal 125 HN snapshots · 5 evals
+1 0 −1 HN
Audit Trail 13 entries
2026-03-16 02:26 eval_success PSQ evaluated: g-PSQ=0.600 (3 dims) - -
2026-03-16 02:26 eval Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive)
2026-03-16 02:23 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-16 02:23 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-16 02:23 eval Evaluated by llama-4-scout-wai: 0.00 (Neutral)
reasoning
Technical content, no explicit human rights discussion
2026-03-16 00:11 eval_success PSQ evaluated: g-PSQ=0.000 (3 dims) - -
2026-03-16 00:11 eval Evaluated by llama-3.3-70b-wai-psq: 0.00 (Neutral)
2026-03-16 00:08 eval_success Lite evaluated: Neutral (0.00) - -
2026-03-16 00:08 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-16 00:08 eval Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
reasoning
Technical content with zero rights discussion
2026-03-16 00:01 eval_success Evaluated: Neutral (-0.05) - -
2026-03-16 00:01 rater_validation_warn Validation warnings for model claude-haiku-4-5-20251001: 0W 10R - -
2026-03-16 00:01 eval Evaluated by claude-haiku-4-5-20251001: -0.05 (Neutral) 11,708 tokens