+0.42 The Webpage Has Instructions. The Agent Has Your Credentials

Name: HRCB Evaluation: The Webpage Has Instructions. The Agent Has Your Credentials
Item: The Webpage Has Instructions. The Agent Has Your Credentials
Rating: 0.418
Author: Human Rights Observatory

Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

Model: @cf/meta/llama-4-scout-17b-16e-instruct lite ND @cf/meta/llama-3.3-70b-instruct-fp8-fast lite ND @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 claude-haiku-4-5-20251001 +0.42 Compare

Model Comparison

Model	Editorial	Structural	Class	Conf	SETL	Theme
@cf/meta/llama-4-scout-17b-16e-instruct lite	ND	ND	—	0.80	—	—
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	ND	ND	—	0.70	—	—
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	0.00	ND	Neutral	0.80	0.00	Cyber Security
@cf/meta/llama-4-scout-17b-16e-instruct lite	0.00	ND	Neutral	1.00	0.00	AI Security
claude-haiku-4-5-20251001	+0.42	+0.26	Moderate positive	0.34	0.25	Agent Security & System Integrity

Section	@cf/meta/llama-4-scout-17b-16e-instruct lite	@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	@cf/meta/llama-3.3-70b-instruct-fp8-fast lite	@cf/meta/llama-4-scout-17b-16e-instruct lite	claude-haiku-4-5-20251001
Preamble	ND	ND	ND	ND	0.39
Article 1	ND	ND	ND	ND	0.29
Article 2	ND	ND	ND	ND	0.40
Article 3	ND	ND	ND	ND	0.60
Article 4	ND	ND	ND	ND	ND
Article 5	ND	ND	ND	ND	ND
Article 6	ND	ND	ND	ND	ND
Article 7	ND	ND	ND	ND	ND
Article 8	ND	ND	ND	ND	ND
Article 9	ND	ND	ND	ND	0.32
Article 10	ND	ND	ND	ND	ND
Article 11	ND	ND	ND	ND	ND
Article 12	ND	ND	ND	ND	0.23
Article 13	ND	ND	ND	ND	0.28
Article 14	ND	ND	ND	ND	ND
Article 15	ND	ND	ND	ND	ND
Article 16	ND	ND	ND	ND	ND
Article 17	ND	ND	ND	ND	ND
Article 18	ND	ND	ND	ND	ND
Article 19	ND	ND	ND	ND	0.75
Article 20	ND	ND	ND	ND	0.36
Article 21	ND	ND	ND	ND	0.29
Article 22	ND	ND	ND	ND	ND
Article 23	ND	ND	ND	ND	ND
Article 24	ND	ND	ND	ND	ND
Article 25	ND	ND	ND	ND	0.31
Article 26	ND	ND	ND	ND	0.54
Article 27	ND	ND	ND	ND	0.54
Article 28	ND	ND	ND	ND	0.34
Article 29	ND	ND	ND	ND	0.29
Article 30	ND	ND	ND	ND	0.42

+0.42	The Webpage Has Instructions. The Agent Has Your Credentials (openguard.sh S:+0.26 )
	29 points by everlier 10 hours ago \| 14 comments on HN \| Moderate positive Contested Low agreement (3 models) Editorial · v3.7 · 2026-03-15 22:46:07 0

Summary Agent Security & System Integrity Advocates

This technical blog post advocates for comprehensive prompt-injection defense in AI agent systems, framing the vulnerability as a systemic threat to user autonomy, privacy, and trustworthiness in digital infrastructure. The content educates builders on attack mechanics, documents industry response efforts, and prescribes defensive baselines—treating security architecture as a prerequisite for preserving human rights in agent-driven workflows.

Rights Tensions 3 pairs

Art 19 ↔ Art 13 — Freedom of expression and information (Article 19) is partially restricted by outbound connection limits and link-safety controls designed to prevent prompt-injection attacks; the content acknowledges this trade-off but prioritizes security.

Art 3 ↔ Art 29 — Right to security and life (Article 3) is protected by approval gates and connector review, which impose duties and limitations on builder and user freedoms (Article 29); the content frames these constraints as proportionate and necessary.

Art 12 ↔ Art 20 — Right to privacy (Article 12) in agent memory and data handling is balanced against collective standards and shared transparency practices (Article 20); memory poisoning prevention requires visibility that may expose some privacy concerns.

Article Heatmap

Negative Neutral Positive No Data

Aggregates

+0.42

+0.26

Weighted Mean	+0.42	Unweighted Mean	+0.40
Max	+0.75 Article 19	Min	+0.23 Article 12
Signal	16	No Data	15
Volatility	0.14 (Medium)
Negative	0	Channels	E: 0.6 S: 0.4
SETL ℹ	+0.25	Editorial-dominant
FW Ratio ℹ	58%	42 facts · 30 inferences
Agreement	Low	3 models · spread ±0.209

Evidence 34% coverage ℹ

 3H  12M  1L  15 ND 

Theme Radar

Editorial Channel

What the content says

+0.60

Article 19 Freedom of Expression

High Framing Advocacy

Editorial

+0.60

SETL

+0.39

Content extensively advocates for transparency, disclosure, and informed decision-making in agent-system design. Emphasizes the right to receive and seek information about security risks and system behavior. Frames prompt-injection disclosure as essential to user understanding.

+0.55

Article 3 Life, Liberty, Security

High Framing Advocacy

Editorial

+0.55

SETL

+0.37

Content advocates for system design that preserves user security and autonomy in agent-driven workflows. Discusses how architectural decisions impact user safety.

+0.50

Article 2 Non-Discrimination

Medium Framing

Editorial

+0.50

SETL

+0.35

Content emphasizes dignity through trustworthiness and system integrity. Describes how agents should reliably serve user intent without corruption.

+0.50

Article 26 Education

High Framing Advocacy

Editorial

+0.50

SETL

+0.27

Content advocates for education and literacy regarding agent security, prompt injection, and system design principles. Frames technical understanding as essential to user empowerment and informed decision-making.

+0.50

Article 30 No Destruction of Rights

Medium Framing Advocacy

Editorial

+0.50

SETL

+0.32

Content defends the right to security and trustworthiness in agent systems against misinterpretation. Advocates that prompt-injection defense is not a violation of freedom but a prerequisite for it.

+0.45

Preamble Preamble

Medium Framing

Editorial

+0.45

SETL

+0.26

Content frames prompt injection as a systemic security problem that threatens human agency, autonomy, and the integrity of digital systems. Emphasizes shared responsibility for building trustworthy systems.

+0.45

Article 27 Cultural Participation

Medium Framing Advocacy

Editorial

+0.45

SETL

+0.26

Content advocates for participation in the cultural and technical commons of agent-system design. Emphasizes shared responsibility and collective standards-setting. Frames prompt-injection defense as a community practice.

+0.40

Article 9 No Arbitrary Detention

Medium Framing

Editorial

+0.40

SETL

+0.28

Content frames prompt injection as an arbitrary action—attackers cause agents to act contrary to user intent without authorization. Emphasizes threat to freedom from arbitrary interference.

+0.40

Article 20 Assembly & Association

Medium Framing Advocacy

Editorial

+0.40

SETL

+0.20

Content advocates for collective action and industry-wide standards on prompt injection. References multi-vendor efforts (OpenAI, Anthropic, Google, Microsoft) and standards bodies (MCP, A2A). Frames prompt injection as a shared problem requiring coordinated defense.

+0.40

Article 28 Social & International Order

Medium Framing

Editorial

+0.40

SETL

+0.24

Content frames prompt-injection defense as essential to maintaining social and international order based on human rights protections. Advocates for systemic, architectural approaches to prevent harm at scale.

+0.35

Article 1 Freedom, Equality, Brotherhood

Medium Framing

Editorial

+0.35

SETL

+0.23

Content discusses prompt injection as a threat to equal protection and non-discrimination in agent systems. Acknowledges that attack success rates vary, implying differential vulnerability.

+0.35

Article 21 Political Participation

Medium Framing

Editorial

+0.35

SETL

+0.23

Content frames access to secure, trustworthy systems as a public concern. Advocates for builders to adopt defensive practices, implying that agent-system security is a matter of public interest.

+0.35

Article 25 Standard of Living

Medium Framing

Editorial

+0.35

SETL

+0.19

Content frames prompt-injection defense as essential to user security and welfare in digital systems. Discusses how agent-system compromise can cause financial, data, and operational harm.

+0.35

Article 29 Duties to Community

Medium Framing

Editorial

+0.35

SETL

+0.23

Content frames builder and user responsibilities in agent-system design. Emphasizes that freedom from prompt-injection is balanced against security constraints; advocates for proportionate controls.

+0.30

Article 13 Freedom of Movement

Low Framing

Editorial

+0.30

SETL

+0.12

Content does not directly address freedom of movement, but the discussion of outbound connection limits and link-safety controls relates tangentially to agent mobility.

+0.25

Article 12 Privacy

Medium Framing

Editorial

+0.25

SETL

+0.11

Content discusses privacy threats from prompt injection: data leaks, unauthorized file reads, memory poisoning that persists across sessions. Frames privacy as a security concern in agent systems.

Article 4 No Slavery

No observable engagement with slavery, servitude, or forced labor.

Article 5 No Torture

No observable engagement with torture or cruel treatment.

Article 6 Legal Personhood

No observable engagement with personhood or legal recognition.

Article 7 Equality Before Law

No observable engagement with equal protection under law in legal/jurisdictional sense.

Article 8 Right to Remedy

No observable engagement with remedies for violations.

Article 10 Fair Hearing

No observable engagement with fair and public hearing or due process in judicial context.

Article 11 Presumption of Innocence

No observable engagement with criminal prosecution or legal innocence presumption.

Article 14 Asylum

No observable engagement with asylum or refuge.

Article 15 Nationality

No observable engagement with nationality or state membership.

Article 16 Marriage & Family

No observable engagement with marriage, family, or property rights.

Article 17 Property

No observable engagement with property rights in traditional sense.

Article 18 Freedom of Thought

No observable engagement with freedom of thought, conscience, or religion.

Article 22 Social Security

No observable engagement with social security, cultural participation, or economic welfare.

Article 23 Work & Equal Pay

No observable engagement with work, employment, or fair wages.

Article 24 Rest & Leisure

No observable engagement with rest, leisure, or time.

Structural Channel

What the site does

Domain Context Profile

Element	Modifier	Affects	Note
Legal & Terms
Privacy	—		No privacy policy or data handling disclosure visible on provided content.
Terms of Service	—		No Terms of Service visible on provided content.
Identity & Mission
Mission	+0.15	Article 3 Article 19 Article 27	OpenGuard appears focused on agent security and prompt-injection defense, supporting secure digital infrastructure and informed decision-making.
Editorial Code	—		No explicit editorial guidelines or corrections policy visible.
Ownership	—		Published by Jitera Labs (publisher field); OpenGuard Team (author). Commercial entity; no conflict apparent.
Access & Distribution
Access Model	+0.10	Article 19 Article 26	Content appears freely accessible; no paywall or registration barrier observed in provided markup.
Ad/Tracking	—		No visible ad tags or tracking pixels in provided HTML; CDN font loading present.
Accessibility	—		Semantic HTML structure present; color contrast adequate for dark theme; no ARIA labels visible in provided markup.

+0.35

Article 19 Freedom of Expression

High Framing Advocacy

Structural

+0.35

Context Modifier

+0.25

SETL

+0.39

Blog content is freely accessible, published under author attribution, and provides detailed technical information without paywalls or access restrictions.

+0.35

Article 26 Education

High Framing Advocacy

Structural

+0.35

Context Modifier

+0.10

SETL

+0.27

Blog post itself functions as educational material; freely accessible, detailed technical content provided without paywalls.

+0.30

Preamble Preamble

Medium Framing

Structural

+0.30

Context Modifier

0.00

SETL

+0.26

Clear, accessible blog format with navigation; no paywalls or access restrictions observed.

+0.30

Article 3 Life, Liberty, Security

High Framing Advocacy

Structural

+0.30

Context Modifier

+0.15

SETL

+0.37

Content advocates for security practices without restricting access or participation.

+0.30

Article 20 Assembly & Association

Medium Framing Advocacy

Structural

+0.30

Context Modifier

0.00

SETL

+0.20

Content is published as part of an open technical community; no restrictions on collective participation or assembly observable.

+0.30

Article 27 Cultural Participation

Medium Framing Advocacy

Structural

+0.30

Context Modifier

+0.15

SETL

+0.26

Blog format facilitates community knowledge-sharing; references public standards (MCP, A2A) and open-source practices.

+0.30

Article 30 No Destruction of Rights

Medium Framing Advocacy

Structural

+0.30

Context Modifier

0.00

SETL

+0.32

Content itself exemplifies non-restriction of rights; freely published technical analysis.

+0.25

Article 2 Non-Discrimination

Medium Framing

Structural

+0.25

Context Modifier

0.00

SETL

+0.35

Blog structure does not restrict or elevate any user group; content is equally accessible.

+0.25

Article 13 Freedom of Movement

Low Framing

Structural

+0.25

Context Modifier

0.00

SETL

+0.12

No observable restrictions on user movement or navigation on the website.

+0.25

Article 25 Standard of Living

Medium Framing

Structural

+0.25

Context Modifier

0.00

SETL

+0.19

No observable harm or restriction to user welfare on the website.

+0.25

Article 28 Social & International Order

Medium Framing

Structural

+0.25

Context Modifier

0.00

SETL

+0.24

Content supports shared standards and international vendor coordination; no restrictions on order or stability observable.

+0.20

Article 1 Freedom, Equality, Brotherhood

Medium Framing

Structural

+0.20

Context Modifier

0.00

SETL

+0.23

No differential access or content restriction by user status observed.

+0.20

Article 9 No Arbitrary Detention

Medium Framing

Structural

+0.20

Context Modifier

0.00

SETL

+0.28

Blog format is transparent and non-arbitrary in structure.

+0.20

Article 12 Privacy

Medium Framing

Structural

+0.20

Context Modifier

0.00

SETL

+0.11

Blog structure does not intrude on user privacy; no excessive data collection observable.

+0.20

Article 21 Political Participation

Medium Framing

Structural

+0.20

Context Modifier

0.00

SETL

+0.23

No observable participation or governance structures on the website itself.

+0.20

Article 29 Duties to Community

Medium Framing

Structural

+0.20

Context Modifier

0.00

SETL

+0.23

Blog structure does not restrict user freedoms; educational content is freely available.

Article 4 No Slavery

No structural signals related to Article 4.

Article 5 No Torture

No structural signals related to Article 5.

Article 6 Legal Personhood

No structural signals related to Article 6.

Article 7 Equality Before Law

No structural signals related to Article 7.

Article 8 Right to Remedy

No structural signals related to Article 8.

Article 10 Fair Hearing

No observable engagement with fair and public hearing or due process in judicial context.

Article 11 Presumption of Innocence

No structural signals related to Article 11.

Article 14 Asylum

No structural signals related to Article 14.

Article 15 Nationality

No structural signals related to Article 15.

Article 16 Marriage & Family

No structural signals related to Article 16.

Article 17 Property

No structural signals related to Article 17.

Article 18 Freedom of Thought

No structural signals related to Article 18.

Article 22 Social Security

No structural signals related to Article 22.

Article 23 Work & Equal Pay

No structural signals related to Article 23.

Article 24 Rest & Leisure

No structural signals related to Article 24.

Supplementary Signals

How this content communicates, beyond directional lean. Learn more

Epistemic Quality ℹ

How well-sourced and evidence-based is this content?

0.82 medium claims

Sources		0.8
Evidence		0.8
Uncertainty		0.8
Purpose		0.9

Propaganda Flags ℹ

2 manipulative rhetoric techniques found

2 techniques detected

appeal to fear

The headline 'The Webpage Has Instructions. The Agent Has Your Credentials' and the phrase 'The first major prompt-injection incident with real financial damage will probably involve a multi-agent workflow' invoke fear of system compromise and loss of control.

causal oversimplification

The claim that 'That incident...will do for agent security what the 2013 Target breach did for network segmentation' oversimplifies the relationship between a single incident and broad industry change.

Emotional Tone ℹ

Emotional character: positive/negative, intensity, authority

urgent

Valence		-0.3
Arousal		0.7
Dominance		0.6

Transparency ℹ

Does the content identify its author and disclose interests?

0.50

✓ Author ✗ Conflicts ✗ Funding

More signals: context, framing & audience

Solution Orientation ℹ

Does this content offer solutions or only describe problems?

0.65 mixed

Reader Agency

0.8

Stakeholder Voice ℹ

Whose perspectives are represented in this content?

0.55 5 perspectives

Speaks: institutioncorporation

About: individualscorporationgovernment

Temporal Framing ℹ

Is this content looking backward, at the present, or forward?

mixed short term

Geographic Scope ℹ

What geographic area does this content cover?

global

United States

Complexity ℹ

How accessible is this content to a general audience?

technical high jargon domain specific

Longitudinal 190 HN snapshots · 17 evals

Audit Trail 37 entries

2026-03-16 01:19	ap_publish	AP publish failed: 401	- -
2026-03-16 01:16	ap_publish	AP publish failed: 401	- -
2026-03-16 01:14	ap_publish	AP publish failed: 401	- -
2026-03-16 01:12	ap_publish	AP publish failed: 401	- -
2026-03-16 01:09	ap_publish	AP publish failed: 401	- -
2026-03-16 01:06	ap_publish	AP publish failed: 401	- -
2026-03-16 01:04	ap_publish	AP publish failed: 401	- -
2026-03-16 01:02	ap_publish	AP publish failed: 401	- -
2026-03-16 00:59	ap_publish	AP publish failed: 401	- -
2026-03-16 00:57	ap_publish	AP publish failed: 401	- -
2026-03-16 00:54	ap_publish	AP publish failed: 401	- -
2026-03-16 00:51	ap_publish	AP publish failed: 401	- -
2026-03-16 00:49	ap_publish	AP publish failed: 401	- -
2026-03-16 00:46	ap_publish	AP publish failed: 401	- -
2026-03-16 00:44	ap_publish	AP publish failed: 401	- -
2026-03-16 00:41	ap_publish	AP publish failed: 401	- -
2026-03-16 00:39	ap_publish	AP publish failed: 401	- -
2026-03-16 00:37	ap_publish	AP publish failed: 401	- -
2026-03-16 00:35	eval_success	PSQ evaluated: g-PSQ=0.440 (3 dims)	- -
2026-03-16 00:35	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-16 00:33	ap_publish	AP publish failed: 401	- -
2026-03-16 00:20	eval	Evaluated by llama-3.3-70b-wai-psq: +0.01 (Neutral)
2026-03-16 00:17	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning Technical blog post on security threat
2026-03-15 23:59	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 22:46	eval	Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 18,125 tokens
2026-03-15 21:37	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 21:32	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 20:55	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 20:52	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 20:18	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 20:16	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 19:43	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 19:41	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 19:05	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 19:03	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.
2026-03-15 18:16	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive)
2026-03-15 18:15	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
	reasoning Technical blog post on prompt injection attacks in AI agents, no explicit human rights discussion.

build ee2b489+gzrb · deployed 2026-03-10 22:52 UTC · evaluated 2026-03-16 01:19:08 UTC