Tags | Research Note

Tags

security

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction

Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory

SecAlign: Defending Against Prompt Injection with Preference Optimization

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

StruQ: Defending Against Prompt Injection with Structured Queries

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

machine-learning

SecAlign: Defending Against Prompt Injection with Preference Optimization

StruQ: Defending Against Prompt Injection with Structured Queries

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

llm

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

SecAlign: Defending Against Prompt Injection with Preference Optimization

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

StruQ: Defending Against Prompt Injection with Structured Queries

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

prompt-injection

SecAlign: Defending Against Prompt Injection with Preference Optimization

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

StruQ: Defending Against Prompt Injection with Structured Queries

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

indirect-prompt-injection

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

attack

Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection

llm-agent

MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction

Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory

ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems

isolation

ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems

benchmark

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

defense

Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study

SecAlign: Defending Against Prompt Injection with Preference Optimization

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

StruQ: Defending Against Prompt Injection with Structured Queries

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

attention

Attention Is All You Need

Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs

structured-query

StruQ: Defending Against Prompt Injection with Structured Queries

llm-agents

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

tool-use

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

evaluation

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents

ai

Attention Is All You Need

nlp

Attention Is All You Need

transformer

Attention Is All You Need

sequence-to-sequence

Attention Is All You Need

planning

IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents

ai-agent

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

detection

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents

preference-optimization

SecAlign: Defending Against Prompt Injection with Preference Optimization

dpo

SecAlign: Defending Against Prompt Injection with Preference Optimization

memory-defense

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory

adversarial-ml

MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction

Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory

in-context-learning

MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction

A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory

memory-poisoning

MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction

Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study

authorization

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

tool-call

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

mcp

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

agentic-ai

Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents

distributed-systems

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

provenance

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

agentic-workflow

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

hpc

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

hallucination

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows

responsible-ai

PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows