Tags
security
machine-learning
llm
prompt-injection
indirect-prompt-injection
attack
llm-agent
isolation
benchmark
defense
attention
structured-query
llm-agents
tool-use
evaluation
ai
nlp
transformer
sequence-to-sequence
planning
ai-agent
detection
preference-optimization
dpo
memory-defense
adversarial-ml
in-context-learning
memory-poisoning
authorization
tool-call
mcp
agentic-ai
distributed-systems
provenance
agentic-workflow
hpc
hallucination
responsible-ai
security
Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents
MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction
Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
SecAlign: Defending Against Prompt Injection with Preference Optimization
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
StruQ: Defending Against Prompt Injection with Structured Queries
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
machine-learning
SecAlign: Defending Against Prompt Injection with Preference Optimization
StruQ: Defending Against Prompt Injection with Structured Queries
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
llm
PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows
SecAlign: Defending Against Prompt Injection with Preference Optimization
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
StruQ: Defending Against Prompt Injection with Structured Queries
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
prompt-injection
SecAlign: Defending Against Prompt Injection with Preference Optimization
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
StruQ: Defending Against Prompt Injection with Structured Queries
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
indirect-prompt-injection
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection
llm-agent
MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction
Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
ISOLATEGPT: An Execution Isolation Architecture for LLM-Based Agentic Systems
benchmark
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
defense
Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study
SecAlign: Defending Against Prompt Injection with Preference Optimization
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
StruQ: Defending Against Prompt Injection with Structured Queries
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
attention
Attention Is All You Need
Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs
structured-query
StruQ: Defending Against Prompt Injection with Structured Queries
llm-agents
IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
tool-use
IPIGUARD: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
evaluation
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
transformer
Attention Is All You Need
sequence-to-sequence
Attention Is All You Need
ai-agent
Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
preference-optimization
SecAlign: Defending Against Prompt Injection with Preference Optimization
adversarial-ml
MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction
Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
in-context-learning
MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction
A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory
memory-poisoning
MINJA: Memory Injection Attacks on LLM Agents via Query-Only Interaction
Memory Poisoning Attack and Defense on Memory-Based LLM Agents: An Empirical Study
mcp
PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows
Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents
distributed-systems
PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows
agentic-workflow
PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows