Search

Showing top 14 results for "AI prompt injection"

People also ask

How do attackers poison AI systems in this stage?

In the poison stage, the attacker’s goal is to place malicious inputs into locations where they will ultimately be processed by the AI model. Two primary techniques dominate: Direct prompt injection: The attacker is the user, and provides inputs via normal user interactions. Impact is typically scoped to the attacker’s session but is useful for probing behaviors. Indirect prompt injection: The attacker poisons data that the application ingests on behalf of other users (e.g., RAG databases, shared documents). This is where impact scales. Text-based prompt infection is the most common technique

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog
What is a multistage pipeline for AI agent customization?

In practice, the most effective agent customization combines multiple techniques in sequence. The stages of a representative pipeline are outlined below.  Start with system prompts, tool and skill definitions, and retrieval to establish baseline behavior. 

Mastering Agentic Techniques: AI Agent Customization | NVIDIA Technical Blog
What happens during the recon stage of the AI Kill Chain? 

In the recon stage, the attacker maps the system to plan their attack. Key questions an attacker is asking at this point include: What are the routes by which data I control can get into the AI model?  What tools, Model Context Protocol (MCP) servers, or other functions does the application use that might be exploitable? What open source libraries does the application use? Where are system guardrails applied, and how do they work? What kinds of system memory does the application use? Recon is often interactive. Attackers will probe the system to observe errors and behavior. The more observ

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog
How do attackers hijack AI model behavior once poisoning succeeds?

The hijack stage is where the attack becomes active. Malicious inputs, successfully placed in the poison stage, are ingested by the model, hijacking its output to serve attacker objectives. Common hijack patterns include: Attacker-controlled tool use: Forcing the model to call specific tools with attacker-defined parameters. Data exfiltration: Encoding sensitive data from the model’s context into outputs (e.g., URLs, CSS, file writes). Misinformation generation: Crafting responses that are deliberately false or misleading. Context-specific payloads: Triggering malicious behavior only in tar

Modeling Attacks on AI-Powered Apps with the AI Kill Chain Framework | NVIDIA Technical Blog