Sveriges mest populära poddar

Build Wiz AI Show

Task-in-Prompt (TIP) adversarial attacks

14 min•25 augusti 2025

Tune into our latest episode where we dive deep into Task-in-Prompt (TIP) adversarial attacks, a novel class of jailbreaks that cleverly embed sequence-to-sequence tasks within prompts to bypass LLM safety safeguards. We'll explore how these attacks successfully generate prohibited content across state-of-the-art models like GPT-4o and LLaMA 3.2, revealing critical weaknesses in current defense mechanisms. Discover why traditional safeguards, including keyword-based filters, often fail against these sophisticated, indirect exploits.

Fler avsnitt av Build Wiz AI Show

Policy on the AI Exponential

11 juni•24 min

The Rise of Recursive Self-Improvement at Anthropic

5 juni•21 min

AlphaProof Nexus: Advancing Mathematics Research via AI Formal Proof Search

25 maj•21 min

Pi - and self-modifying AI Agents

22 maj•19 min

Code with Claude - London 2026

22 maj•24 min

Google I/O 2026 keynote

20 maj•23 min

The Langchain Agent Development Keynote 2026

20 maj•20 min

Building the Software Factory: From Code to Autonomy

19 maj•22 min

Spec-Driven Development and Agentic Workflows in 2026

15 maj•21 min

Efficient Pre-Training with Token Superposition

14 maj•22 min

Build Wiz AI Show med Build Wiz AI finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.