PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

Original Source

ArXiv AI (cs.AI)

by Mahmoud Srewa, Praneetsai Iddamsetty, Mohammad Abdullah Al Faruque, Salma Elmalaki

arXiv:2606.12616v1

Abstract: Closed-loop driving simulators typically populate their environments with non-ego traffic agents that behave largely the same way, produced either by rule-based traffic managers or by learned models trained toward a single behavioral mode. Recent work introduces style variation through post-hoc labels on observational data or LLM-inferred reward weights, but these signals act as proxies for what a style should reward rather than demonstrations of humans explicitly asked to drive in that style. We introduce PersonaDrive, a pipeline that conditions a vision-language-action (VLA) driving agent on retrieved demonstrations from a style-instructed human driving dataset, in which participants drive CARLA leaderboard routes under aggressive, neutral, and conservative instructions on a driver-in-the-loop rig. The pipeline has three stages: (i) offline triplet mining over per-style human driving data using a combined image-text similarity score; (ii) training a lightweight retrieval head that fuses frozen visual features with a small control encoder over per-style databases; and (iii) fine-tuning a single VLA backbone to treat retrieved context points as in-context behavioral demonstrations during waypoint prediction. At inference, the same backbone is conditioned on any style by swapping which per-style database the retrieval head queries, so selecting a style requires no per-style retraining while enabling human-style, style-diverse non-ego agents for closed-loop simulation. On Bench2Drive, PersonaDrive (no style) improves the driving score by 4.6% over SimLingo and 2.5% over HiP-AD, and under style conditioning attains the highest driving score in every style within a roughly 2% band (its weakest style surpassing the strongest baseline, DMW, by 5.4%), while average speed and acceleration rise by 18% and 25% from the conservative to the aggressive instruction.
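The inference-time idea in the abstract, one shared model whose style is chosen by swapping the database it retrieves from, can be illustrated with a toy sketch. Everything here is an assumption made for illustration and is not the authors' code: the `Demo` layout, the `retrieve` and `predict_waypoints` functions, the two-dimensional embeddings, and the waypoint values are all hypothetical. Plain cosine similarity stands in for the learned retrieval head, and a simple average of retrieved waypoints stands in for the VLA backbone.

```python
from __future__ import annotations

import math
from dataclasses import dataclass


@dataclass
class Demo:
    """One stored human demonstration (hypothetical layout)."""
    embedding: list[float]                # scene feature vector
    waypoints: list[tuple[float, float]]  # future (x, y) positions the human drove


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def retrieve(databases: dict[str, list[Demo]], style: str,
             query: list[float], k: int = 1) -> list[Demo]:
    """Return the k demos most similar to the query from ONE style's database.

    Choosing `style` only changes which database is searched; the
    retrieval logic itself is shared across styles.
    """
    ranked = sorted(databases[style],
                    key=lambda d: cosine(query, d.embedding),
                    reverse=True)
    return ranked[:k]


def predict_waypoints(context: list[Demo]) -> list[tuple[float, float]]:
    """Placeholder for the shared backbone: average the retrieved waypoints.

    A real VLA model would also consume camera images and text; this
    stand-in only shows that the output follows the retrieved context.
    """
    n = len(context)
    horizon = len(context[0].waypoints)
    return [
        (sum(d.waypoints[t][0] for d in context) / n,
         sum(d.waypoints[t][1] for d in context) / n)
        for t in range(horizon)
    ]


# Toy per-style databases (made-up numbers): similar scenes, different driving.
DATABASES = {
    "aggressive": [
        Demo([1.0, 0.0], [(0.0, 4.0), (0.0, 9.0)]),
        Demo([0.0, 1.0], [(1.0, 3.0), (2.0, 6.0)]),
    ],
    "conservative": [
        Demo([1.0, 0.0], [(0.0, 2.0), (0.0, 4.0)]),
        Demo([0.0, 1.0], [(0.5, 1.5), (1.0, 3.0)]),
    ],
}

if __name__ == "__main__":
    scene = [0.9, 0.1]  # hypothetical embedding of the current scene
    for style in DATABASES:
        plan = predict_waypoints(retrieve(DATABASES, style, scene, k=1))
        print(style, plan)
```

In this sketch the same `scene` produces longer-reaching waypoints under the aggressive database than under the conservative one, with no change to the prediction function, which mirrors the claim that selecting a style requires no per-style retraining.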

Tags: AIAgent
