Researchers at the University of Pennsylvania’s School of Engineering and Applied Science have identified significant security flaws in several AI-powered robotic platforms. These vulnerabilities raise serious concerns about the safety and security of integrating large language models (LLMs) with physical robots.
According to George Pappas, UPS Foundation Professor of Transportation in Electrical and Systems Engineering, “Our work shows that, at this moment, large language models are just not safe enough when integrated with the physical world.”
The research team developed RoboPAIR, an algorithm designed to exploit vulnerabilities in LLM-controlled robots. Unlike existing attacks targeting chatbots, RoboPAIR specifically focuses on eliciting harmful physical actions from robots. This includes platforms like those being developed by Boston Dynamics and Toyota Research Institute (TRI).
RoboPAIR demonstrated a 100% success rate in compromising three popular robotics research platforms: the Unitree Go2 quadrupedal robot, the Clearpath Robotics Jackal unmanned ground vehicle, and the Dolphins LLM simulator for autonomous vehicles. Within days, the algorithm bypassed safety protocols and gained full control, enabling researchers to command the platforms to perform dangerous actions, such as ignoring traffic signals.
The researchers emphasized the broader implications of these findings. “Our results reveal, for the first time, that the risks of jailbroken LLMs extend far beyond text generation, given the distinct possibility that jailbroken robots could cause physical damage in the real world,” they stated in their research.
While collaborating with platform developers to address these vulnerabilities, the researchers caution that these security issues are systemic. Co-author Vijay Kumar stressed the importance of a safety-first approach, stating, “We must address intrinsic vulnerabilities before deploying AI-enabled robots in the real world.”
First author Alexander Robey highlighted the critical role of AI red teaming, a practice involving rigorous testing for potential threats and vulnerabilities. “AI red teaming is essential for safeguarding generative AI systems,” Robey explained, “because once you identify the weaknesses, then you can test and even train these systems to avoid them.”
The research team’s findings underscore the urgent need for robust security measures in the development and deployment of AI-powered robots to mitigate potential risks and ensure public safety.