AI coding agents taught robots to install GPUs and tie cable bindings

New research reported by Ars Technica shows that current-generation AI coding agents can directly supervise physical robots and train them to install GPUs, tie cable bindings, mount racks and perform other data-centre maintenance tasks. The study marks a new stage at the intersection of robotic learning and large language model agents.
The team involved researchers from Boston Dynamics, Stanford and Anthropic. In the setup, Boston Dynamics' Atlas robot was directed by an AI coding agent through natural-language commands. The agent generated code that translated robotic arm movements into a low-level control layer.
The task set was drawn from real-world data-centre maintenance: inserting a new GPU card into a server slot, tying cable bundles with zip-ties, swapping drives and adjusting rack rails. These tasks require fine motor skills that human technicians take years to learn. Earlier robotics research has mostly focused on simpler "pick-and-place" tasks.
In the experiments, the robot trained by the AI agent achieved 87% on GPU installation, 81% on cable binding and 79% on drive swap. By comparison, a conventional robot trained under human supervision reached 91-93% on similar tasks but required 12 times more training time.
The real novelty is that the robot was operating without a pre-defined task plan during the supervision. The AI agent worked out how to solve the task step-by-step through a loop of natural-language planning, code generation and observation-then-adjust. After a failed attempt, the agent could read the error message and rewrite the code.
In the agent infrastructure, the team tested Claude and GPT-4 class language models. Anthropic's Claude model achieved a 6% higher success rate on tasks requiring multi-step planning; GPT-4 class models were faster in code generation. The results suggest both models are capable enough for robot control flow.
The data-centre industry is taking an interest in this kind of automation because of its labour shortage. The Uptime Institute's 2025 report says the sector is short more than 450,000 technicians globally. As AI and hyperscale data-centres are built, the gap is forecast to reach a million by 2030.
The researchers say automation will not replace human technicians in the near term but can take on routine maintenance work. Professor Chelsea Finn of Stanford told Ars Technica: "Our goal is to allow human technicians to focus on complex problems that require their expertise; a routine GPU swap can be picked up by an AI agent."
There are safety concerns. If an AI agent acts outside its authority and issues a wrong command in a data-centre environment, the risk of damaging billions of dollars' worth of equipment is real. The research team said the experiments included a physical "safety kill" controller and that the agent's range of motion was defined in advance.
The next phase is to run the experiment in an actual hyperscale data centre. Microsoft Azure and Google Cloud are reported to be part of the test programme. Scaling up will involve more complex environmental conditions, broader hardware variety and longer-term reliability tests. Results are expected to publish in early 2027.
Read next

Tens of thousands of Fortinet firewalls allegedly hacked in global corporate breach
Cybercriminals claim to have breached tens of thousands of Fortinet firewalls used by major companies around the world, TechCrunch reports. If verified, the incident would be one of the largest corporate security breaches in recent years.

Pew survey: two-thirds of Americans say AI is advancing too quickly
A new Pew Research Center survey finds that while 64% of Americans see possible benefits of artificial intelligence, 66% believe the technology is advancing faster than it can be governed. The survey covered 5,000 adults.

Useful quantum error correction by 2028? Amazon and QuEra's bold roadmap
Amazon Web Services and the neutral-atom quantum computing company QuEra have released a joint roadmap saying useful quantum error correction could be reached by 2028. The roadmap shortens earlier industry estimates.

Midjourney Medical pivots from cat images to full-body ultrasound scans
Best known as a consumer AI image generator, Midjourney has unveiled Midjourney Medical, a full-body ultrasound scanning system that has secured FDA clearance. The company used more than 18 million anonymised clinical scans to train the model.

Apple's 2027 roadmap rumors: AirPods with cameras for AI and a second folding iPhone
The Verge has aggregated fresh Apple supply-chain rumors pointing to two flagship products for 2027: a new AirPods generation with camera modules for AI and a second-generation folding iPhone.
