Decentralized LLM-Driven Coordination of Acoustic Robots for Contactless Object Manipulation
This paper presents a decentralized framework that combines large language models (LLMs) with acoustic mobile robots for contactless object manipulation. Using Whisper speech recognition, LLM semantic parsing, and JSON task scheduling, the system converts spoken commands into coordinated multi-robot actions. Experiments with two TurtleBot3-based acoustic robots achieved success rates of 96% for sequential, 86% for parallel, and 70% for synchronized tasks, showcasing the potential of LLM-driven automation for human-robot interaction.
Article intelligence
Key points
- A decentralized framework integrates LLMs with acoustic robots for contactless object manipulation via natural language commands.
- The system uses Whisper, LLM parsing, JSON-based task representation, and distributed scheduling to handle sequential, parallel, and synchronized tasks.
- Experiments on two TurtleBot3 robots with ultrasonic arrays achieved 96% success for sequential, 86% for parallel, and 70% for synchronized operations.
- The approach demonstrates potential for intuitive human-robot interaction in distributed robotic systems.
Why it matters
This matters because a decentralized framework integrates LLMs with acoustic robots for contactless object manipulation via natural language commands.
Technical impact
May affect model selection, inference cost, product capability, and evaluation benchmarks.
[2605.29378] Decentralized LLM-Driven Coordination of Acoustic Robots for Contactless Object Manipulation
[Submitted on 28 May 2026]
Title:Decentralized LLM-Driven Coordination of Acoustic Robots for Contactless Object Manipulation
View a PDF of the paper titled Decentralized LLM-Driven Coordination of Acoustic Robots for Contactless Object Manipulation, by Yingying Wang and 2 other authors
View PDF HTML (experimental)
Abstract:Natural language interfaces can simplify interaction with multi-robot systems, especially when non-expert users need to issue high-level commands. Acoustic manipulation using ultrasonic phased arrays also enables contactless object handling for applications such as healthcare, laboratory automation, and precision transport. However, combining large language models (LLMs) with distributed acoustic mobile robots remains underexplored. This paper presents a decentralized framework for natural language-driven coordination of acoustic robots for contactless object manipulation. The system converts spoken instructions into executable multi-robot task plans using Whisper-based speech recognition, LLM-based semantic parsing, structured JSON task representation, and distributed scheduling. The JSON schema encodes robot assignments, temporal dependencies, spatial constraints, and synchronization requirements for sequential, parallel, and synchronized execution. The system is implemented on two TurtleBot3-based acoustic robots, each equipped with an ultrasonic phased array for contactless object transport. Experiments were conducted in three scenarios: sequential execution, parallel multi-robot transport, and synchronized cooperative manipulation. The system achieved task success rates of 96 percent for sequential tasks, 86 percent for parallel execution, and 70 percent for synchronized collaborative transport. These results show that natural language commands can be transformed into distributed robot actions for contactless manipulation, highlighting the potential of LLM-driven automation for human-robot interaction in distributed robotic systems.
Comments: This paper has been accepted for publication in the Proceedings of the 2026 IEEE 22nd International Conference on Automation Science and Engineering (CASE 2026), August 17-21, 2026, Shenyang, China
Subjects:
Robotics (cs.RO)
Cite as: arXiv:2605.29378 [cs.RO]
(or arXiv:2605.29378v1 [cs.RO] for this version)
https://doi.org/10.48550/arXiv.2605.29378
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Narsimlu Kemsaram Dr [view email] [v1] Thu, 28 May 2026 05:27:13 UTC (5,449 KB)
Full-text links:
Access Paper:
View a PDF of the paper titled Decentralized LLM-Driven Coordination of Acoustic Robots for Contactless Object Manipulation, by Yingying Wang and 2 other authors
View PDF
HTML (experimental)
TeX Source
view license
Current browse context:
cs.RO
new | recent | 2026-05
Change to browse by:
cs
References & Citations
NASA ADS
Google Scholar
Semantic Scholar
Loading...
Data provided by:
Bibliographic Tools
Bibliographic and Citation Tools
Bibliographic Explorer Toggle
Bibliographic Explorer (What is the Explorer?)
Connected Papers Toggle
Connected Papers (What is Connected Papers?)
Litmaps Toggle
Litmaps (What is Litmaps?)
scite.ai Toggle
scite Smart Citations (What are Smart Citations?)
Code, Data, Media
Code, Data and Media Associated with this Article
alphaXiv Toggle
alphaXiv (What is alphaXiv?)
Links to Code Toggle
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub Toggle
DagsHub (What is DagsHub?)
GotitPub Toggle
Gotit.pub (What is GotitPub?)
Huggingface Toggle
Hugging Face (What is Huggingface?)
ScienceCast Toggle
ScienceCast (What is ScienceCast?)
Demos
Demos
Replicate Toggle
Replicate (What is Replicate?)
Spaces Toggle
Hugging Face Spaces (What is Spaces?)
Spaces Toggle
TXYZ.AI (What is TXYZ.AI?)
Related Papers
Recommenders and Search Tools
Link to Influence Flower
Influence Flower (What are Influence Flowers?)
Core recommender toggle
CORE Recommender (What is CORE?)
Author
Venue
Institution
Topic
About arXivLabs
arXivLabs: experimental projects with community collaborators
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)