Skip to content
chinese-model-helps-humanoid-robots-adapt-to-tasks-without-training

Chinese model helps humanoid robots adapt to tasks without training

Chinese team unveils a novel data-efficient system that gives robots human-like object handling skills.

Researchers from Wuhan University have developed a new framework that could help robots manipulate objects more easily. Introduced in a new paper on arXiv, this approach should enable humanoid robots to grasp and handle a greater variety of objects than is currently possible.

At present, humanoid robots are great at tasks like using tools, grasping, and walking, but they suffer from inherent limitations. In most cases, they can fail tasks when an object changes shape or when lighting changes.

They can also struggle completing tasks the robot hasn’t been specifically trained to do. It is this lack of generalization that is widely seen as one of the technology’s major limitations.

To help overcome this, the Wuhan team set out to develop what it calls the recurrent geometric-prior multimodal policy, RGMP for short. This framework is designed to help humanoid robots have a kind of in-built common sense about things like shapes and space.

It also provides robots with a means to better select required skills for a task, and a more data-efficient way to learn movement patterns.

Making humanoid robots more generalized

The goal of it, ultimately, is to help robots pick the right action and adapt in new environments with far less training data than before. According to the team, RGMP consists of two main key parts.

The first is called the Geometric-Prior Skill Selector (GSS), which helps the robot decide which of its “tools” and skills is best suited to a task. Using things like its cameras, the robot can use GSS to work out an object’s shape, size, and orientation.

With this information in hand (so to speak), the robot can then work out what needs to be done to complete a given task (i.e, pick up, push, grip, hold with two hands, etc.).

The second is called Adaptive Recursive Gaussian Network (ARGN). Once the robot picks a skill, the ARGN helps the robot actually perform the task. It achieves this by modelling spatial relationships between the robot and the object

It can also help predict movements step-by-step, and is extremely data-efficient (needs far fewer training examples than typical deep learning methods).

This combination of ARGN and GSS helps robots better complete tasks without needing thousands of demonstrations and training. In testing, robots using the framework were able to achieve an impressive 87% success rate in novel tasks that the robots had no experience in completing.

Dramatic improvement over the competition

The team also found that the framework is around 5 times more data-efficient than current diffusion-policy-based models (which are currently state-of-the-art). This is impressive and could be very important in the future.

If robots can reliably manipulate objects without being retrained for each new situation, they can actually be used in tasks like helping around the home to clean, tidy, and perhaps even cook.

It will also take humanoid robots to the next level for tasks in places like warehouses. restaurants, and manufacturing. Looking ahead, the team now wants to expand RGMP so robots can learn new tasks with almost no human teaching.

They also plan to help RGMP infer the correct motion for completely new objects on their own and automatically generate task-specific motion patterns. “Our future research will focus on enhancing the RGMP framework’s ability to generalize across a wider variety of tasks,” study lead author Xuetao Li explains.

“We also plan to explore the automatic inference of task-specific action trajectories, enabling robots to infer manipulations for new objects based on minimal human input or prior knowledge, further eliminating the need for exhaustive teaching in dynamic environments,” he added.

You can view the study for yourself in the journal arXiv.

Recommended Articles

The Blueprint

Get the latest in engineering, tech, space & science – delivered daily to your inbox.

Christopher graduated from Cardiff University in 2004 with a Masters Degree in Geology. Since then, he has worked exclusively within the Built Environment, Occupational Health and Safety and Environmental Consultancy industries. He is a qualified and accredited Energy Consultant, Green Deal Assessor and Practitioner member of IEMA. Chris’s main interests range from Science and Engineering, Military and Ancient History to Politics and Philosophy.

colind88

Back To Top