Programming through demonstration – robot flipping pancakes

By Damir Beciri

11 August 2010

If you haven’t learned to flip pancakes yet, here is a robot that will put you to shame. Acquiring new motor skills involves various forms of learning, and the efficiency of the process lies in how imitation and self-improvement strategies are combined. A team of researchers from the Italian Institute of Technology is developing algorithms that enable robots to acquire skills after being shown how to perform them.

Dr. Sylvain Calinon, Dr. Petar Kormushev and Professor Darwin G. Caldwell from the Italian Institute of Technology in Genova, Italy, are working on programming through demonstration, using a technique called kinesthetic teaching in which a person physically guides a robot arm through the motion. The motion is encoded in a mixture of basis force fields through an extension of Dynamic Movement Primitives (DMP) that represents the synergies across the different variables through stiffness matrices. An inverse dynamics controller with variable stiffness is used for reproduction.
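To make the idea of a "mixture of basis force fields" more concrete, here is a minimal sketch in Python. It is not the researchers' implementation: the function names, the Gaussian time-based activations, the constant damping term and all numeric values are assumptions chosen for illustration. Each basis field is a virtual spring that pulls the state toward an attractor through a full stiffness matrix, whose off-diagonal terms couple the variables.

```python
import math


def weights(t, centers, width):
    """Normalized Gaussian activations of the basis fields at time t (assumed form)."""
    raw = [math.exp(-0.5 * ((t - c) / width) ** 2) for c in centers]
    total = sum(raw)
    return [r / total for r in raw]


def matvec(matrix, vector):
    """Plain matrix-vector product, to keep the sketch dependency-free."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def rollout(x0, attractors, stiffness, centers, width=0.2,
            damping=40.0, dt=0.002, duration=2.0):
    """Integrate acc = sum_i h_i(t) * K_i * (mu_i - x) - damping * v."""
    x = list(x0)
    v = [0.0] * len(x0)
    path = [tuple(x)]
    for k in range(int(round(duration / dt))):
        h = weights(k * dt, centers, width)
        acc = [0.0] * len(x)
        for h_i, mu, K in zip(h, attractors, stiffness):
            # Off-diagonal entries of K couple the variables (the "synergies").
            pull = matvec(K, [m - xi for m, xi in zip(mu, x)])
            acc = [a + h_i * p for a, p in zip(acc, pull)]
        acc = [a - damping * vi for a, vi in zip(acc, v)]
        v = [vi + dt * a for vi, a in zip(v, acc)]
        x = [xi + dt * vi for xi, vi in zip(x, v)]
        path.append(tuple(x))
    return path


# Hypothetical 2-D example: two attractors sharing one coupled stiffness matrix.
K = [[400.0, 100.0], [100.0, 400.0]]
path = rollout((0.0, 0.0), [(0.5, 0.2), (1.0, -0.3)], [K, K], centers=[0.4, 1.2])
print(path[-1])
```

In this toy version the state is drawn to the first attractor early in the motion and to the second one later, as the activations hand over. Changing the entries of each stiffness matrix changes how strongly, and along which combined directions, the motion is pulled.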

The skill is first demonstrated via kinesthetic teaching, and then refined by the Policy learning by Weighting Exploration with the Returns (PoWER) algorithm. Unlike policy-gradient approaches, PoWER treats the reward as a pseudo-probability, which allows Reinforcement Learning (RL) to use probabilistic estimation methods such as Expectation-Maximization (EM). The researchers' video shows a 7-DOF Barrett WAM manipulator learning to flip pancakes by RL.
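The reward-weighted idea can be illustrated with a small Python sketch. This is a simplified, PoWER-style update under stated assumptions, not the published algorithm or the authors' MATLAB code: the toy reward, the parameter names and the choice to resample rollouts at every iteration (rather than reuse the best rollouts from all past trials) are made up for the example. The key point is that rewards are non-negative and act as weights on the exploration noise, so no gradient or learning rate is needed.

```python
import math
import random


def reward(theta, target):
    """Toy return in (0, 1]; a stand-in for the real task reward (assumption)."""
    return math.exp(-sum((a - b) ** 2 for a, b in zip(theta, target)))


def power_style_update(theta, target, rng, n_rollouts=20, n_best=5, sigma=0.3):
    """One update: average the exploration noise, weighted by the returns."""
    rollouts = []
    for _ in range(n_rollouts):
        eps = [rng.gauss(0.0, sigma) for _ in theta]
        trial = [t + e for t, e in zip(theta, eps)]
        rollouts.append((reward(trial, target), eps))
    # Keep only the highest-return rollouts and use their returns as weights.
    best = sorted(rollouts, key=lambda r: r[0], reverse=True)[:n_best]
    total = sum(r for r, _ in best)
    return [t + sum(r * eps[i] for r, eps in best) / total
            for i, t in enumerate(theta)]


rng = random.Random(0)
target = [1.0, -1.0]   # hypothetical optimum, unknown to the learner
theta = [0.0, 0.0]     # would come from the demonstration in practice
for _ in range(50):
    theta = power_style_update(theta, target, rng)
print(theta, reward(theta, target))
```

Here the starting parameters play the role of the policy obtained from the demonstration, and each loop iteration plays the role of a batch of trials on the robot.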

After 50 trials, the robot learns that the first part of the task requires stiff behavior to throw the pancake into the air, while the second part requires the hand to be compliant in order to catch the pancake without it bouncing off the pan.

In the experiments presented here, imitation learning is used as an initialization phase, and afterwards RL is used to explore for better solutions. Both processes could, however, be interlaced. Depending on the user’s availability, the user could occasionally participate in the evaluation of new policies explored by the robot. For example, the user can manually give reward or punishment signals to the RL module, or provide new examples in case the robot’s improvement is too slow. The researchers plan to consider such interaction in their future work.

For more information (and the MATLAB source code of the algorithm), visit the publication page of the paper “Robot Motor Skill Coordination with EM-based Reinforcement Learning”.

This entry was posted on Wednesday, Aug 11th, 2010 at 8:00PM and filed under Robotics.

Tags: italian institute of technology, pancakes, programming through demonstration, Reinforcement Learning, robot arm, robots, self improvement

3 Comments

Tom

Aug 13th, 2010 at 6:03AM
This example is only an application of the method presented in
http://www.robot-learning.de
and on
http://www.youtube.com/watch?v=qtqubguikMk
It’s cool to see that the method used to learn Ball-in-a-cup can also be used for flipping pancakes!

Dr.A.Jagadeesh

Aug 19th, 2010 at 3:50AM
Do we really need a robot to make pancakes? Can’t we do it ourselves? In the name of robots, we are only promoting laziness in some cases.
Dr.A.Jagadeesh, Nellore (AP), India

robot builder

Jan 31st, 2011 at 11:19PM
I believe it’s more about the principles, not about the general use of robots in pancake making.
