SMASH-G: A System for Modeling, Analyzing, and Synthesizing Hand Gestures

Nam Hee Gordon Kim

Tim Straubinger

Raw Data

ok02.mp4

OK

thumbs_up01.mp4

Thumbs up

paper02.mp4

Paper

scissors00.mp4

Scissors

call_me01.mp4

Call me

lets_drink00.mp4

Let's drink

Captured & Rendered into MANO Model

535p-ok.mp4

OK (captured)

535p-thumbs-up.mp4

Thumbs up (captured)

535p-paper.mp4

Paper (captured)

535p-scissors.mp4

Scissors (captured)

535p-call-me.mp4

Call me (captured)

535p-lets-drink.mp4

Let's drink (captured)

We note that the gestures call me and let's drink are not captured very well. For call me, the inherent occlusion involved in rotating the hand in the roll axis may be acting against the 3D hand pose estimator's performance. In let's drink, the inverse kinematic confuses the pinky with the ring finger.

Synthesized & Rendered into MANO Model

535p-ok-generated0.mp4

OK (synthesized)

535p-thumbs-up-generated0.mp4

Thumbs up (synthesized)

535p-paper_generated0.mp4

Paper (synthesized)

535p-scissors-generated.mp4

Scissors (synthesized)

535p-call-me-generated.mp4

Call me (synthesized)

535p-lets-drink-generated.mp4

Let's drink (synthesized)

The gestures call me and let's drink are yet again failure cases in synthesized trajectories. However, looking at the synthesized results reveals something interesting. The gesture call me seems to show a cyclic structure as the fingers open and close towards the end of the animation. This might be largely due to the roll-axis rotation, which throws off the joint estimation until occlusion is cleared. Let's drink, on the other hand, captures the extension of the pinky joints, albeit all other joints are involved. In all cases, the learned dynamics model is encapsulating some information about the motions of the gestures, showing that each model is a feasible representation of gestures, provided enough data is given for accurate training.

Synthesized with Varying Speeds

535p-paper-half.mp4

delta=0.5

535p-paper-full.mp4

delta=1.0

535p-paper-oneandhalf.mp4

delta=1.5