Results & Analysis
To evaluate the proposed constraints on the loss function and an attention mechanism, we prepared three types of baseline model. The first model, AE-LSTM, uses only constraints on the loss function; the second model, AE-LSTM, utilizes only the attention mechanism; and the third model, AE-LSTM without employing the proposed method, for comparing the success rate of cap-opening motions. Moreover, we quantitatively evaluate the partial success rate and complete success rate of cap-opening motions. Here are the definitions:
• Complete success: If the cap is opened and recognized, and the motion is immediately stopped, it is considered a complete success.
• Partial Success: If the cap-opening motion is successful but not recognized, resulting in the continued execution of the motion, it is considered a partial success.
The proposed AE-LSTM model (model I) using constraints on the loss function and an attention mechanism achieved the highest partial success rate and complete success rate. This indicates that the proposed method has acquired generalization performance for initial positions and objects. By imposing constraints on the loss function (model II), it was possible to switch to a stop motion immediately after the cap was opened, leading to an improvement in the complete success rate. On the other hand, for models (III and IV) without constraints, even if the cap was opened, they often continued the motion without switching to a stop. Additionally for model I and III, by focusing attention on appropriate modalities for each motion the accuracy of motion was enhanced, resulting in improved partial success rates for both sliding and cap- opening motions. Conversely, for models (II and IV) without the attention mechanism, the accuracy of sliding motions was low, leading to frequent failures in cap-opening due to the inability to move the object smoothly left and right. From those results it can be inferred that constraints on the loss function contributed to improving the complete success rate, while the attention mechanism contributed to increasing the partial success rate. Therefore, the proposed method, which combines both constraint and attention techniques, could achieve the most successful in-hand manipulation.