Design and Methods

0. High Level Architecture

1. perception

Perception/semantic understanding

RGB image processing / Binary mask generation

Distance Estimation / Center Dectection / 3D Pose Estimation

2. navigation

We created a framework for commanding the robot to perform moving actions in conjunction with the vision team. These included moving forwards and backwards and rotating clockwise and counterclockwise and stopping. Each motion could be executed at a velocity specified by the caller.

The framework would allow communication with the vision team via a ROS2 topic subscription to search for the block, and once detected, navigate towards it. Upon reaching the block, the robot would pick it up, search for the bin, and navigate towards it.

By implementing the navigation pipeline in this modular fashion, we were able to test the software separately and integrate it cleanly once it was working reliably.

3. manipulation

4. CHALLENGES & SOLUTIONS

4.1 Overcoming Long-Distance IK Failures via Waypoint Routing

Challenge

The numerical IK solver exhibited a significantly high failure rate when attempting to reach objects located far from the robot's current position. Over these long Cartesian distances, the solver frequently became trapped in local minima, failing to converge on a valid joint configuration regardless of the position tolerance.

Solution

Implemented a "Hover Point" (approach waypoint) strategy. By first routing the end-effector to a safe, elevated position directly above the target, the long trajectory was broken down into shorter, highly solvable segments. This approach bypassed the local minima traps, dramatically improving both the IK success rate and the overall reaching reliability.

4.2 re-grasp Collision Management

Challenge

During the final approach, the gripper's orientation adjustments would frequently knock over or hit the target object.

Solution

Adjusted the manipulation logic to strictly pre-align the gripper's orientation at the predefined hover point. The final approach was restricted to a straight vertical Cartesian descent, ensuring a clean and collision-free grasp.

4.3 Debugging Hardcoded Initial Pose Override

Challenge

The robot frequently exhibited erratic, unpredictable movements. After spending considerable time debugging what seemed like a complex planning issue, real-time monitoring of the joint space (/joint_states) revealed a sneaky bug: the IK solver was continuously pulling a hardcoded initial pose from the launch file instead of using the robot's actual physical configuration.

Solution

Removed the hardcoded dependency and explicitly mapped the live joint state data to be used as the starting seed for the IK solver, immediately restoring smooth and predictable motion.

4.4 Precision vs. IK Solvability Trade-off Mitigation

Challenge

The numerical IK solver exhibits a significant trade-off: tightening the position tolerance (ik_dt) improves Cartesian accuracy for distant targets but dramatically increases the IK failure rate (convergence fails in local minima traps). Conversely, relaxing the tolerance improves IK solvability but often leads to real-world grasp failures due to excessive positional error at the target object.

Solution

Implemented an "Adaptive Dynamic Tolerance" retry loop. The manipulation logic was adjusted to adopt a relaxed tolerance for the initial attempt, prioritizing a valid, collision-free trajectory over perfect precision. Upon grasp verification failure, subsequent attempts automatically tighten the tolerance (threshold reduction), iteratively enhancing positional accuracy until a successful enclosed grasp is confirmed.

4.5 Grasp Verification

Challenge

Reliably confirming a successful grasp is critical for autonomous state transitions. Without dedicated force or tactile sensors, the robot must distinguish between an Empty Grasp (where the fingers close completely) and a Successful Grasp (where an object physically blocks the fingers from closing).

Solution

We implemented a software-based check that monitors the discrepancy between the Gripper Command and the actual Joint State.

The Logic: If the system sends a CLOSE command but the encoders report a state that is NOT CLOSED, the system identifies that an object is present.
Thresholding: A small tolerance is used to account for encoder noise, ensuring that any state significantly different from "fully closed" is registered as a successful pick.

Verification Workflow

Command: Send CLOSE signal to the gripper.
Wait: Allow time for the gripper to reach its physical limit.
Compare: * If Command == State (Both Closed) -> Missed Grasp. Trigger retry loop.
- If Command != State (Blocked) -> Grasping Detected. Proceed to Navigation.

Page updated

Report abuse