Hey! I am Pujith Kachana, a Robotics Masters student at Carnegie Mellon University. I am advised by Dr. Ji Zhang and Dr. Wenshan Wang, and affiliated with the Field Robotics Center and the AirLab. I am primarily interested in 3D-grounded perception and reasoning, and my current work is focused on 3D scene understanding and visual-language navigation.
I am currently a Research Intern at Wayve, working on 3D foundation models for driving. Previously, I have worked at Amazon Robotics as an Applied Science Intern and a Software Development Co-op.
Before my time at CMU, I obtained my BS in Computer Science from Georgia Tech, where I worked with Dr. Danfei Xu on deformable manipulation and Dr. Ada Gavrilovska on secure visual servoing.
Ultimately, I want embodied agents to be able to understand and reason about the physical world in an inherently 3D manner, much like humans. My goal is to develop agents that not only understand 3D geometry and semantics but can also ground and fuse multimodal concepts, such as language, into the 3D world. Specific research topics of interest include 4D reconstruction, scene flow, and visual-language reasoning.
Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction
Preprint
Samuel Li*, Pujith Kachana*, Prajwal Chidananda, Saurabh Nair, Yasutaka Furukawa, Matthew Brown
[ARXIV]
VLA-3D: A Dataset for 3D Semantic Scene Understanding and Navigation
RSS 2024, SemRob Workshop
Haochen Zhang, Nader Zantout, Pujith Kachana, Zongyuan Wu, Ji Zhang, Wenshan Wang
Neural Field Dynamics Model for Granular Object Piles Manipulation
CoRL 2023
ICRA 2023, Representing and Manipulating Deformable Objects Workshop
(Oral Presentation, Best Paper Finalist)
Shangjie Xue, Shuo Cheng, Pujith Kachana, Danfei Xu
[ARXIV] [VIDEO] [PROJECT PAGE]
Persistent Pick: Enhanced Grasping with Tactile Feedback
AMLC 2023, Robot Learning Workshop
(Oral Presentation)
Pujith Kachana, Nathalie Hager, Taskin Padir