My project was titled "Drone Collision Avoidance Navigation with Reinforcement Learning." For this summer work I received the Bell Labs Summer Intern Award for Outstanding Innovation, signed by Nishant Batra, Chief Strategy and Technology Officer, on August 12, 2021.
I had a valuable opportunity to intern with the Mathematics & Algorithms Research Group at Bell Labs Research in Murray Hill, New Jersey, USA, where I was advised by Dr. Matthew Andrews, Karina Palyutina, and Máté Hell. During this summer 2021 internship, I had a tremendous learning experience interacting with various researchers and attending lab talks. The focus of my internship was applying deep reinforcement learning algorithms to autonomous drone navigation for obstacle avoidance in Microsoft AirSim, a physics-based simulator.
Reinforcement learning (RL) algorithms typically learn sequential decision-making policies by training in a simulator. Over the last several years, deep reinforcement learning (DRL) has produced breakthroughs in games, robotics, healthcare, and many other applications. In the early days of DRL, AlphaGo famously defeated Go world champion Lee Sedol in 2016, and the field has since come far, providing insights for healthcare applications such as AlphaFold2's work on protein folding. Robotics problems are high-dimensional and complex, yet DRL has recently shown tremendous promise for both stationary and mobile robots; much potential remains untapped, which opens up promising research directions such as improving sample efficiency and building end-to-end systems.
I focused on a problem related to Unmanned Aerial Vehicles (UAVs), or drones: mobile robots with six degrees of freedom. I worked with Microsoft AirSim, a physics-based simulator that provides a near-real-world drone equipped with many sensors, including a first-person camera view, depth images, collision sensing with obstacles, and other realistic sensory information.
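As a rough illustration of how this sensory information can be read, the sketch below uses AirSim's Python client to grab a depth image and the collision state. The camera name and image parameters here are generic assumptions, not the exact configuration from my project.

```python
import numpy as np
import airsim

# Connect to the running AirSim simulator and take API control of the drone.
client = airsim.MultirotorClient()
client.confirmConnection()
client.enableApiControl(True)
client.armDisarm(True)
client.takeoffAsync().join()

# Request a depth image from the front camera ("0").
# pixels_as_float=True returns per-pixel depth values in meters.
responses = client.simGetImages([
    airsim.ImageRequest("0", airsim.ImageType.DepthPerspective, True, False)
])
depth = airsim.list_to_2d_float_array(
    responses[0].image_data_float, responses[0].width, responses[0].height
)

# Collision sensing: has_collided becomes True when the drone hits an obstacle.
collision = client.simGetCollisionInfo()
print("min depth ahead (m):", np.min(depth), "| collided:", collision.has_collided)
```

In an RL setup, readings like these become the agent's observation at every step, and the collision flag can be used to terminate an episode.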
My problem statement was to address obstacle collision avoidance in a given environment using RL algorithms. The problem is high-dimensional: the environment's state space is vast, and the drone has six degrees of freedom of motion, which makes this a challenging research problem to tackle. Additionally, RL algorithms rely on reward functions to learn good sequential decision-making policies, so reward shaping and design is yet another challenge to overcome. During this internship, I experimented with various sensors, such as depth images, to tackle the vast state space, and with various distance-based reward criteria to tackle the reward-shaping challenge, while keeping the drone's motion simple in order to focus on these sub-problems. One of the autonomous solutions is shown in the drone's first-person-view video playing on the left side of this post.
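To give a flavor of what a distance-based reward might look like, here is a minimal sketch; the specific weights, the safety threshold, and the goal-progress and clearance terms are illustrative assumptions rather than the exact criteria used in the project.

```python
def distance_based_reward(prev_dist_to_goal, dist_to_goal, min_obstacle_depth, collided,
                          collision_penalty=-100.0, progress_weight=10.0,
                          clearance_threshold=2.0, clearance_weight=1.0):
    """One illustrative distance-based reward for collision-avoidance navigation.

    prev_dist_to_goal / dist_to_goal: distance to the goal before / after the action (m).
    min_obstacle_depth: smallest value in the current depth image (m), i.e. the
                        distance to the nearest visible obstacle.
    collided: collision flag reported by the simulator.
    """
    if collided:
        return collision_penalty  # large terminal penalty for hitting an obstacle

    # Reward progress toward the goal (positive when the drone got closer).
    reward = progress_weight * (prev_dist_to_goal - dist_to_goal)

    # Penalize flying too close to obstacles, scaled by how far inside
    # the safety threshold the drone currently is.
    if min_obstacle_depth < clearance_threshold:
        reward -= clearance_weight * (clearance_threshold - min_obstacle_depth)

    return reward
```

In a training loop, the goal distances would be computed from the drone's position before and after each action, and the obstacle depth from the depth image, as in the earlier sketch.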
Apart from my core research, I also took part in activities such as Friday lunches and technical talks hosted by Bell Labs, which were very insightful and useful. My mentors at Nokia Bell Labs were resourceful, responsive, and supportive, and really made the work environment fun!