I am currently pursuing my masters degree in the Department of Computing Science at the University of Alberta. I am in my first year, so I am still exploring and updating my value function :P
I completed my bachelors degree in Computer Science and Engineering from Indian Institute of Technology (IIT), Patna, India. While as an undergrad, I did my thesis under the supervision of Dr Sriparna Saha and Prof. Pushpak Bhattacharyya.
My research work has been focused on using Hierarchical Reinforcement Learning to develop decision-making policies for chatbots, that can function in a generic manner for multiple domains, tasks and languages.
[Aug 19]: Joined University of Alberta as a Masters Student
[Aug 19]: Graduated from IIT Patna.
[Apr 19]: Manuscript communicated to ACM TALLIP on Multi-Intent Multi Lingual Chatbot.
[Apr 19]: Manuscript communicated to Interspeech on Multi Intent Dialogue Systems.
[Aug 18]: Completed my internship at IBM Research.
[Jul 18]: Granted the INAE Travel Grant to attend ASME IDETC.
[Jul 18]: Paper accepted in ICONIP 2018.
[Apr 18]: Paper accepted in 42nd Mechanism and Robotics 2018 (ASME IDETC).
I'm interested in Robotics and Artificial Intelligence, specifically in the use of Reinforcement Learning Algorithms for modelling control and decision making policies in Robots. I wish to work towards self-adaptability of agents in different environments, making them more robust to noise and stochasticity of their surroundings.
Introduces a novel MDP, which makes the model resilient to NLU (Natural Languague Understanding) failures. A policy for dialogue strategy in a task oriented setting particularly for airline-centric databases.
Fabrication, control and gait tuning of an alligator inspired robot having design based on a quadruped robot with an active spine. Achieved an improvement of 1.93
I interned at IBM Research India from May 2018 to August 2019 under the mentorship of Dr Kedar Kulkarni. I worked in the Operations Research Team, on Multi-Asset clustering and developing accurate and resource economic distance metric for clustering.
Indraprastha Institute of Information Technology, Delhi
I interned at IIIT Delhi from May 2017 to july 2017, under the mentorship of Dr Saket Anand and Dr Sanjit Kaul. I worked on extracting reward functions for autonomous car steering, using Inverse Reinforcement Learning framework. Here is a brief
Course Project for CS544 Network Science. This project finds the influential people in a Twitter network based on a set of tweets related to a particular event. In our case, it is the discovery of the Higgs Boson particle.
Interests and Hobbies
When I am not busy I like doing the following stuff