Dhawal Gupta

I am currently pursuing my masters degree in the Department of Computing Science at the University of Alberta. I am in my first year, so I am still exploring and updating my value function :P

I completed my bachelors degree in Computer Science and Engineering fromĀ Indian Institute of Technology (IIT), Patna, India. While as an undergrad, I did my thesis under the supervision of Dr Sriparna Saha andĀ Prof. Pushpak Bhattacharyya. My research work has been focused on using Hierarchical Reinforcement Learning to develop decision-making policies for chatbots, that can function in a generic manner for multiple domains, tasks and languages.


Email : dhawal [at] ualberta [dot] ca

GitHub  /  Google Scholar  /  LinkedIn  /  CV (available on request)

Avatar
Latest News
[Aug 19]: Joined University of Alberta as a Masters Student
[Aug 19]: Graduated from IIT Patna.
[Apr 19]: Manuscript communicated to ACM TALLIP on Multi-Intent Multi Lingual Chatbot.
[Apr 19]: Manuscript communicated to Interspeech on Multi Intent Dialogue Systems.
[Aug 18]: Completed my internship at IBM Research.
[Jul 18]: Granted the INAE Travel Grant to attend ASME IDETC.
[Jul 18]: Paper accepted in ICONIP 2018.
[Apr 18]: Paper accepted in 42nd Mechanism and Robotics 2018 (ASME IDETC).

Research

I'm interested in Robotics and Artificial Intelligence, specifically in the use of Reinforcement Learning Algorithms for modelling control and decision making policies in Robots. I wish to work towards self-adaptability of agents in different environments, making them more robust to noise and stochasticity of their surroundings.

A Generic Dialogue Manager using Reinforcement Learning in a Multilingual Multi-Intent Multi-Domain Setting
Dhawal Gupta
Bachelors Thesis, 2019 Submitted in partial fulfillment of the B.Tech degree at IIT Patna.
Thesis / Slides / Poster / Code (not yet public)

Reinforcment Learning based Dialogue Management Strategy
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya,
25th International Conference on Neural Information Processing (ICONIP), 2018
code

Introduces a novel MDP, which makes the model resilient to NLU (Natural Languague Understanding) failures. A policy for dialogue strategy in a task oriented setting particularly for airline-centric databases.

Bayesian Optimization based Terrestrial Gait Tuning for a 12-DOF Alligator-Inspired Robot with Active Body Undulation
Krishna Agrawal, Kushagra Jain, Dhawal Gupta, Raunak Srivastav, Abhijeet Agnihotri, Atul Thakur,
42nd Mechanisms and Robotics Conference (MR) ASME IDETC/CIE, 2018
video

Fabrication, control and gait tuning of an alligator inspired robot having design based on a quadruped robot with an active spine. Achieved an improvement of 1.93

Experience
prl

IBM Research

I interned at IBM Research India from May 2018 to August 2019 under the mentorship of Dr Kedar Kulkarni. I worked in the Operations Research Team, on Multi-Asset clustering and developing accurate and resource economic distance metric for clustering.

prl

Indraprastha Institute of Information Technology, Delhi

I interned at IIIT Delhi from May 2017 to july 2017, under the mentorship of Dr Saket Anand and Dr Sanjit Kaul. I worked on extracting reward functions for autonomous car steering, using Inverse Reinforcement Learning framework. Here is a brief report.

Competitions
prl

Smart Containers
Zenin Easa, Dhawal Gupta, Jimson Mathew, 2017
video / code / webapp / poster

Prototype of an IoT product named smart containers that can be used to monitor the food consumption, using minimal number of sensors. This project was presented in Intel HEC and ISED 2016.

prl

ABU Asia-Pacific Robot Contest (Robocon)
Ashwin Goyal, Dhawal Gupta, Atul Thakur (Mentor), 2017
video

Robocon is an annual Asia Pacific Competition. I was Founder and Vice Captain of the Robocon Team for IIT Patna.

Projects
prl

Using GANs to generate photo-realistic fundus images manifesting diabetic retinopathy
Dhawal Gupta, Raghav Jindal , Rushikesh Pedganokar
code/ report


Task of generating disease-manifested fundus images using DC-GAN. This was a part of CS551 Deep Learning course and our project was adjudged first among a total of 26 projects.

prl

Fuzzy Controller for Inverted Pendulum
Abhishek Agrawal, Raghv Jindal, Sahil Sharma, Dhawal Gupta
code/ report


Built a fuzzy controller for a inverted pendulum which works on different profile using the weighted centroid method for appropriate current calulcation for balancing the pendulum

prl

Simulation of Unix File System
Dhawal Gupta, Sahil Sharma, Tarun Garg, Ashutosh Drolia
code


Built a codebase to simulate the command in-memory for a Linux file system using C/C++ as part of the CS341 Operating Systems Lab

prl

Implementation of Distributed Hash Table (Chord Protocol)
Dhawal Gupta
code


An implementation of Chord protocol in Python with the facility to handle random adds and drops in the P2P network. Deployable on different Virtual Machines.

prl

Analysis of Twitter Network
Dhawal Gupta
code


Course Project for CS544 Network Science. This project finds the influential people in a Twitter network based on a set of tweets related to a particular event. In our case, it is the discovery of the Higgs Boson particle.

Interests and Hobbies

When I am not busy I like doing the following stuff

  • Reading (See what I am reading).
  • I like to play Video Games.
  • Tinkering with stuff.
  • Watching anime and reading manga.
Website layout from here