Dhawal Gupta

I am currently pursuing my master's degree in the Department of Computing Science at the University of Alberta. I am in my first year, so I am still exploring and updating my value function :P

I completed my bachelor's degree in Computer Science and Engineering at the Indian Institute of Technology (IIT) Patna, India. As an undergrad, I did my thesis under the supervision of Dr Sriparna Saha and Prof. Pushpak Bhattacharyya. My research has focused on using Hierarchical Reinforcement Learning to develop decision-making policies for chatbots that can function generically across multiple domains, tasks and languages.

Email : dhawal [at] ualberta [dot] ca

GitHub  /  Google Scholar  /  LinkedIn  /  CV (available on request)

Latest News
[Aug 19]: Joined the University of Alberta as a master's student.
[Aug 19]: Graduated from IIT Patna.
[Apr 19]: Manuscript communicated to ACM TALLIP on Multi-Intent Multilingual Chatbot.
[Apr 19]: Manuscript communicated to Interspeech on Multi-Intent Dialogue Systems.
[Aug 18]: Completed my internship at IBM Research.
[Jul 18]: Granted the INAE Travel Grant to attend ASME IDETC.
[Jul 18]: Paper accepted at ICONIP 2018.
[Apr 18]: Paper accepted at the 42nd Mechanisms and Robotics Conference 2018 (ASME IDETC).


I'm interested in Robotics and Artificial Intelligence, specifically in the use of Reinforcement Learning algorithms for modelling control and decision-making policies in robots. I wish to work towards self-adaptability of agents in different environments, making them more robust to the noise and stochasticity of their surroundings.

A Generic Dialogue Manager using Reinforcement Learning in a Multilingual Multi-Intent Multi-Domain Setting
Dhawal Gupta
Bachelor's Thesis, 2019. Submitted in partial fulfillment of the B.Tech degree at IIT Patna.
Thesis / Slides / Poster / Code (not yet public)

Reinforcement Learning based Dialogue Management Strategy
Tulika Saha, Dhawal Gupta, Sriparna Saha, Pushpak Bhattacharyya
25th International Conference on Neural Information Processing (ICONIP), 2018

Introduces a novel MDP that makes the model resilient to NLU (Natural Language Understanding) failures, and learns a dialogue strategy policy in a task-oriented setting, particularly for airline-centric databases.

Bayesian Optimization based Terrestrial Gait Tuning for a 12-DOF Alligator-Inspired Robot with Active Body Undulation
Krishna Agrawal, Kushagra Jain, Dhawal Gupta, Raunak Srivastav, Abhijeet Agnihotri, Atul Thakur
42nd Mechanisms and Robotics Conference (MR) ASME IDETC/CIE, 2018

Fabrication, control and gait tuning of an alligator-inspired robot whose design is based on a quadruped robot with an active spine. Achieved an improvement of 1.93


IBM Research

I interned at IBM Research India from May 2018 to August 2018 under the mentorship of Dr Kedar Kulkarni. I worked in the Operations Research Team on multi-asset clustering and on developing an accurate and resource-efficient distance metric for clustering.


Indraprastha Institute of Information Technology, Delhi

I interned at IIIT Delhi from May 2017 to July 2017 under the mentorship of Dr Saket Anand and Dr Sanjit Kaul. I worked on extracting reward functions for autonomous car steering using an Inverse Reinforcement Learning framework. Here is a brief report.


Smart Containers
Zenin Easa, Dhawal Gupta, Jimson Mathew, 2017
video / code / webapp / poster

Prototype of an IoT product named Smart Containers that monitors food consumption using a minimal number of sensors. This project was presented at Intel HEC and ISED 2016.


ABU Asia-Pacific Robot Contest (Robocon)
Ashwin Goyal, Dhawal Gupta, Atul Thakur (Mentor), 2017

Robocon is an annual Asia-Pacific robotics competition. I was the founder and vice-captain of the IIT Patna Robocon team.


Using GANs to generate photo-realistic fundus images manifesting diabetic retinopathy
Dhawal Gupta, Raghav Jindal, Rushikesh Pedganokar
code / report

Generated disease-manifested fundus images using DC-GAN. This was part of the CS551 Deep Learning course, and our project was adjudged first among 26 projects.


Fuzzy Controller for Inverted Pendulum
Abhishek Agrawal, Raghav Jindal, Sahil Sharma, Dhawal Gupta
code / report

Built a fuzzy controller for an inverted pendulum that works on different profiles, using the weighted centroid method to calculate the appropriate current for balancing the pendulum.


Simulation of Unix File System
Dhawal Gupta, Sahil Sharma, Tarun Garg, Ashutosh Drolia

Built a codebase in C/C++ that simulates Linux file system commands in memory, as part of the CS341 Operating Systems Lab.


Implementation of Distributed Hash Table (Chord Protocol)
Dhawal Gupta

An implementation of the Chord protocol in Python that handles random node additions and drops in the P2P network. Deployable across different virtual machines.


Analysis of Twitter Network
Dhawal Gupta

Course project for CS544 Network Science. This project finds the influential people in a Twitter network based on a set of tweets related to a particular event, in this case the discovery of the Higgs boson.

Interests and Hobbies

When I am not busy, I like doing the following:

  • Reading (see what I am reading).
  • Playing video games.
  • Tinkering with stuff.
  • Watching anime and reading manga.
Website layout from here