Hitesh Golchha

I am currently in an Applied Scientist at Amazon. I have work and research experience in the fields of NLP, Conversational AI (language generation and understanding), Reinforcement Learning and Representation Learning from companies like Amazon, Flipkart and research labs at UMass Amherst, IIT Patna and Bar Ilan University.

I love making educational content on my YouTube Channel ExplainingML. Some videos include LLM papers (Deepseekv3, Reformer) or proofs of RL algorithms for LLM Post Training (GRPO/Dr GRPO, DPO, PPO, TRPO, Policy Gradients, Key concepts in RL) as a Playlist.

Email  /  Google Scholar  /  Twitter  /  Github  /  YouTube  / 

profile photo
Publications
clean-usnob Language Guided Exploration for RL Agents in Text Environments
Hitesh Golchha*, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan
NAACL 2024 Findings ,2024

Language-Guided Exploration (LGE) injects language priors into sparse-reward RL by using a GUIDE model to rank and prune actions for an EXPLORER agent. GUIDE is trained via contrastive learning to prefer task-relevant actions, dramatically shrinking the effective decision space. On ScienceWorld, this yields faster learning and higher returns than vanilla RL, Behavior Cloning, and Text Decision Transformers.


clean-usnob Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network
Hitesh Golchha*, Mauajama Firdaus*, Asif Ekbal, Pushpak Bhattacharyya
NAACL-HLT ,2019

We develop a system that transforms a generic customer care response into a 'Courteous' one. The model takes into context the emotions and content of the conversation history as well while generating and is trained on MLE+RL.


clean-usnob A Deep Multi-task Model for Dialogue Act Classification, Intent Detection and Slot Filling
Mauajama Firdaus, Hitesh Golchha, Asif Ekbal, Pushpak Bhattacharyya
Cognitive Computation ,2020

We compare multi-task approaches with standalone and pipeline ones for Spoken Language Understanding tasks on ATIS, TRAINS and FRAMES datasets.


clean-usnob Helping each Other: A Framework for Customer-to-Customer Suggestion Mining using a Semi-supervised Deep Neural Network
Hitesh Golchha, Deepak Gupta, Asif Ekbal, Pushpak Bhattacharyya
International Conference on Natural Language Processing (ICON),2018

We develop a linguistically informed, self-supervised model to identify customer-to-customer suggestions in Hotel and Electronics review datasets.


Work Experience
clean-usnob Amazon (Seattle, USA)
Applied Scientist, Jan 2024 - now

As an Applied Scientist at Amazon, I lead several ML projects for Amazon’s Cross-Border (Global Store) business, which enables international product discovery and purchasing across countries and marketplaces. My work spans Agentic systems, Deal-forecasting models, large-scale retrieval, Scalable Product Clustering deployed in production and directly supporting multi-million-dollar revenue streams.

clean-usnob JP Morgan Chase and Co.
Senior Associate - Data Science MRGR, July 2023 - Jan 2024

Built LLM-powered QA, summarization, and ML review systems for 400+ Model Risk & Governance (MRGR) users using LLaMA-13B with advanced RAG pipelines (section-aware chunking, hybrid parent-child retrieval, reranking, map-reduce and refine summarization). Also evaluated Information Extraction, OCR, operations and Fraud Detection Models in Consumer Investment Banking for Firmwide risks.

clean-usnob Amazon
Applied Scientist Intern, June 2022 - Aug 2022

Probing Product representations from vision, text, and multi-modal models to find Products having similar export-eligibility. Also made contributions to other ML projects within the team performing diverse subset sampling.

clean-usnob Flipkart
Software Development Engineer (Data Science), April 2019 - April 2021

Multiple research tasks in the Conversational AI: Intent detection, slot filling, multi-intent sentence segmentation, data augmentation, ASR beam rescoring. Challenges included handling multilinguality, class imbalance, miscalibration, augmentation and robustness. The voice assistants we developed are used by millions of Indians for E-commerce shopping every day. I also participated in hackathons on unsupervised reverse image search and question answering.

Research Experience
clean-usnob IESL, UMass Amherst
Graduate Student Researcher, Spring 2022 - June 2023

Text-based Games using RL : Using LLMs, RL and Contrastive Learning for playing text-based science games.
Energy based models for Multilabel Classification : Exploring architectures for Energy Based Models for Multilabel Classification which can model contextual label dependencies.
Worked with researchers from IESL lab under Prof. McCallum and IBM Research.

clean-usnob AI-NLP-ML Lab, IIT Patna
Junior Research Fellow, July - Dec 2018

Research in the domain of conversational agents: NLU and Style Transfer in Response Generation.
Advisor: Prof. Asif Ekbal and Prof. Pushpak Bhattacharyya.

clean-usnob NLP Lab, Bar Ilan University
Research Internship,May-Jul 2017

Research in Coreference Resolution of Propositions. I got an exposure to several tasks in NLP associated with creation of Natural Language Knowledge Graphs.
Advisor: Prof. Ido Dagan.

clean-usnob AI-NLP-ML Lab, IIT Patna
Undergradute Researcher, May 2016 - May 2018

Research in Suggestion Mining, Question Answering
Advisor: Prof. Asif Ekbal and Prof. Pushpak Bhattacharyya.

Please checkout Jon Barron's website if you want to use this Layout.