|
Hitesh Golchha
I am currently in an Applied Scientist at Amazon.
I have work and research experience in the fields of NLP, Conversational AI (language
generation and understanding), Reinforcement Learning and Representation Learning from
companies like Amazon, Flipkart and research labs at UMass Amherst, IIT Patna and Bar Ilan
University.
I love making educational content on my YouTube Channel ExplainingML. Some videos include LLM papers (Deepseekv3, Reformer) or proofs of RL algorithms for LLM Post Training (GRPO/Dr GRPO, DPO, PPO, TRPO, Policy Gradients, Key concepts in RL) as a Playlist.
Email  / 
Google Scholar  / 
Twitter  / 
Github  / 
YouTube  / 
|
|
|
Language Guided Exploration for RL Agents in Text Environments
Hitesh Golchha*, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan
NAACL 2024 Findings ,2024
Language-Guided Exploration (LGE) injects language priors into sparse-reward RL by using a GUIDE model to rank and prune actions for an EXPLORER agent.
GUIDE is trained via contrastive learning to prefer task-relevant actions, dramatically shrinking the effective decision space.
On ScienceWorld, this yields faster learning and higher returns than vanilla RL, Behavior Cloning, and Text Decision Transformers.
|
|
Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network
Hitesh Golchha*, Mauajama Firdaus*, Asif Ekbal, Pushpak Bhattacharyya
NAACL-HLT ,2019
We develop a system that transforms a generic customer care response into a 'Courteous' one.
The model takes into context the emotions and content of the conversation history as well while generating and is trained on MLE+RL.
|
|
A Deep Multi-task Model for Dialogue Act Classification, Intent Detection and Slot Filling
Mauajama Firdaus, Hitesh Golchha, Asif Ekbal, Pushpak Bhattacharyya
Cognitive Computation ,2020
We compare multi-task approaches with standalone and pipeline ones for Spoken Language Understanding tasks on ATIS, TRAINS and FRAMES datasets.
|
|
Helping each Other: A Framework for Customer-to-Customer Suggestion Mining using a Semi-supervised Deep Neural Network
Hitesh Golchha, Deepak Gupta, Asif Ekbal, Pushpak Bhattacharyya
International Conference on Natural Language Processing (ICON),2018
We develop a linguistically informed, self-supervised model to identify customer-to-customer suggestions in Hotel and Electronics review datasets.
|
|
Amazon (Seattle, USA)
Applied Scientist, Jan 2024 - now
As an Applied Scientist at Amazon, I lead several ML projects for Amazon’s Cross-Border (Global Store) business, which enables international product discovery and purchasing across countries and marketplaces. My work spans Agentic systems, Deal-forecasting models, large-scale retrieval, Scalable Product Clustering deployed in production and directly supporting multi-million-dollar revenue streams.
|
|
JP Morgan Chase and Co.
Senior Associate - Data Science MRGR, July 2023 - Jan 2024
Built LLM-powered QA, summarization, and ML review systems for 400+ Model Risk & Governance (MRGR) users using LLaMA-13B with advanced RAG pipelines (section-aware chunking, hybrid parent-child retrieval, reranking, map-reduce and refine summarization). Also evaluated Information Extraction, OCR, operations and Fraud Detection Models in Consumer Investment Banking for Firmwide risks.
|
|
Amazon
Applied Scientist Intern, June 2022 - Aug 2022
Probing Product representations from vision, text, and multi-modal models to find Products having similar export-eligibility.
Also made contributions to other ML projects within the team performing diverse subset sampling.
|
|
Flipkart
Software Development Engineer (Data Science), April 2019 - April 2021
Multiple research tasks in the Conversational AI: Intent detection, slot filling, multi-intent sentence segmentation, data augmentation, ASR beam rescoring. Challenges included handling
multilinguality, class imbalance, miscalibration, augmentation and robustness. The voice assistants we developed are used by millions of Indians for E-commerce shopping every day.
I also participated in hackathons on unsupervised reverse image search and question answering.
|
|
IESL, UMass Amherst
Graduate Student Researcher, Spring 2022 - June 2023
– Text-based Games using RL : Using LLMs, RL and Contrastive Learning for playing text-based science games.
– Energy based models for Multilabel Classification : Exploring architectures for Energy Based Models for
Multilabel Classification which can model contextual label dependencies.
Worked with researchers from IESL lab under Prof. McCallum and IBM Research.
|
|
AI-NLP-ML Lab, IIT Patna
Junior Research Fellow, July - Dec 2018
Research in the domain of conversational agents: NLU and Style Transfer in Response Generation.
Advisor: Prof. Asif Ekbal and
Prof. Pushpak Bhattacharyya.
|
|
NLP Lab, Bar Ilan University
Research Internship,May-Jul 2017
Research in Coreference Resolution of Propositions.
I got an exposure to several tasks in NLP associated with creation of Natural Language Knowledge Graphs.
Advisor: Prof. Ido Dagan.
|
|
AI-NLP-ML Lab, IIT Patna
Undergradute Researcher, May 2016 - May 2018
Research in Suggestion Mining, Question Answering
Advisor: Prof. Asif Ekbal and
Prof. Pushpak Bhattacharyya.
|
Please checkout Jon Barron's website if you want to use this Layout.
|