Hitesh Golchha

I am currently in an Applied Scientist at Amazon. I have work and research experience in the fields of NLP, Conversational AI (language generation and understanding), Reinforcement Learning and Representation Learning from companies like Amazon, Flipkart and research labs at UMass Amherst, IIT Patna and Bar Ilan University.

I love making educational content on my YouTube Channel ExplainingML. Some videos include LLM papers (Deepseekv3, Reformer) or proofs of RL algorithms for LLM Post Training (GRPO/Dr GRPO, DPO, PPO, TRPO, Policy Gradients, Key concepts in RL) as a Playlist.

Email / Google Scholar / Twitter / Github / YouTube /

Publications

	Language Guided Exploration for RL Agents in Text Environments Hitesh Golchha, Sahil Yerawar, Dhruvesh Patel, Soham Dan, Keerthiram Murugesan NAACL 2024 Findings* ,2024 Language-Guided Exploration (LGE) injects language priors into sparse-reward RL by using a GUIDE model to rank and prune actions for an EXPLORER agent. GUIDE is trained via contrastive learning to prefer task-relevant actions, dramatically shrinking the effective decision space. On ScienceWorld, this yields faster learning and higher returns than vanilla RL, Behavior Cloning, and Text Decision Transformers.
	Courteously Yours: Inducing courteous behavior in Customer Care responses using Reinforced Pointer Generator Network Hitesh Golchha, Mauajama Firdaus, Asif Ekbal, Pushpak Bhattacharyya NAACL-HLT ,2019 We develop a system that transforms a generic customer care response into a 'Courteous' one. The model takes into context the emotions and content of the conversation history as well while generating and is trained on MLE+RL.
	A Deep Multi-task Model for Dialogue Act Classification, Intent Detection and Slot Filling Mauajama Firdaus, Hitesh Golchha, Asif Ekbal, Pushpak Bhattacharyya Cognitive Computation ,2020 We compare multi-task approaches with standalone and pipeline ones for Spoken Language Understanding tasks on ATIS, TRAINS and FRAMES datasets.
	Helping each Other: A Framework for Customer-to-Customer Suggestion Mining using a Semi-supervised Deep Neural Network Hitesh Golchha, Deepak Gupta, Asif Ekbal, Pushpak Bhattacharyya International Conference on Natural Language Processing (ICON),2018 We develop a linguistically informed, self-supervised model to identify customer-to-customer suggestions in Hotel and Electronics review datasets.

Work Experience

	Amazon (Seattle, USA) Applied Scientist, Jan 2024 - now As an Applied Scientist at Amazon, I lead several ML projects for Amazon’s Cross-Border (Global Store) business, which enables international product discovery and purchasing across countries and marketplaces. My work spans Agentic systems, Deal-forecasting models, large-scale retrieval, Scalable Product Clustering deployed in production and directly supporting multi-million-dollar revenue streams.
	JP Morgan Chase and Co. Senior Associate - Data Science MRGR, July 2023 - Jan 2024 Built LLM-powered QA, summarization, and ML review systems for 400+ Model Risk & Governance (MRGR) users using LLaMA-13B with advanced RAG pipelines (section-aware chunking, hybrid parent-child retrieval, reranking, map-reduce and refine summarization). Also evaluated Information Extraction, OCR, operations and Fraud Detection Models in Consumer Investment Banking for Firmwide risks.
	Amazon Applied Scientist Intern, June 2022 - Aug 2022 Probing Product representations from vision, text, and multi-modal models to find Products having similar export-eligibility. Also made contributions to other ML projects within the team performing diverse subset sampling.
	Flipkart Software Development Engineer (Data Science), April 2019 - April 2021 Multiple research tasks in the Conversational AI: Intent detection, slot filling, multi-intent sentence segmentation, data augmentation, ASR beam rescoring. Challenges included handling multilinguality, class imbalance, miscalibration, augmentation and robustness. The voice assistants we developed are used by millions of Indians for E-commerce shopping every day. I also participated in hackathons on unsupervised reverse image search and question answering.

Research Experience

	IESL, UMass Amherst Graduate Student Researcher, Spring 2022 - June 2023 – Text-based Games using RL : Using LLMs, RL and Contrastive Learning for playing text-based science games. – Energy based models for Multilabel Classification : Exploring architectures for Energy Based Models for Multilabel Classification which can model contextual label dependencies. Worked with researchers from IESL lab under Prof. McCallum and IBM Research.
	AI-NLP-ML Lab, IIT Patna Junior Research Fellow, July - Dec 2018 Research in the domain of conversational agents: NLU and Style Transfer in Response Generation. Advisor: Prof. Asif Ekbal and Prof. Pushpak Bhattacharyya.
	NLP Lab, Bar Ilan University Research Internship,May-Jul 2017 Research in Coreference Resolution of Propositions. I got an exposure to several tasks in NLP associated with creation of Natural Language Knowledge Graphs. Advisor: Prof. Ido Dagan.
	AI-NLP-ML Lab, IIT Patna Undergradute Researcher, May 2016 - May 2018 Research in Suggestion Mining, Question Answering Advisor: Prof. Asif Ekbal and Prof. Pushpak Bhattacharyya.

Please checkout Jon Barron's website if you want to use this Layout.