Enhancing question generation through diversity-seeking reinforcement learning with bilevel policy decomposition

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)
59 Downloads (Pure)

Abstract

Recent advancements in question generation (QG) have been significantly propelled by reinforcement learning (RL). Although extensive reward models have been designed to capture the attributes of ideal questions, their associated learning challenges, particularly in sample efficiency and diversity, remain underexplored. This paper introduces a bilevel policy decomposition (BPD) framework and a diversity-seeking RL (DSRL) objective to address these issues. The BPD framework utilizes two cascading policies to divide QG into two more manageable sub-tasks: answer-centric summary generation and summary-augmented QG, facilitating exploration and accelerating policy learning. Concurrently, the DSRL objective preserves the inherent diversity of QG by ensuring the bilevel policies align probabilistically with their reward models rather than merely maximizing returns. Our integrated approach, named BPD-DSRL, demonstrates superior performance over existing baselines on multiple question quality and diversity metrics across various QG benchmarks.
Original languageEnglish
Title of host publication39th AAAI Conference on Artificial Intelligence: Proceedings
EditorsToby Walsh, Julie Shah, Zico Kolter
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI)
Pages25083-25091
Number of pages9
Volume39
Edition23
ISBN (Electronic)9781577358978
DOIs
Publication statusPublished - 11 Apr 2025
Event39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025 - Philadelphia, United States
Duration: 25 Feb 202504 Mar 2025

Publication series

NameProceedings of the AAAI Conference on Artificial Intelligence: AAAI-25 Technical Tracks 23
PublisherAAAI Press
Number23
Volume39
ISSN (Print)2159-5399
ISSN (Electronic)2374-3468

Conference

Conference39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025
Country/TerritoryUnited States
CityPhiladelphia
Period25/02/202504/03/2025

Bibliographical note

Publisher Copyright:
Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Keywords

  • Natural Language Processing (NLP)
  • question answering
  • generation

ASJC Scopus subject areas

  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Enhancing question generation through diversity-seeking reinforcement learning with bilevel policy decomposition'. Together they form a unique fingerprint.

Cite this