Enhancing question generation through diversity-seeking reinforcement learning with bilevel policy decomposition

Tianyu Ren, Hui Wang*, Karen Rafferty

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Recent advancements in question generation (QG) have been significantly propelled by reinforcement learning (RL). Although extensive reward models have been designed to capture the attributes of ideal questions, their associated learning challenges, particularly in sample efficiency and diversity, remain underexplored. This paper introduces a bilevel policy decomposition (BPD) framework and a diversity-seeking RL (DSRL) objective to address these issues. The BPD framework utilizes two cascading policies to divide QG into two more manageable sub-tasks: answer-centric summary generation and summary-augmented QG, facilitating exploration and accelerating policy learning. Concurrently, the DSRL objective preserves the inherent diversity of QG by ensuring the bilevel policies align probabilistically with their reward models rather than merely maximizing returns. Our integrated approach, named BPD-DSRL, demonstrates superior performance over existing baselines on multiple question quality and diversity metrics across various QG benchmarks.
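
The contrast drawn in the abstract between return maximization and probabilistic alignment with a reward model can be illustrated with a toy sketch. The abstract does not give the DSRL objective itself, so everything here is an assumption for illustration: the candidate rewards are hypothetical, and the "aligned" policy is modeled as a softmax over rewards with a hypothetical `temperature` parameter, which may differ from the formulation in the paper.

```python
import math


def reward_matching_distribution(rewards, temperature=1.0):
    """Assumed target policy: p(x) proportional to exp(r(x) / temperature)."""
    scaled = [r / temperature for r in rewards]
    peak = max(scaled)  # subtract the maximum for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    total = sum(weights)
    return [w / total for w in weights]


def greedy_distribution(rewards):
    """Return-maximizing policy: all probability mass on the best candidate."""
    best = max(range(len(rewards)), key=lambda i: rewards[i])
    return [1.0 if i == best else 0.0 for i in range(len(rewards))]


def entropy(probs):
    """Shannon entropy in nats; zero-probability entries contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


# Hypothetical rewards for four candidate questions about the same passage.
candidate_rewards = [2.0, 1.9, 1.8, 0.2]

matched = reward_matching_distribution(candidate_rewards)
greedy = greedy_distribution(candidate_rewards)

print("reward-matching policy:", [round(p, 3) for p in matched])
print("greedy policy:", greedy)
print("entropy (matched, greedy):", entropy(matched), entropy(greedy))
```

Under these assumptions, the greedy policy collapses onto a single question, while the reward-matching policy keeps probability on every near-optimal candidate and still ranks them by reward. This is the sense in which such an objective preserves diversity.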
Original language: English
Title of host publication: 39th AAAI Conference on Artificial Intelligence: Proceedings
Editors: Toby Walsh, Julie Shah, Zico Kolter
Publisher: Association for the Advancement of Artificial Intelligence (AAAI)
Pages: 25083-25091
ISBN (Electronic): 9781577358978
Publication status: Published - 11 Apr 2025
Event: 39th Annual AAAI Conference on Artificial Intelligence 2025 - Philadelphia, United States
Duration: 25 Feb 2025 to 04 Mar 2025
Conference number: 39
https://aaai.org/conference/aaai/aaai-25/

Publication series

Name: Proceedings of the AAAI Conference on Artificial Intelligence: AAAI-25 Technical Tracks 23
Publisher: AAAI Press
Number: 23
Volume: 39
ISSN (Print): 2159-5399
ISSN (Electronic): 2374-3468

Conference

Conference: 39th Annual AAAI Conference on Artificial Intelligence 2025
Abbreviated title: AAAI-25
Country/Territory: United States
City: Philadelphia
Period: 25/02/2025 to 04/03/2025
Keywords

  • Natural Language Processing (NLP)
  • question answering
  • generation
