Abstract
Recent advancements in question generation (QG) have been significantly propelled by reinforcement learning (RL). Although extensive reward models have been designed to capture the attributes of ideal questions, their associated learning challenges, particularly in sample efficiency and diversity, remain underexplored. This paper introduces a bilevel policy decomposition (BPD) framework and a diversity-seeking RL (DSRL) objective to address these issues. The BPD framework utilizes two cascading policies to divide QG into two more manageable sub-tasks: answer-centric summary generation and summary-augmented QG, facilitating exploration and accelerating policy learning. Concurrently, the DSRL objective preserves the inherent diversity of QG by ensuring the bilevel policies align probabilistically with their reward models rather than merely maximizing returns. Our integrated approach, named BPD-DSRL, demonstrates superior performance over existing baselines on multiple question quality and diversity metrics across various QG benchmarks.
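The contrast the abstract draws between merely maximizing returns and aligning a policy probabilistically with its reward model can be illustrated with a toy example. The sketch below is not the paper's DSRL objective, which the abstract does not spell out. It assumes, purely for illustration, that "aligning probabilistically" means choosing each candidate question with probability proportional to `exp(reward / temperature)`. The function names, the `temperature` parameter, and the reward values are all hypothetical.

```python
import math

def return_maximizing_policy(rewards):
    """Put all probability mass on the single highest-reward candidate."""
    best = max(range(len(rewards)), key=lambda i: rewards[i])
    return [1.0 if i == best else 0.0 for i in range(len(rewards))]

def reward_proportional_policy(rewards, temperature=1.0):
    """Assign p_i proportional to exp(r_i / temperature) (illustrative assumption)."""
    top = max(rewards)  # subtract the max for numerical stability
    weights = [math.exp((r - top) / temperature) for r in rewards]
    total = sum(weights)
    return [w / total for w in weights]

def entropy(probs):
    """Shannon entropy in nats; higher means a more diverse policy."""
    return -sum(p * math.log(p) for p in probs if p > 0)

# Hypothetical reward-model scores for three candidate questions.
rewards = [2.0, 1.9, 0.5]
greedy = return_maximizing_policy(rewards)
matched = reward_proportional_policy(rewards)
```

Under this assumption, the return-maximizing policy always emits the same top-scoring question, while the reward-proportional policy keeps substantial mass on the near-tied second candidate, so `entropy(matched)` exceeds `entropy(greedy)`.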
Original language | English |
---|---|
Title of host publication | 39th AAAI Conference on Artificial Intelligence: Proceedings |
Editors | Toby Walsh, Julie Shah, Zico Kolter |
Publisher | Association for the Advancement of Artificial Intelligence (AAAI) |
Pages | 25083-25091 |
ISBN (Electronic) | 9781577358978 |
DOIs | |
Publication status | Published - 11 Apr 2025 |
Event | 39th Annual AAAI Conference on Artificial Intelligence 2025, Philadelphia, United States. Duration: 25 Feb 2025 → 04 Mar 2025. Conference number: 39. https://aaai.org/conference/aaai/aaai-25/ |
Publication series
Name | Proceedings of the AAAI Conference on Artificial Intelligence: AAAI-25 Technical Tracks 23 |
---|---|
Publisher | AAAI Press |
Number | 23 |
Volume | 39 |
ISSN (Print) | 2159-5399 |
ISSN (Electronic) | 2374-3468 |
Conference
Conference | 39th Annual AAAI Conference on Artificial Intelligence 2025 |
---|---|
Abbreviated title | AAAI-25 |
Country/Territory | United States |
City | Philadelphia |
Period | 25/02/2025 → 04/03/2025 |
Internet address |
Keywords
- Natural Language Processing (NLP)
- question answering
- generation
Activities
- 1 Oral presentation
Enhancing Question Generation through Diversity-Seeking Reinforcement Learning with Bilevel Policy Decomposition
Ren, T. (Presenter), Wang, H. (Advisor) & Rafferty, K. (Advisor)
02 Mar 2025
Activity: Talk or presentation types › Oral presentation