A new fourier Q operator network based reinforcement learning method for continuous action space decision-making in manufacturing

Yang Ni, Yingguang Li*, Changqing Liu, Yan Jin

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Downloads (Pure)

Abstract

The problems of continuous action space decision-making are widespread in industrial manufacturing. However, when dealing with these problems, existing reinforcement learning (RL) methods relies on a large number of training samples, which is always unacceptable given the limited availability or expensive nature of data, such as low-volume manufacturing. This paper proposes a new Fourier Q operator network (FQON) based RL method. The input of FQON is the expected state function and its output the Q-value function, and both functions take the action in RL as independent variables. The infinite-dimensional mapping between the function domains is established by a set of parameters that can be used with different discretization, which fixes the mapping complexity regardless of the action space resolution. By taking the advantages of the fast calculation using on Fourier kernel operator, the mapping complexity is highly reduced, and it enables that FQON can realize the decision-making in continuous action space using a small amount of training samples. Taking machining deformation control of an aero-engine casing as a case study, experimental results showed that FQON based RL method can control the deformation well with limited training samples.
Original languageEnglish
Article number102641
Number of pages12
JournalRobotics and Computer-Integrated Manufacturing
Volume86
Early online date11 Sept 2023
Publication statusPublished - 01 Apr 2024

Keywords

  • Fourier Q operator network
  • reinforcement learning
  • decision-making
  • continuous action space
  • machining deformation control

Fingerprint

Dive into the research topics of 'A new fourier Q operator network based reinforcement learning method for continuous action space decision-making in manufacturing'. Together they form a unique fingerprint.

Cite this