Abstract: This paper investigates the linear quadratic optimal output feedback control problem for an unknown linear continuous-time system. Combined with adaptive dynamic programming and optimal ...
Abstract: Recent work has found that two dataset characteristics, coverage and data quality, are critical for offline reinforcement learning (RL). To improve the policy, ...