Tuning heuristics and convergence analysis of reinforcement learning algorithm for online data-based optimal control design. (2020). Research, Society and Development, 9(2), e188922128. https://doi.org/10.33448/rsd-v9i2.2128