Tuning heuristics and convergence analysis of reinforcement learning algorithm for online data-based optimal control design. Research, Society and Development, [S. l.], v. 9, n. 2, p. e188922128, 2020. DOI: 10.33448/rsd-v9i2.2128. Disponível em: https://ojs34.rsdjournal.org/index.php/rsd/article/view/2128. Acesso em: 29 jun. 2025.