Mallick, P., Chen, Z., & Zamani, M. (2022). Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics. Neurocomputing (Amsterdam), 484, 79-88. https://doi.org/10.1016/j.neucom.2021.01.142
Chicago Style (17th ed.) Citation: Mallick, Prakash, Zhiyong Chen, and Mohsen Zamani. "Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics." Neurocomputing (Amsterdam) 484 (2022): 79-88. https://doi.org/10.1016/j.neucom.2021.01.142.
MLA (9th ed.) Citation: Mallick, Prakash, et al. "Reinforcement Learning Using Expectation Maximization Based Guided Policy Search for Stochastic Dynamics." Neurocomputing (Amsterdam), vol. 484, 2022, pp. 79-88, https://doi.org/10.1016/j.neucom.2021.01.142.