Random Balance between Monte Carlo and Temporal Difference in off-policy Reinforcement Learning for Less Sample-Complexity
Chayoung Kim, Seohee Park, Woosik Lee, Journal of Internet Computing and Services, Vol. 21, No. 5, pp. 1-7, Oct. 2020
Keywords: Deep Q-Network, Temporal Difference, Monte Carlo, Reinforcement Learning, Variation and Bias Balance
Abstract
Statistics
Show / Hide Statistics
Statistics (Past 3 Years)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.
Statistics (Past 3 Years)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.
|
|
Cite this article
[APA Style]
Kim, C., Park, S., & Lee, W. (2020). Random Balance between Monte Carlo and Temporal Difference in off-policy Reinforcement Learning for Less Sample-Complexity. Journal of Internet Computing and Services, 21(5), 1-7. DOI: 10.7472/jksii.2020.21.5.1.
[IEEE Style]
C. Kim, S. Park, W. Lee, "Random Balance between Monte Carlo and Temporal Difference in off-policy Reinforcement Learning for Less Sample-Complexity," Journal of Internet Computing and Services, vol. 21, no. 5, pp. 1-7, 2020. DOI: 10.7472/jksii.2020.21.5.1.
[ACM Style]
Chayoung Kim, Seohee Park, and Woosik Lee. 2020. Random Balance between Monte Carlo and Temporal Difference in off-policy Reinforcement Learning for Less Sample-Complexity. Journal of Internet Computing and Services, 21, 5, (2020), 1-7. DOI: 10.7472/jksii.2020.21.5.1.

