Ujava.org reinforcement-learning

download Ujava.org reinforcement-learning

If you can't read please download the document

Transcript of Ujava.org reinforcement-learning

  1. 1. Reinforcement Learning ujava.org Workshop 2015-06-27 www.idosi.com CEO Shindong KANG ()
  2. 2. www.idosi.comujava.org
  3. 3. www.idosi.comspaceapi.org
  4. 4. www.idosi.comReinforcement Learning for Brick Game
  5. 5. www.idosi.comReinforcement Learning for Brick Game
  6. 6. www.idosi.comTo Flip Pancake
  7. 7. www.idosi.comCrawling Robot on Carpet
  8. 8. www.idosi.comPavlov's Dog
  9. 9. www.idosi.comPavlov
  10. 10. www.idosi.comReinforcement ()
  11. 11. www.idosi.comMarkov Chain
  12. 12. www.idosi.comMarkov Process
  13. 13. www.idosi.comMarkov Decision Process (MDP))
  14. 14. www.idosi.comNon-Deterministic Search
  15. 15. www.idosi.comGrid World
  16. 16. www.idosi.comGoal
  17. 17. www.idosi.comAction
  18. 18. www.idosi.comMDP
  19. 19. www.idosi.comMarkov Property
  20. 20. www.idosi.comPolicy
  21. 21. www.idosi.comOptimal Policy
  22. 22. www.idosi.comRacing's Probability
  23. 23. www.idosi.comRacing's Reward
  24. 24. www.idosi.comSearch Tree
  25. 25. www.idosi.comQ-state
  26. 26. www.idosi.comDiscounting
  27. 27. www.idosi.comDiscounting
  28. 28. www.idosi.comPolicy with Discouting
  29. 29. www.idosi.comDiscouting Factor
  30. 30. www.idosi.comDiscouting Factor
  31. 31. www.idosi.comDiscouting Factor
  32. 32. www.idosi.comReinforcement
  33. 33. www.idosi.comSum of Rewards
  34. 34. www.idosi.comOptimal Quantities
  35. 35. www.idosi.comValues of States
  36. 36. www.idosi.comMDP
  37. 37. www.idosi.comMDP
  38. 38. www.idosi.comMDP
  39. 39. www.idosi.comMDP
  40. 40. www.idosi.comMDP
  41. 41. www.idosi.comMDP
  42. 42. www.idosi.comMDP
  43. 43. www.idosi.comMDP
  44. 44. www.idosi.comReinforcement Learning
  45. 45. www.idosi.comMDP of all infos
  46. 46. www.idosi.comRL of no infos
  47. 47. www.idosi.comMDP vs. RL
  48. 48. www.idosi.comModel-Based Learning (RL)
  49. 49. www.idosi.comObserved Episodes
  50. 50. www.idosi.comLearned Model
  51. 51. www.idosi.comDirect Evaluation
  52. 52. www.idosi.comProblems with Direct Evaluation
  53. 53. www.idosi.comTemporal Difference Learning
  54. 54. www.idosi.comTemporal Difference Learning
  55. 55. www.idosi.comTemporal Difference Learning
  56. 56. www.idosi.comExpoential Moving Average
  57. 57. www.idosi.comQ-Value Iteration
  58. 58. www.idosi.comQ-Learning
  59. 59. www.idosi.comQ-Learning Demo
  60. 60. Thank you ! () Intelligent City Ltd. Shindong KANG www.idosi.com [email protected]