An Experimental Process Validation of Reinforcement Learning in Machine Learning
Amulya Sakhamuru
Amulya Sakhamuru, Bachelor of Technology in Computer Science, Sri Sarathi Institute Engineering of Technology, JNTU Kakinada, Andhra Pradesh, India.
Manuscript received on 03 June 2019 | Revised Manuscript received on 07 June 2019 | Manuscript published on 30 June 2019 | PP: 2140-2142 | Volume-8 Issue-8, June 2019 | Retrieval Number: H7532068819/19©BEIESP
Open Access | Ethics and Policies | Cite | Mendeley | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: The lifelong learning of reusable skills could be a vital demand for embodied agents that act during a complicated, dynamic setting and are visage with completely different tasks over their lifespan. To deal with the question of how an associate degree agent will be trained helpful skills resourcefully throughout a biological process amount, i.e., once no job is an obligation for him and no external reward is provided. Learning of skills during a biological process must be progressive and self-motivated. We tend to find a brand new progressive, task- independent talent discovery approach that’s suited to constant domains. Additionally, the agent learns actual skills supported by intrinsic motivational mechanisms that settle on that skills learning is decisive at a given purpose in time. We tend to estimate the approach during a reinforcement learning setup in 2 continuous domains with complicated dynamics. We tend to expect that associate degree as such driven, talent learning agent outperforms associate degree agent that learn task solutions from scratch. Besides, we tend to match up to completely different intrinsic motivation mechanisms and the way they economically create use of the agent’s biological process amount. In this paper we tend to present primary results from a process study of as such driven reinforcement learning designed at permitting artificial agents to construct and extend hierarchies of reusable skills that are required for knowledgeable independence. However despite the quality inherent within the planet, humans are still capable of constructing predictions concerning however the globe behaves and victimization this info to create selections. To grasp however, we tend to think about how humans learn to play games.
Keywords: Reinforcement Learning Stratified Reinforcement Learning, Intrinsic Motivation, and Reusable Skills.
Scope of the Article: Machine Learning