reinforcement learning
(Q830687)
type of machine learning where an agent learns how to behave in an environment by performing actions and receiving rewards or penalties in return, aiming to maximize the cumulative reward over time
type of machine learning where an agent learns how to behave in an environment by performing actions and receiving rewards or penalties in return, aiming to maximize the cumulative reward over time
Language:
Current Data About
reinforcement learning
(P10) |
A-novel-approach-to-locomotion-learning-Actor-Critic-architecture-using-central-pattern-generators-Movie1.ogv
|
||||
(P31) |
(Q111862379)
(Q130609847) |
||||
(P279) |
(Q2539)
|
||||
(P361) |
(Q2539)
|
||||
(P373) |
Reinforcement learning
|
||||
(P461) |
(Q123916004)
|
||||
(P910) |
(Q87071489)
|
||||
(P1343) |
(Q133280541)
|
||||
(P1482) |
https://stackoverflow.com/tags/reinforcement-learning
https://ai.stackexchange.com/tags/reinforcement-learning |
||||
(P2179) |
10010261
|
||||
(P8687) |
30816
|
other details
description | type of machine learning where an agent learns how to behave in an environment by performing actions and receiving rewards or penalties in return, aiming to maximize the cumulative reward over time |
External Links