Class information for:
Level 1: REINFORCEMENT LEARNING//Q LEARNING//MARKOV DECISION PROCESSES

Basic class information

Class id #P Avg. number of
references
Database coverage
of references
5817 1571 30.7 33%



Bar chart of Publication_year

Last years might be incomplete

Hierarchy of classes

The table includes all classes above and classes immediately below the current class.



Cluster id Level Cluster label #P
1 4 ECONOMICS//EDUCATION & EDUCATIONAL RESEARCH//PSYCHOL 3876184
532 3       COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE//REINFORCEMENT LEARNING//ICCA JOURNAL 19644
1047 2             ICCA JOURNAL//REINFORCEMENT LEARNING//ICGA JOURNAL 10100
5817 1                   REINFORCEMENT LEARNING//Q LEARNING//MARKOV DECISION PROCESSES 1571

Terms with highest relevance score



rank Term termType Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 REINFORCEMENT LEARNING authKW 2182803 33% 22% 517
2 Q LEARNING authKW 454757 5% 27% 86
3 MARKOV DECISION PROCESSES authKW 371342 7% 19% 103
4 PERFORMANCE POTENTIAL authKW 357251 2% 74% 25
5 MULTIAGENT LEARNING authKW 312979 2% 44% 37
6 ACTOR CRITIC ALGORITHMS authKW 236917 1% 76% 16
7 POMDP authKW 228878 2% 34% 35
8 TEMPORAL DIFFERENCE LEARNING authKW 222309 2% 37% 31
9 REINFORCEMENT LEARNING RL authKW 146026 2% 28% 27
10 POLICY ITERATION authKW 145869 2% 24% 31

Web of Science journal categories



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 Computer Science, Artificial Intelligence 87675 61% 0% 963
2 Automation & Control Systems 17751 22% 0% 348
3 Robotics 11685 8% 0% 125
4 Computer Science, Cybernetics 3968 5% 0% 75
5 Operations Research & Management Science 1779 8% 0% 126
6 Computer Science, Theory & Methods 1697 10% 0% 151
7 Computer Science, Information Systems 753 6% 0% 98
8 Engineering, Electrical & Electronic 557 13% 0% 210
9 Computer Science, Interdisciplinary Applications 275 4% 0% 66
10 Mathematics, Applied 210 6% 0% 93

Address terms



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 INTELLIGENT NETWORKED SYST CFINS 73475 1% 34% 11
2 REINFORCEMENT LEARNING ARTIFICIAL INTELLIGENCE 58306 0% 100% 3
3 TEAM SEQUEL 51825 0% 67% 4
4 CORP TECHNOL INFORMAT COMMUN 4 38870 0% 100% 2
5 FIBO 38870 0% 100% 2
6 IND COMMERCE MANAGEMENT 38870 0% 100% 2
7 IMS MALIS GRP 34981 0% 60% 3
8 DISTRIBUTED SENSOR SYST GRP 25912 0% 67% 2
9 GUIDANCE IMAGING SOLUT 25912 0% 67% 2
10 MAIA PROJECT TEAM 25912 0% 67% 2

Journals



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
1 JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH 98235 4% 8% 67
2 MACHINE LEARNING 88308 5% 6% 75
3 AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS 66358 2% 9% 39
4 JOURNAL OF MACHINE LEARNING RESEARCH 55584 4% 4% 67
5 ADAPTIVE BEHAVIOR 20955 1% 5% 22
6 LECTURE NOTES IN ARTIFICIAL INTELLIGENCE 18589 8% 1% 126
7 ARTIFICIAL INTELLIGENCE 17955 3% 2% 46
8 DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS 14633 1% 4% 17
9 ROBOTICS AND AUTONOMOUS SYSTEMS 11994 2% 2% 37
10 IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 8687 2% 2% 30

Author Key Words



Rank Term Chi square Shr. of publ. in
class containing
term
Class's shr. of
term's tot. occurrences
#P with
term in
class
LCSH search Wikipedia search
1 REINFORCEMENT LEARNING 2182803 33% 22% 517 Search REINFORCEMENT+LEARNING Search REINFORCEMENT+LEARNING
2 Q LEARNING 454757 5% 27% 86 Search Q+LEARNING Search Q+LEARNING
3 MARKOV DECISION PROCESSES 371342 7% 19% 103 Search MARKOV+DECISION+PROCESSES Search MARKOV+DECISION+PROCESSES
4 PERFORMANCE POTENTIAL 357251 2% 74% 25 Search PERFORMANCE+POTENTIAL Search PERFORMANCE+POTENTIAL
5 MULTIAGENT LEARNING 312979 2% 44% 37 Search MULTIAGENT+LEARNING Search MULTIAGENT+LEARNING
6 ACTOR CRITIC ALGORITHMS 236917 1% 76% 16 Search ACTOR+CRITIC+ALGORITHMS Search ACTOR+CRITIC+ALGORITHMS
7 POMDP 228878 2% 34% 35 Search POMDP Search POMDP
8 TEMPORAL DIFFERENCE LEARNING 222309 2% 37% 31 Search TEMPORAL+DIFFERENCE+LEARNING Search TEMPORAL+DIFFERENCE+LEARNING
9 REINFORCEMENT LEARNING RL 146026 2% 28% 27 Search REINFORCEMENT+LEARNING+RL Search REINFORCEMENT+LEARNING+RL
10 POLICY ITERATION 145869 2% 24% 31 Search POLICY+ITERATION Search POLICY+ITERATION

Core articles

The table includes core articles in the class. The following variables is taken into account for the relevance score of an article in a cluster c:
(1) Number of references referring to publications in the class.
(2) Share of total number of active references referring to publications in the class.
(3) Age of the article. New articles get higher score than old articles.
(4) Citation rate, normalized to year.



Rank Reference # ref.
in cl.
Shr. of ref. in
cl.
Citations
1 BUSONIU, L , BABUSKA, R , DE SCHUTTER, B , (2008) A COMPREHENSIVE SURVEY OF MULTIAGENT REINFORCEMENT LEARNING.IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS. VOL. 38. ISSUE 2. P. 156 -172 30 86% 262
2 GOSAVI, A , (2009) REINFORCEMENT LEARNING: A TUTORIAL SURVEY AND RECENT ADVANCES.INFORMS JOURNAL ON COMPUTING. VOL. 21. ISSUE 2. P. 178 -192 43 73% 35
3 TAYLOR, ME , STONE, P , (2009) TRANSFER LEARNING FOR REINFORCEMENT LEARNING DOMAINS: A SURVEY.JOURNAL OF MACHINE LEARNING RESEARCH. VOL. 10. ISSUE . P. 1633-1685 22 85% 128
4 PIETQUIN, O , GEIST, M , (2013) ALGORITHMIC SURVEY OF PARAMETRIC VALUE FUNCTION APPROXIMATION.IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS. VOL. 24. ISSUE 6. P. 845 -867 23 85% 9
5 GRONDMAN, I , BUSONIU, L , LOPES, GAD , BABUSKA, R , (2012) A SURVEY OF ACTOR-CRITIC REINFORCEMENT LEARNING: STANDARD AND NATURAL POLICY GRADIENTS.IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS. VOL. 42. ISSUE 6. P. 1291-1307 26 72% 27
6 XU, X , ZUO, L , HUANG, ZH , (2014) REINFORCEMENT LEARNING ALGORITHMS WITH FUNCTION APPROXIMATION: RECENT ADVANCES AND APPLICATIONS.INFORMATION SCIENCES. VOL. 261. ISSUE . P. 1 -31 35 45% 24
7 OLIEHOEK, FA , SPAAN, MTJ , AMATO, C , WHITESON, S , (2013) INCREMENTAL CLUSTERING AND EXPANSION FOR FASTER OPTIMAL PLANNING IN DECENTRALIZED POMDPS.JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH. VOL. 46. ISSUE . P. 449-509 23 82% 3
8 MATIGNON, L , LAURENT, GJ , LE FORT-PIAT, N , (2012) INDEPENDENT REINFORCEMENT LEARNERS IN COOPERATIVE MARKOV GAMES: A SURVEY REGARDING COORDINATION PROBLEMS.KNOWLEDGE ENGINEERING REVIEW. VOL. 27. ISSUE 1. P. 1 -31 18 86% 13
9 YU, C , ZHANG, MJ , REN, FH , TAN, GZ , (2015) MULTIAGENT LEARNING OF COORDINATION IN LOOSELY COUPLED MULTIAGENT SYSTEMS.IEEE TRANSACTIONS ON CYBERNETICS. VOL. 45. ISSUE 12. P. 2853 -2867 18 82% 1
10 SNEL, M , WHITESON, S , (2014) LEARNING POTENTIAL FUNCTIONS AND THEIR REPRESENTATIONS FOR MULTI-TASK REINFORCEMENT LEARNING.AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS. VOL. 28. ISSUE 4. P. 637 -681 18 82% 0

Classes with closest relation at Level 1



Rank Class id link
1 31705 PARTIALLY OBSERVED MARKOV DECISION PROCESS//BLACKWELL DOMINANCE//ACTIVE STATE TRACKING
2 17239 ADAPTIVE DYNAMIC PROGRAMMING//ADAPTIVE CRITIC DESIGNS//ADAPTIVE DYNAMIC PROGRAMMING ADP
3 16494 ROBOT SOCCER//ROBOCUP//SOCCER ROBOTS
4 37405 UNIVERSAL PSYCHOMETRICS//AESTHETICS THEORY//AGENT POLICY
5 11954 IMITATION LEARNING//DEVELOPMENTAL ROBOTICS//LEARNING FROM DEMONSTRATION
6 21199 LEARNING CLASSIFIER SYSTEMS//XCS//CLASSIFIER SYSTEMS
7 10960 AUTOMATED PLANNING//JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH//ARTIFICIAL INTELLIGENCE
8 13929 EVOLUTIONARY ROBOTICS//ARTIFICIAL LIFE//ADAPTIVE BEHAVIOR
9 9283 OPTIMAL STATIONARY POLICY//CONTINUOUS TIME MARKOV DECISION PROCESS//ESTADIST CALCULO
10 22282 LEARNING AUTOMATA//LEARNING AUTOMATA LA//WEAK ESTIMATORS

Go to start page