Multiagent reinforcement learning using Non-Parametric Approximation

David Luviano Cruz; Francesco José García Luna; Luis Asunción Pérez Domínguez

doi:10.22463/0122820X.1738

How to Cite

Cruz, D. L., García Luna, F. J., & Pérez Domínguez, L. A. (2018). Multiagent reinforcement learning using Non-Parametric Approximation. Respuestas, 23(2), 53–61. https://doi.org/10.22463/0122820X.1738

Published: Jul 1, 2018

Issue

Vol. 23 No. 2 (2018)

Section

Investigation articles

License Terms

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International license.

David Luviano Cruz

Universidad Autónoma de Ciudad de Juárez

https://orcid.org/0000-0002-4778-8873

Francesco José García Luna

Universidad Autónoma de Ciudad de Juárez

https://orcid.org/0000-0002-8571-914X

Luis Asunción Pérez Domínguez

Universidad Autónoma de Ciudad de Juárez

https://orcid.org/0000-0003-2541-4595

Abstract

This paper presents a hybrid control scheme for multi-agent systems that exploits the advantages of both reinforcement learning and nonparametric function approximation. A modified version of the Q-learning algorithm generates training data for a kernel approximator, which in turn provides a suboptimal set of actions for the agents. The proposed algorithm is tested experimentally on a path generation task for mobile robots in an unknown environment.
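
The abstract does not specify the modified Q-learning rule or the kernel, so the following is only a minimal sketch of the general idea under stated assumptions: standard tabular Q-learning (not the paper's modified variant) on a hypothetical 5x5 grid with a single agent, followed by a Nadaraya-Watson estimator with a Gaussian kernel fitted to the learned Q-values. The grid size, goal cell, reward, learning parameters, bandwidth, and all function names are illustrative assumptions.

```python
import math
import random

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # assumed moves: up, down, right, left
SIZE, GOAL = 5, (4, 4)                        # assumed 5x5 grid with one goal cell


def step(state, action):
    """Move inside the grid; reward 1.0 on reaching the goal, else 0.0."""
    x = min(max(state[0] + action[0], 0), SIZE - 1)
    y = min(max(state[1] + action[1], 0), SIZE - 1)
    nxt = (x, y)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Standard tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    n = len(ACTIONS)
    q = {((x, y), a): 0.0
         for x in range(SIZE) for y in range(SIZE) for a in range(n)}
    for _ in range(episodes):
        state = (rng.randrange(SIZE), rng.randrange(SIZE))
        for _ in range(50):  # cap on episode length
            if state == GOAL:
                break
            if rng.random() < epsilon:
                a = rng.randrange(n)
            else:
                a = max(range(n), key=lambda i: q[(state, i)])
            nxt, reward, done = step(state, ACTIONS[a])
            best_next = 0.0 if done else max(q[(nxt, i)] for i in range(n))
            q[(state, a)] += alpha * (reward + gamma * best_next - q[(state, a)])
            state = nxt
    return q


def kernel_q(q, point, a, bandwidth=0.5):
    """Nadaraya-Watson estimate of Q(point, a) from the tabular samples."""
    num = den = 0.0
    for (s, act), value in q.items():
        if act != a:
            continue
        d2 = (s[0] - point[0]) ** 2 + (s[1] - point[1]) ** 2
        w = math.exp(-d2 / (2.0 * bandwidth ** 2))  # Gaussian kernel weight
        num += w * value
        den += w
    return num / den if den > 0.0 else 0.0


def greedy_action(q, point, bandwidth=0.5):
    """Index of the action with the largest kernel-smoothed Q-value."""
    return max(range(len(ACTIONS)), key=lambda a: kernel_q(q, point, a, bandwidth))
```

Calling `q = q_learning()` and then `greedy_action(q, (2.5, 3.5))` queries the smoothed estimate at a position that is not a grid cell, which is the role the nonparametric approximator plays in this sketch: the table supplies training samples, and the kernel generalises them to positions the table does not cover.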

Keywords

Multiagent systems, nonparametric approximator, reinforcement learning, trajectory generation
