Abstract
In this paper, we present opposition-based schemes for four reinforcement learning methods: Q-learning, Q(λ), Sarsa, and Sarsa(λ), under assumptions that are reasonable for many real-world problems, where type-II opposites generally better reflect the nature of the problem at hand. Combining opposition-based schemes with the regular learning methods can significantly speed up learning, especially when the number of observations is small or the state space is large. We verify the performance of the proposed methods on two applications: a grid-world problem and a single water-reservoir management problem.
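To make the idea concrete, here is a minimal sketch of how an opposition update can be aggregated with regular Q-learning. The setup is an illustrative assumption, not the paper's exact formulation: a 1-D grid world with mirrored (type-I-style) opposite actions, where the opposite transition and reward can be computed from the grid structure, so each real step also yields a "free" update for the opposite action.

```python
import random

# Illustrative 1-D grid world: states 0..N-1, reward 1 for reaching the
# rightmost state. The environment and opposite_action() are assumptions
# for this sketch, not the paper's type-II scheme.
N_STATES = 10
ACTIONS = [-1, +1]          # step left / step right
ALPHA, GAMMA, EPS = 0.5, 0.9, 0.1

def opposite_action(a):
    # Mirrored opposite of a move action.
    return -a

def step(s, a):
    """Deterministic grid transition; terminal at the rightmost state."""
    s2 = min(max(s + a, 0), N_STATES - 1)
    r = 1.0 if s2 == N_STATES - 1 else 0.0
    return s2, r, s2 == N_STATES - 1

def greedy(Q, s):
    best = max(Q[(s, a)] for a in ACTIONS)
    return random.choice([a for a in ACTIONS if Q[(s, a)] == best])

def train(episodes=300, max_steps=1000, seed=0):
    random.seed(seed)
    Q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            a = random.choice(ACTIONS) if random.random() < EPS else greedy(Q, s)
            s2, r, done = step(s, a)
            # Regular Q-learning update for the taken action...
            Q[(s, a)] += ALPHA * (r + GAMMA * max(Q[(s2, x)] for x in ACTIONS) - Q[(s, a)])
            # ...plus the opposition update: because the opposite
            # transition is known from the grid structure, the opposite
            # action's value is updated without actually taking it.
            oa = opposite_action(a)
            os2, orr, _ = step(s, oa)
            Q[(s, oa)] += ALPHA * (orr + GAMMA * max(Q[(os2, x)] for x in ACTIONS) - Q[(s, oa)])
            s = s2
            if done:
                break
    return Q
```

Each visited state thus receives two updates per step instead of one, which is the mechanism behind the claimed speed-up when observations are scarce or the state space is large.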
Original language | English (US) |
---|---|
Pages (from-to) | 101-114 |
Number of pages | 14 |
Journal | Information Sciences |
Volume | 275 |
DOIs | |
State | Published - Aug 10 2014 |
Keywords
- Grid world
- Opposition-based learning
- Q-learning
- Reinforcement learning
- Reservoir management
- Sarsa
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Theoretical Computer Science
- Computer Science Applications
- Information Systems and Management
- Artificial Intelligence