Improving gradient-based learning algorithms for large scale feedforward networks

M. Ventresca, H. R. Tizhoosh

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Large scale neural networks have many hundreds or thousands of parameters (weights and biases) to learn, and as a result tend to have very long training times. Small scale networks can be trained quickly by using second-order information, but these methods fail for large architectures due to their high computational cost. Other approaches employ local search strategies, which also add to the computational cost. In this paper we present a simple method, based on opposite transfer functions, which greatly improves the convergence rate and accuracy of gradient-based learning algorithms. We use two variants of the backpropagation algorithm and common benchmark data to highlight the improvements. We find statistically significant improvements in both convergence speed and accuracy.
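To make the idea concrete, the sketch below illustrates what an opposite transfer function looks like and how an opposition step could be combined with plain backpropagation. This is a minimal illustration only, assuming the common definition in which the opposite of an activation phi(x) is phi(-x) (for the logistic sigmoid this equals 1 - sigmoid(x)); the selection rule, network size, and XOR data used here are hypothetical simplifications, not the authors' exact algorithm or benchmarks.

import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def opposite_sigmoid(x):
    # The "opposite" of an activation phi(x) is taken here as phi(-x);
    # for the logistic sigmoid this equals 1 - sigmoid(x).
    return sigmoid(-x)

# Each candidate: (activation, derivative expressed via the activation's output h).
candidates = {
    "sigmoid":  (sigmoid,          lambda h: h * (1.0 - h)),
    "opposite": (opposite_sigmoid, lambda h: -h * (1.0 - h)),
}

# Tiny illustrative data set (XOR), not one of the paper's benchmarks.
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = np.array([[0.], [1.], [1.], [0.]])

def forward(W1, b1, W2, b2, act):
    h = act(X @ W1 + b1)           # hidden layer (2 units)
    out = sigmoid(h @ W2 + b2)     # output layer (1 unit)
    return h, out

def mse(out):
    return float(np.mean((out - y) ** 2))

# Random initial weights for a 2-2-1 feedforward network.
W1, b1 = rng.normal(size=(2, 2)), np.zeros(2)
W2, b2 = rng.normal(size=(2, 1)), np.zeros(1)

# Opposition step (simplified): before training, evaluate the network with the
# original and the opposite hidden activation and keep whichever starts with
# lower error. This costs only one extra forward pass.
name, (act, dact) = min(candidates.items(),
                        key=lambda kv: mse(forward(W1, b1, W2, b2, kv[1][0])[1]))
print("selected hidden activation:", name)

# Plain batch backpropagation with the selected activation (the constant factor
# from the MSE derivative is absorbed into the learning rate).
lr = 0.5
for _ in range(5000):
    h, out = forward(W1, b1, W2, b2, act)
    d_out = (out - y) * out * (1.0 - out)   # output-layer delta
    d_h = (d_out @ W2.T) * dact(h)          # hidden-layer delta
    W2 -= lr * (h.T @ d_out)
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * (X.T @ d_h)
    b1 -= lr * d_h.sum(axis=0)

print("final training MSE:", mse(forward(W1, b1, W2, b2, act)[1]))

The intuition illustrated here is that trying the opposite activation adds only a forward pass of extra work, which is far cheaper than second-order information or local search; the paper's actual selection scheme and experiments should be consulted for the details.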

Original language: English (US)
Title of host publication: 2009 International Joint Conference on Neural Networks, IJCNN 2009
Pages: 3212-3219
Number of pages: 8
DOIs
State: Published - 2009
Event: 2009 International Joint Conference on Neural Networks, IJCNN 2009 - Atlanta, GA, United States
Duration: Jun 14 2009 - Jun 19 2009

Publication series

Name: Proceedings of the International Joint Conference on Neural Networks

Conference

Conference: 2009 International Joint Conference on Neural Networks, IJCNN 2009
Country/Territory: United States
City: Atlanta, GA
Period: 6/14/09 - 6/19/09

Keywords

  • Backpropagation
  • Gradient-based learning
  • Large scale networks
  • Opposite transfer functions
  • Opposition-based computing

ASJC Scopus subject areas

  • Software
  • Artificial Intelligence
