Recurrent Neural Networks (RNNs): Architectures, Training Tricks, and Introduction to Influential Research

Susmita Das, Amara Tariq, Thiago Santos, Sai Sandeep Kantareddy, Imon Banerjee

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

Recurrent neural networks (RNNs) are neural network architectures that maintain a hidden state and use feedback loops to process a sequence of data, which ultimately informs the final output. RNN models can therefore recognize sequential characteristics in data and help predict the next likely point in a sequence. Leveraging this power of sequential data processing, RNN use cases tend to be connected to either language models or time-series data analysis. Multiple popular RNN architectures have been introduced in the field, ranging from SimpleRNN and LSTM to deep RNN, and have been applied in different experimental settings. In this chapter, we will present six distinct RNN architectures and will highlight the pros and cons of each model. Afterward, we will discuss practical tips and tricks for training RNN models. Finally, we will present four popular language modeling applications of RNN models (text classification, summarization, machine translation, and image-to-text translation), thereby demonstrating influential research in the field.
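
To make the hidden-state and feedback-loop mechanism mentioned above concrete, here is a minimal NumPy sketch of a SimpleRNN cell. This is an illustrative assumption on our part, not code from the chapter; the function name simple_rnn and all weight shapes are hypothetical. It computes the standard update h_t = tanh(W_xh x_t + W_hh h_{t-1} + b_h), where feeding h_{t-1} back into the next step is the feedback loop the abstract refers to.

```python
import numpy as np

def simple_rnn(inputs, W_xh, W_hh, b_h):
    """Run a SimpleRNN over a sequence of input vectors; return all hidden states.

    Illustrative sketch only (not the chapter's code). The previous hidden
    state h is fed back into each step, giving the network its memory of the
    sequence seen so far.
    """
    hidden_size = W_hh.shape[0]
    h = np.zeros(hidden_size)            # initial hidden state h_0
    states = []
    for x_t in inputs:                   # process the sequence one step at a time
        h = np.tanh(W_xh @ x_t + W_hh @ h + b_h)   # h_t depends on x_t and h_{t-1}
        states.append(h)
    return np.stack(states)

# Toy usage: a sequence of 5 random 3-dimensional inputs, hidden size 4.
rng = np.random.default_rng(0)
input_size, hidden_size, seq_len = 3, 4, 5
W_xh = rng.normal(scale=0.1, size=(hidden_size, input_size))
W_hh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))
b_h = np.zeros(hidden_size)
xs = rng.normal(size=(seq_len, input_size))
print(simple_rnn(xs, W_xh, W_hh, b_h).shape)  # (5, 4): one hidden state per step
```

The final hidden state (or the full sequence of states) is what downstream layers consume in the applications the chapter covers, such as text classification or machine translation.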

Original language: English (US)
Title of host publication: Neuromethods
Publisher: Humana Press Inc.
Pages: 117-138
Number of pages: 22
DOIs
State: Published - 2023

Publication series

Name: Neuromethods
Volume: 197
ISSN (Print): 0893-2336
ISSN (Electronic): 1940-6045

Keywords

  • Bidirectional RNN (BRNN)
  • Deep RNN
  • GRU
  • LSTM
  • Language modeling
  • Recurrent neural network (RNN)

ASJC Scopus subject areas

  • Psychiatry and Mental health
  • General Pharmacology, Toxicology and Pharmaceutics
  • General Biochemistry, Genetics and Molecular Biology
  • General Neuroscience
