Home Publications ConferencesReluVictorCAEPIA2021

ReLU-based activations: analysis and experimental study for deep learning

Research areas:

Uncategorized

Year:

2021

Type of Publication:

In Proceedings

Keywords:

analysis activations, RELU, RELU activations, deep learning

Authors:

Volume:

12882

Book title:

Proceedings of the XIX Conference of the Spanish Association for Artificial Intelligence (CAEPIA)

Series:

Lecture Notes in Artificial Intelligence (LNAI)

Pages:

33-43

Organization:

Malaga, Spain

Month:

22nd-24th September

ISBN:

978-3-030-85712-7

ISSN:

0302-9743

BibTex:

@conference{ReluVictorCAEPIA2021,
author = "V{\'i}ctor Manuel Vargas and David Guijo-Rubio and Pedro Antonio Guti{\'e}rrez and C{\'e}sar Herv{\'a}s-Mart{\'i}nez",
abstract = "Activation functions are used in neural networks as a tool to introduce non-linear transformations into the model and, thus, enhance its representation capabilities. They also determine the output range of the hidden layers and the final output.  Traditionally, artificial neural networks mainly used the sigmoid activation function as the depth of the network was limited. Nevertheless, this function tends to saturate the gradients when the number of hidden layers increases. For that reason, in the last years, most of the works published related to deep learning and convolutional networks use the Rectified Linear Unit (ReLU), given that it provides good convergence properties and speeds up the training process thanks to the simplicity of its derivative. However, this function has some known drawbacks that gave rise to new proposals of alternatives activation functions based on ReLU. In this work, we describe, analyse and compare different recently proposed alternatives to test whether these functions improve the performance of deep learning models regarding the standard ReLU.",
booktitle = "Proceedings of the XIX Conference of the Spanish Association for Artificial Intelligence (CAEPIA)",
doi = "10.1007/978-3-030-85713-4_4",
isbn = "978-3-030-85712-7",
issn = "0302-9743",
keywords = "analysis activations, RELU, RELU activations, deep learning",
month = "22nd-24th September",
organization = "Malaga, Spain",
pages = "33-43",
publisher = "Springer",
series = " Lecture Notes in Artificial Intelligence (LNAI)",
title = "{R}e{LU}-based activations: analysis and experimental study for deep learning",
url = "doi.org/10.1007/978-3-030-85713-4_4",
volume = "12882",
year = "2021",
}

Abstract:

Activation functions are used in neural networks as a tool to introduce non-linear transformations into the model and, thus, enhance its representation capabilities. They also determine the output range of the hidden layers and the final output. Traditionally, artificial neural networks mainly used the sigmoid activation function as the depth of the network was limited. Nevertheless, this function tends to saturate the gradients when the number of hidden layers increases. For that reason, in the last years, most of the works published related to deep learning and convolutional networks use the Rectified Linear Unit (ReLU), given that it provides good convergence properties and speeds up the training process thanks to the simplicity of its derivative. However, this function has some known drawbacks that gave rise to new proposals of alternatives activation functions based on ReLU. In this work, we describe, analyse and compare different recently proposed alternatives to test whether these functions improve the performance of deep learning models regarding the standard ReLU.

Online version [Bibtex] [RIS] [MODS]

Back