Document Type
Article
Disciplines
2.2 ELECTRICAL, ELECTRONIC, INFORMATION ENGINEERING, Electrical and electronic engineering
Abstract
In recent years, deep neural networks with different learning paradigms have been widely employed in applications including medical diagnosis, image analysis, self-driving vehicles and others. The activation functions used in a deep neural network strongly influence both the training of the model and its reliability. The Rectified Linear Unit (ReLU) has emerged as the most popular and extensively used activation function. However, ReLU has known flaws: during back-propagation it is active only when its inputs are positive and zero otherwise, which can cause neurons to die (the dying ReLU problem) and introduces a bias shift. Unlike ReLU, the Swish activation function is non-monotonic: its output neither stays fixed at zero nor moves in a single direction. This research proposes a new activation function named NIPUNA for deep neural networks. We evaluate this activation by training customized convolutional neural networks (CCNNs). On benchmark datasets (the Fashion-MNIST dataset of clothing images and the MNIST dataset of handwritten digits), the contributions are examined and compared with various activation functions. The proposed activation function can outperform traditional activation functions.
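For readers unfamiliar with the baselines the abstract contrasts, the following minimal Python/NumPy sketch illustrates the standard ReLU and Swish activations and the behaviors described above. It is an illustration only, not the authors' NIPUNA implementation, whose definition appears in the full article and is not reproduced here.

    import numpy as np

    def relu(x):
        # ReLU: passes positive inputs unchanged, outputs zero otherwise.
        # Units that stay in the zero region receive no gradient during
        # back-propagation ("dying ReLU").
        return np.maximum(0.0, x)

    def swish(x, beta=1.0):
        # Swish: x * sigmoid(beta * x). Smooth and non-monotonic, so its
        # output does not move in a single direction as x increases.
        return x / (1.0 + np.exp(-beta * x))

    x = np.linspace(-5.0, 5.0, 11)
    print(relu(x))   # zero for x <= 0, identity for x > 0
    print(swish(x))  # small negative dip below zero, then roughly linear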
DOI
https://doi.org/10.3390/axioms12030246
Recommended Citation
Madhu, Golla; Kautish, Sandeep; Alnowibet, Khalid Abdulaziz; Zawbaa, Hossam; and Mohamed, Ali Wagdy, "NIPUNA: A Novel Optimizer Activation Function for Deep Neural Networks" (2023). Articles. 367.
https://arrow.tudublin.ie/engscheleart2/367
Funder
The research is funded by the Researchers Supporting Program at King Saud University (RSP2023R305).
Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Publication Details
https://www.mdpi.com/2075-1680/12/3/246
Madhu, G.; Kautish, S.; Alnowibet, K.A.; Zawbaa, H.M.; Mohamed, A.W. NIPUNA: A Novel Optimizer Activation Function for Deep Neural Networks. Axioms, 2023, 12, 246.
https://doi.org/10.3390/axioms12030246