Implementation of parameterized soft-exponential activation function.

Last update: Feb 23, 2022

Overview

Soft-Exponential-Activation-Function:

Implementation of parameterized soft-exponential activation function. In this implementation, the parameters are the same for all neurons initially starting with -0.01. This activation function revolves around the idea of a "soft" exponential function. The soft-exponential function is a function that is very similar to the exponential function, but it is not as steep at the beginning and it is more gradual at the end. The soft-exponential function is a good choice for neural networks that have a lot of connections and a lot of neurons.

This activation function is under the idea that the function is logarithmic, linear, exponential and smooth.

The equation for the soft-exponential function is:

$$ f(\alpha,x)= \left{ \begin{array}{ll} -\frac{ln(1-\alpha(x + \alpha))}{\alpha} & \alpha < 0\ x & \alpha = 0 \ \frac{e^{\alpha x} - 1}{\alpha} + \alpha & \alpha > 0 \ \end{array} \right. $$

Problems faced:

1. Misinformation about the function

From a paper by A continuum among logarithmic, linear, and exponential functions, and its potential to improve generalization in neural networks, here in Figure 2, the soft-exponential function is shown as a logarithmic function. This is not the case.

The real figure should be shown here:

Here we can see in some cases the soft-exponential function is undefined for some values of $\alpha$,$x$ and $\alpha$,$x$ is not a constant.

2. Negative values inside logarithm

Here comes the tricky part. The soft-exponential function is defined for all values of $\alpha$ and $x$. However, the logarithm is not defined for negative values.

In the issues under Keras, one of the person has suggested to use the following function $sinh^{-1}()$ instead of the $\ln()$.

3. Initialization of alpha

Starting with an initial value of -0.01, the soft-exponential function was steep at the beginning and it is more gradual at the end. This was a good idea.

Performance:

First picture showing the accuracy of the soft-exponential function.

This shows the loss of the soft-exponential function.

Model Structure:

_________________________________________________________________
 Layer (type)                Output Shape              Param #   
=================================================================
 input_1 (InputLayer)        [(None, 28, 28)]          0         
                                                                 
 flatten (Flatten)           (None, 784)               0         
                                                                 
 dense_layer (Dense_layer)   (None, 128)               100480    
                                                                 
 parametric_soft_exp (Parame  (None, 128)              128       
 tricSoftExp)                                                    
                                                                 
 dense_layer_1 (Dense_layer)  (None, 128)              16512     
                                                                 
 parametric_soft_exp_1 (Para  (None, 128)              128       
 metricSoftExp)                                                  
                                                                 
 dense (Dense)               (None, 10)                1290      
                                                                 
=================================================================
Total params: 118,538
Trainable params: 118,538
Non-trainable params: 0

Implementation of parameterized soft-exponential activation function.

Related tags

Overview

Soft-Exponential-Activation-Function:

Problems faced:

1. Misinformation about the function

2. Negative values inside logarithm

3. Initialization of alpha

Performance:

Acknowledgements:

Owner

Shuvrajeet Das

Randstad Artificial Intelligence Challenge (powered by VGEN). Soluzione proposta da Stefano Fiorucci (anakin87) - primo classificato

A Deep learning based streamlit web app which can tell with which bollywood celebrity your face resembles.

PyTorch implementation of NeurIPS 2021 paper: "CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration"

【steal piano】GitHub偷情分析工具！

1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task

Vision-and-Language Navigation in Continuous Environments using Habitat

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

An open source app to help calm you down when needed.

Learning Neural Network Subspaces

This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

Structure-Preserving Deraining with Residue Channel Prior Guidance (ICCV2021)

Deep learning-based approach to discovering Granger causality networks in multivariate time series

MNIST, but with Bezier curves instead of pixels

Project repo for the paper SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition

NLMpy - A Python package to create neutral landscape models

we propose a novel deep network, named feature aggregation and refinement network (FARNet), for the automatic detection of anatomical landmarks.

LogAvgExp - Pytorch Implementation of LogAvgExp

Python script that takes an Impulse response .wav and a input .wav to demonstrate audio convolution.

Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations

Code and models for "Rethinking Deep Image Prior for Denoising" (ICCV 2021)