Multi-Sample Dropout: A Method That Can Cut Training Time by up to 4x
Multi-Sample Dropout, introduced in the paper *Multi-Sample Dropout for Accelerated Training and Better Generalization*, extends traditional dropout by applying multiple dropout masks to the same mini-batch.

Standard dropout creates a single randomly selected subset of the input (called a dropout sample) in each training iteration, whereas multi-sample dropout creates several dropout samples from the same input. The loss is computed for each sample, and the final loss is the average of these per-sample losses.
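The idea maps directly to code. Below is a minimal PyTorch sketch, not the paper's actual implementation: the class name `MultiSampleDropoutHead` and the `num_samples`/`p` defaults are my own illustrative choices. The same features pass through several independent dropout masks, a shared linear layer produces logits for each mask, and the per-mask losses are averaged.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiSampleDropoutHead(nn.Module):
    """Illustrative classifier head: applies `num_samples` independent dropout
    masks to the same features and averages the resulting losses."""
    def __init__(self, in_features, num_classes, num_samples=8, p=0.5):
        super().__init__()
        self.dropouts = nn.ModuleList([nn.Dropout(p) for _ in range(num_samples)])
        self.fc = nn.Linear(in_features, num_classes)  # weights shared across all samples

    def forward(self, features, targets=None):
        # One set of logits per dropout mask, all using the same linear layer
        logits_per_sample = [self.fc(drop(features)) for drop in self.dropouts]
        if targets is None:
            # At inference, dropout is a no-op, so averaging the logits is harmless
            return torch.stack(logits_per_sample).mean(dim=0)
        # During training, compute one loss per dropout sample and average them
        losses = [F.cross_entropy(logits, targets) for logits in logits_per_sample]
        return torch.stack(losses).mean()
```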
The paper shows that multi-sample dropout significantly accelerates training on image classification tasks by reducing the number of iterations needed to converge, but the experiments use the traditional training recipe of a constant learning rate decayed on a schedule. So I test the method with cyclic learning rates and see whether I can reproduce the results from the paper.
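For context, "cyclic learning" here means a one-cycle learning-rate schedule rather than a constant/step-decayed one. The sketch below shows roughly what that setup looks like with PyTorch's built-in `OneCycleLR` scheduler and the hypothetical head from the previous snippet; the tiny backbone, random tensors, and hyperparameter values are placeholders, not the actual ResNet-56/CIFAR-100 configuration from the notebook.

```python
import torch
import torch.nn as nn

# Placeholder stand-ins so the schedule can be shown end to end;
# in the notebook these would be ResNet-56 features and the CIFAR-100 loaders.
backbone = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 64), nn.ReLU())
head = MultiSampleDropoutHead(64, num_classes=100, num_samples=8)

images = torch.randn(512, 3, 32, 32)
labels = torch.randint(0, 100, (512,))
loader = torch.utils.data.DataLoader(
    torch.utils.data.TensorDataset(images, labels), batch_size=128)

epochs = 3
params = list(backbone.parameters()) + list(head.parameters())
optimizer = torch.optim.SGD(params, lr=0.1, momentum=0.9, weight_decay=5e-4)
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=0.1, epochs=epochs, steps_per_epoch=len(loader))

for epoch in range(epochs):
    for x, y in loader:
        optimizer.zero_grad()
        loss = head(backbone(x), y)   # averaged multi-sample dropout loss
        loss.backward()
        optimizer.step()
        scheduler.step()              # LR follows the one-cycle curve every batch
```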
Note: if you are not familiar with cyclic learning, I wrote a Jupyter notebook explaining the four key papers by Leslie N. Smith that introduced these techniques, Reproducing Leslie N. Smith's papers using fastai.
Table of Contents:
- Load CIFAR-100 (I test on CIFAR-100 first)
- ResNet-56
- How to implement multi-sample dropout in a model
- Diversity among samples is needed
- Code for Multi-Sample Dropout
- Code for the Multi-Sample Dropout loss function
- Get a baseline without Multi-Sample Dropout
All the code and a detailed discussion of the topic are covered in this Jupyter notebook.
I am thinking about shifting to Jupyter notebooks for these tutorials, as it is easier to experiment in a notebook.
If you want to follow along and know when I publish a new notebook, check https://kushajveersingh.github.io/notebooks/. I will keep updating the list as I make new notebooks.
I will still write Medium posts, but most of the paper implementations will be done in Jupyter notebooks.