Ladder Variational Autoencoders

27 May 2016 | Casper Kaae Sønderby, Tapani Raiko, Lars Maaløe, Søren Kaae Sønderby, Ole Winther
The paper introduces the Ladder Variational Autoencoder (LVAE), a new inference model designed to improve the performance of Variational Autoencoders (VAEs) in unsupervised learning. Although VAEs are powerful generative models, they are difficult to train with deep hierarchies of stochastic latent variables: in practice only the lowest layers tend to stay active, which limits their effectiveness. The LVAE addresses this by recursively correcting the generative distribution with a data-dependent approximate likelihood, in a scheme similar to the Ladder Network. Because the inference and generative models share a top-down dependency structure, information flows between them and optimization becomes easier.

The authors demonstrate that the LVAE achieves state-of-the-art predictive log-likelihood and tighter log-likelihood lower bounds than traditional VAEs. They also show that the LVAE learns a deeper and more distributed hierarchy of latent variables, yielding a more structured and informative representation of the data. In addition, the paper highlights the importance of batch normalization and of a deterministic warm-up period, during which the KL term of the variational bound is gradually annealed in, for training deep stochastic models.

Experiments on MNIST, OMNIGLOT, and NORB confirm the LVAE's superior performance, especially on complex datasets modeled with multiple layers of latent variables. The LVAE outperforms other advanced methods such as Normalizing Flows and Variational Gaussian Processes, achieving higher log-likelihood scores and tighter lower bounds. The paper concludes by discussing the benefits of the LVAE's recursive inference model and its potential applications in semi-supervised learning and more sophisticated inference schemes.
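To make the recursive correction concrete, the sketch below shows the core LVAE step in PyTorch: each layer's approximate posterior is a precision-weighted combination of the bottom-up, data-dependent Gaussian estimate and the top-down estimate from the generative path, and the per-layer KL term is scaled by a warm-up coefficient. This is a minimal illustration under stated assumptions, not the authors' code; the names (precision_weighted_merge, warmup_beta, gaussian_kl) and the 200-epoch ramp length are illustrative choices.

```python
import torch

def precision_weighted_merge(mu_bu, logvar_bu, mu_td, logvar_td):
    """Merge a bottom-up (data-dependent) Gaussian estimate with the
    top-down estimate from the generative path, as in the LVAE:
        sigma_q^2 = 1 / (sigma_bu^-2 + sigma_td^-2)
        mu_q      = (mu_bu * sigma_bu^-2 + mu_td * sigma_td^-2) * sigma_q^2
    """
    prec_bu = torch.exp(-logvar_bu)      # bottom-up precision (1 / variance)
    prec_td = torch.exp(-logvar_td)      # top-down precision
    var_q = 1.0 / (prec_bu + prec_td)    # merged posterior variance
    mu_q = (mu_bu * prec_bu + mu_td * prec_td) * var_q
    return mu_q, torch.log(var_q)

def warmup_beta(epoch, warmup_epochs=200):
    """Deterministic warm-up: linearly anneal the KL weight from 0 to 1
    over the first `warmup_epochs` epochs (a tunable hyperparameter)."""
    return min(1.0, epoch / warmup_epochs)

def gaussian_kl(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between two diagonal Gaussians, summed over dimensions."""
    return 0.5 * torch.sum(
        logvar_p - logvar_q
        + (torch.exp(logvar_q) + (mu_q - mu_p) ** 2) / torch.exp(logvar_p)
        - 1.0
    )
```

In a training loop, each layer's gaussian_kl term would be multiplied by warmup_beta(epoch) before being added to the reconstruction loss, so the model starts out close to a deterministic autoencoder and only gradually feels the full variational regularization, which is what allows the higher stochastic layers to stay active.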