This paper explores the effectiveness of supervised learning for training universal sentence representations, using the Stanford Natural Language Inference (SNLI) dataset. The authors compare several sentence encoder architectures, including recurrent neural networks (RNNs) and convolutional neural networks (CNNs), and find that a bidirectional LSTM (BiLSTM) with max pooling outperforms the alternatives on a wide range of transfer tasks. They demonstrate that sentence embeddings trained on SNLI consistently outperform unsupervised methods such as SkipThought vectors, suggesting that natural language inference is well suited as a pretraining task for transfer learning in NLP. The paper also discusses how different training methods and optimization algorithms affect encoder performance. The authors conclude that their BiLSTM-max model, trained on SNLI, outperforms SkipThought vectors on multiple tasks while being faster to train. The approach is evaluated comprehensively on a variety of transfer tasks, including binary and multi-class classification, entailment, semantic relatedness, and image-caption retrieval.
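Since the summary centers on the BiLSTM-max encoder, a minimal sketch may help make the architecture concrete: a bidirectional LSTM produces a hidden state per token, and element-wise max pooling over time yields a fixed-size sentence embedding. This is an illustrative PyTorch implementation, not the authors' code; the vocabulary size, embedding dimension, and hidden dimension are assumed values rather than the paper's hyperparameters.

```python
# Minimal sketch of a BiLSTM-max sentence encoder (assumed dimensions,
# not the paper's exact configuration).
import torch
import torch.nn as nn

class BiLSTMMaxEncoder(nn.Module):
    def __init__(self, vocab_size: int, embed_dim: int = 300, hidden_dim: int = 512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # Bidirectional LSTM: each token gets a 2 * hidden_dim representation.
        self.lstm = nn.LSTM(embed_dim, hidden_dim,
                            batch_first=True, bidirectional=True)

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (batch, seq_len)
        embedded = self.embed(token_ids)          # (batch, seq_len, embed_dim)
        hidden_states, _ = self.lstm(embedded)    # (batch, seq_len, 2 * hidden_dim)
        # Max pooling over the time dimension gives a fixed-size
        # sentence embedding regardless of sentence length.
        sentence_embedding, _ = hidden_states.max(dim=1)
        return sentence_embedding                 # (batch, 2 * hidden_dim)

# Usage: encode a toy batch of two padded sentences of length 7.
encoder = BiLSTMMaxEncoder(vocab_size=10_000)
batch = torch.randint(1, 10_000, (2, 7))
print(encoder(batch).shape)                       # torch.Size([2, 1024])
```

For the NLI training objective, the paper combines the premise embedding u and hypothesis embedding v as the concatenation [u; v; |u - v|; u * v] and feeds the result to a classifier over the entailment, contradiction, and neutral labels; only the encoder is reused for transfer.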