[slides and audio] Biographies%2C Bollywood%2C Boom-boxes and Blenders%3A Domain Adaptation for Sentiment Classification

This paper presents domain adaptation techniques for sentiment classification, focusing on adapting classifiers from one domain to another. The authors extend the structural correspondence learning (SCL) algorithm to sentiment classification, achieving significant improvements in accuracy. They also introduce a measure of domain similarity, the A-distance, which correlates well with the potential for adaptation between domains. The study uses Amazon product reviews for four domains: books, DVDs, electronics, and kitchen appliances. The authors show that by selecting pivot features based on mutual information with the source labels, they can improve the performance of SCL. They also demonstrate how to correct feature misalignments using a small amount of labeled target domain data. The A-distance is used to measure the loss due to adaptation between domains. It is calculated based on the divergence of two domains after the SCL projection. The A-distance correlates well with adaptation loss, indicating that it can be used to select source domains that are likely to adapt well to a target domain. The authors evaluate their methods on a dataset of Amazon product reviews and show that their techniques significantly reduce the error due to adaptation. They also show that using a small amount of labeled target domain data can help correct misalignments in the feature space. The study highlights the importance of domain adaptation in sentiment classification, where the same classifier can be applied to different domains. The authors propose methods to adapt classifiers between domains, which can be useful in applications where annotated data is limited. The results show that their methods outperform traditional supervised baselines and provide a promising approach for domain adaptation in sentiment classification.This paper presents domain adaptation techniques for sentiment classification, focusing on adapting classifiers from one domain to another. The authors extend the structural correspondence learning (SCL) algorithm to sentiment classification, achieving significant improvements in accuracy. They also introduce a measure of domain similarity, the A-distance, which correlates well with the potential for adaptation between domains. The study uses Amazon product reviews for four domains: books, DVDs, electronics, and kitchen appliances. The authors show that by selecting pivot features based on mutual information with the source labels, they can improve the performance of SCL. They also demonstrate how to correct feature misalignments using a small amount of labeled target domain data. The A-distance is used to measure the loss due to adaptation between domains. It is calculated based on the divergence of two domains after the SCL projection. The A-distance correlates well with adaptation loss, indicating that it can be used to select source domains that are likely to adapt well to a target domain. The authors evaluate their methods on a dataset of Amazon product reviews and show that their techniques significantly reduce the error due to adaptation. They also show that using a small amount of labeled target domain data can help correct misalignments in the feature space. The study highlights the importance of domain adaptation in sentiment classification, where the same classifier can be applied to different domains. The authors propose methods to adapt classifiers between domains, which can be useful in applications where annotated data is limited. The results show that their methods outperform traditional supervised baselines and provide a promising approach for domain adaptation in sentiment classification.

Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification

June 2007 | John Blitzer, Mark Dredze, Fernando Pereira