Analyzing the Effectiveness and Applicability of Co-training

2000 | Kamal Nigam, Rayid Ghani
This paper investigates the effectiveness and applicability of co-training algorithms for text classification. Co-training is a semi-supervised learning method that combines labeled and unlabeled data and is particularly effective when the features split naturally into two disjoint, redundant sets ("views"). The paper demonstrates that co-training outperforms other methods, such as Expectation-Maximization (EM), when a natural feature split exists; even when no natural split is available, co-training variants that create an artificial split can still outperform non-split methods.

The authors compare co-training with EM on several datasets, including WebKB-Course and News 2x2. On WebKB-Course, co-training does not outperform EM, suggesting that the dataset's feature split may not satisfy co-training's assumptions well enough for it to excel. On News 2x2, which is constructed to have true class-conditional independence between the two feature sets, co-training significantly outperforms EM, indicating that co-training benefits from a valid feature split. The paper also explores hybrid algorithms that combine ideas from co-training and EM, such as co-EM and self-training; these show improved performance over the traditional EM and co-training baselines.

Overall, the results suggest that co-training can benefit from feature splits and that its performance is sensitive to the validity of those splits. The paper concludes that co-training can outperform other methods when an independent and redundant feature split exists, but that it is sensitive to violations of the feature-independence and compatibility assumptions.
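The co-training loop summarized above can be sketched as follows. This is a minimal illustrative sketch in the spirit of Blum and Mitchell's algorithm, not the paper's implementation: the two "views" are single numeric features, the per-view learner is a toy midpoint-of-class-means threshold classifier, and all function names and parameters are assumptions for illustration.

```python
# Hedged sketch of co-training: one classifier per view; each round, every
# classifier labels the unlabeled examples it is most confident about and
# adds them to the shared labeled pool. Toy 1-D threshold learners stand in
# for the paper's naive Bayes text classifiers.

def train_threshold(examples):
    """Fit a threshold classifier on one numeric feature: predict 1 if
    x >= midpoint of the two class means. Assumes both classes present."""
    m0 = [x for x, y in examples if y == 0]
    m1 = [x for x, y in examples if y == 1]
    return (sum(m0) / len(m0) + sum(m1) / len(m1)) / 2

def predict(t, x):
    return 1 if x >= t else 0

def confidence(t, x):
    # Distance from the decision boundary as a crude confidence score.
    return abs(x - t)

def co_train(labeled, unlabeled, rounds=5, per_round=2):
    """labeled: list of ((view1, view2), y); unlabeled: list of (view1, view2).
    Returns the final threshold for each view."""
    labeled, unlabeled = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not unlabeled:
            break
        # Train one classifier per view on the current labeled pool.
        t1 = train_threshold([(v1, y) for (v1, v2), y in labeled])
        t2 = train_threshold([(v2, y) for (v1, v2), y in labeled])
        # Each classifier nominates its most confident unlabeled examples.
        for t, view in ((t1, 0), (t2, 1)):
            unlabeled.sort(key=lambda ex: -confidence(t, ex[view]))
            for ex in unlabeled[:per_round]:
                labeled.append((ex, predict(t, ex[view])))
            unlabeled = unlabeled[per_round:]
    return (train_threshold([(v1, y) for (v1, v2), y in labeled]),
            train_threshold([(v2, y) for (v1, v2), y in labeled]))
```

The key design choice, per the paper's analysis, is that each view's classifier teaches the other: labels assigned using view 1 improve the training pool available to view 2's classifier, which is only beneficial when the two views are (approximately) conditionally independent given the class.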
The paper also discusses the potential for applying co-training to datasets without a natural feature split by creating artificial splits, and suggests that future research should focus on improving the robustness of co-training algorithms to violations of their assumptions.
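For contrast, self-training, the single-view hybrid mentioned above, dispenses with feature splits entirely: one classifier repeatedly labels its own most confident unlabeled examples and retrains. The sketch below uses the same toy midpoint-threshold learner; all names are illustrative assumptions, not the paper's implementation.

```python
# Hedged sketch of self-training: a single classifier bootstraps from its
# own most confident predictions, with no feature split required.

def fit_threshold(examples):
    """Midpoint-of-class-means threshold on one numeric feature.
    Assumes both classes are present in the pool."""
    m0 = [x for x, y in examples if y == 0]
    m1 = [x for x, y in examples if y == 1]
    return (sum(m0) / len(m0) + sum(m1) / len(m1)) / 2

def self_train(labeled, unlabeled, rounds=5, per_round=1):
    """labeled: list of (x, y); unlabeled: list of x. Returns final threshold."""
    labeled, unlabeled = list(labeled), list(unlabeled)
    for _ in range(rounds):
        if not unlabeled:
            break
        t = fit_threshold(labeled)
        # Most confident = farthest from the decision boundary.
        unlabeled.sort(key=lambda x: -abs(x - t))
        for x in unlabeled[:per_round]:
            labeled.append((x, 1 if x >= t else 0))
        unlabeled = unlabeled[per_round:]
    return fit_threshold(labeled)
```

Because self-training has no second view to correct its mistakes, an early mislabeling can reinforce itself, which is one way to read the paper's finding that split-based methods win when a valid split exists.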