Queries and Concept Learning

Queries and Concept Learning

1988 | DANA ANGLUIN
This paper explores the use of queries in concept learning, focusing on various types of queries such as membership, equivalence, subset, superset, disjointness, and exhaustiveness. It discusses efficient learning methods for formal domains like regular languages, context-free languages, pattern languages, and propositional formulas. The paper also compares equivalence queries with Valiant's criterion of probably approximately correct (PAC) identification under random sampling. The paper introduces a formal framework for studying the power of different query types in concept learning. It presents lower bound techniques and discusses the necessity of different queries for exact identification. For example, it shows that exhaustive search may require up to 2^n - 1 queries in the worst case, while a logarithmic strategy using majority vote can reduce the number of queries. The paper also examines the relationship between equivalence queries and stochastic equivalence testing, and how PAC-identification can be achieved using sampling oracles. It provides examples of learning algorithms for specific domains, such as k-CNF and k-DNF formulas, and discusses the efficiency of these methods. The paper further explores the use of membership queries in learning systems and their role in identifying monotone DNF formulas. It also discusses subset and superset queries, and how they can be used to identify pattern languages. Disjointness queries are analyzed in the context of debugging Prolog programs and identifying k-CNF formulas. Finally, the paper presents a lower bound for all six query types, demonstrating that N - 1 queries are necessary for certain domains, even with the full set of query types. This highlights the importance of query types in the efficiency and correctness of learning algorithms.This paper explores the use of queries in concept learning, focusing on various types of queries such as membership, equivalence, subset, superset, disjointness, and exhaustiveness. It discusses efficient learning methods for formal domains like regular languages, context-free languages, pattern languages, and propositional formulas. The paper also compares equivalence queries with Valiant's criterion of probably approximately correct (PAC) identification under random sampling. The paper introduces a formal framework for studying the power of different query types in concept learning. It presents lower bound techniques and discusses the necessity of different queries for exact identification. For example, it shows that exhaustive search may require up to 2^n - 1 queries in the worst case, while a logarithmic strategy using majority vote can reduce the number of queries. The paper also examines the relationship between equivalence queries and stochastic equivalence testing, and how PAC-identification can be achieved using sampling oracles. It provides examples of learning algorithms for specific domains, such as k-CNF and k-DNF formulas, and discusses the efficiency of these methods. The paper further explores the use of membership queries in learning systems and their role in identifying monotone DNF formulas. It also discusses subset and superset queries, and how they can be used to identify pattern languages. Disjointness queries are analyzed in the context of debugging Prolog programs and identifying k-CNF formulas. Finally, the paper presents a lower bound for all six query types, demonstrating that N - 1 queries are necessary for certain domains, even with the full set of query types. This highlights the importance of query types in the efficiency and correctness of learning algorithms.
Reach us at info@study.space