Web Mining Research: A Survey

Web Mining Research: A Survey

July 2000 | Raymond Kosala, Hendrik Blockeel
The paper "Web Mining Research: A Survey" by Raymond Kosala provides an overview of the field of Web mining, which involves using data mining techniques to extract information from web documents and services. The authors address the confusion surrounding the term "Web mining" and propose three main categories of Web mining: Web content mining, Web structure mining, and Web usage mining. They discuss the relationship between Web mining and related fields such as information retrieval (IR), natural language processing (NLP), and machine learning. The paper also explores the connection between Web mining and the agent paradigm, where software agents are used to perform data mining tasks. The authors survey various research methods and applications in each category, highlighting the challenges and opportunities in Web mining, particularly in handling multimedia data and complex web structures. The paper concludes by discussing future research directions, including information integration and topic detection and tracking (TDT).The paper "Web Mining Research: A Survey" by Raymond Kosala provides an overview of the field of Web mining, which involves using data mining techniques to extract information from web documents and services. The authors address the confusion surrounding the term "Web mining" and propose three main categories of Web mining: Web content mining, Web structure mining, and Web usage mining. They discuss the relationship between Web mining and related fields such as information retrieval (IR), natural language processing (NLP), and machine learning. The paper also explores the connection between Web mining and the agent paradigm, where software agents are used to perform data mining tasks. The authors survey various research methods and applications in each category, highlighting the challenges and opportunities in Web mining, particularly in handling multimedia data and complex web structures. The paper concludes by discussing future research directions, including information integration and topic detection and tracking (TDT).
Reach us at info@study.space