601.466/666
Information Retrieval and Web Agents (Spring 2024)
Instructor:
Survey, Piazza, Slides, Final Exam and Help Sessions:
Syllabus:
Textbooks:
- Manning, P. Raghavan and H. Schuetze, Introduction to Information Retrieval, Cambridge
University Press, 2008. PRIMARY TEXT
-
R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval,
Addison Wesley Longman, 1999.
- C. Wong. Web Client Programming. O'Reilly and Associates, 1997. (This web link should contain the complete text for the book, which is out-of-print)
- W. Frakes and R. Baeza-Yates. Information Retrieval: Data
Structures and Algorithms. Englewood Cliffs, N.J. : Prentice Hall, 1992.
-
F. Cheong, Internet Agents: Spiders, Wanderers, Brokers, and Bots.
Indianapolis, IN : New Riders, 1996.
Assignments:
Resources:
Topics:
- Information Retrieval - Topics include a
comprehensive study of current document retrieval
models, mail/news routing and filtering, document clustering,
automatic indexing, query expansion,
relevance feedback,
user modelling,
and usage pattern analysis.
- Information Extraction and Visualization - This segment
of the course will focus on additional language
processing steps for template filling and information extraction
from retrieved documents, including
sense tagging, semantic template filling and summarization, as well as information visualization.
Emphasis will be placed on recent, primarily statistical methods.
- Web Agents and WWW Applications - The final
segment of the course will explore current issues in
information retrieval and data mining on the World Wide Web.
It will focus on case studies of web agents, spiders, robots
and search engines,
exploring both their practical implementation and the
economic and legal issues surrounding their use.
One of the hot technologies of the 21st century!