40324 Modern Information Retrieval

Course Description

Information retrieval is the process through which a computer system can respond to a user's query for text-based information on a specific topic. Information retrieval was one of the first and remains one of the most important problems in the domain of natural language processing. Web search is the application of information retrieval techniques to the largest corpus of text anywhere and it is the area in which most people interact with information retrieval systems most frequently. In this course, we will cover basic and advanced techniques for building text-based information systems, including efficient text indexing, Boolean and vector-space retrieval models, evaluation and interface, issues, information retrieval techniques for the web, including crawling, link-based algorithms, and metadata usage, document clustering and classification, traditional and machine learning-based ranking approaches, questiona and answering systems, and recommender systems.

Course Information

Required Texts

  1. [MRS] Christopher D. Manning and Prabhakar Raghavan, and Hinrich Schutze, Introduction to Information Retrieval, Cambridge University Press, 2008.

  2. [HNG] Hang Li, Learning to Rank for Information Retrieval and Natural Language Processing, Morgan & Claypool, 2011.

  3. [MC] Bhaskar Mitra and Nick Craswell, An Introduction to Neural Information Retrieval, Foundations and Trends in Information Retrieval, Vol. 13, No. 1, pp. 1-126, 2018.

Grading Policy

  1. 30%: Mid-term exam (1401/09/14).

  2. 30%: Final exam (1401/10/27).

  3. 30%: Homeworks.

  4. 10%: Quiz.

Lecture Schedule


Lecture Lecture Date Topics Related Readings and Links Homeworks & Assignments Quizes
1 1401-07-02Introduction Chapter 1 of MRS
2
2
3
1401-07-04
1401-07-23
1401-07-25
Boolean information retrieval
and document preprocessing
Chapters 1 & 2 of MRS
4 1401-08-30 Dictionaries and
tolerant retrieval
Chapter 3 of MRS
5 1401-08-02 Index Construction Chapter 4 of MRS
6 1401-08-07 Index compression Chapter 5 of MRS
7 1401-08-09 Vector space modelChapter 6 of MRS
8 1401-08-14 Scores in a complete search system Chapter 7 of MRS
9 1401-09-16 Evaluation in information retrieval Chapter 8 of MRS
101401-08-21Relevance feedback and query expansion Chapter 9 of MRS
111401-08-23Probabilistic Information Retrieval Chapter 11 of MRS
121401-08-28Language Models for Information RetrievalChapter 12 of MRS
13
14
15
16
1401-08-30
1401-09-05
1401-09-07
Probabilistic text classification
Vector space text classification
Chapters 13-15 of MRS
171401-09-12 Text clustering Chapters 16 & 17 of MRS
18 1401-09-14 Mid-term exam
19
20
1401-09-19
1401-09-21
Text clustering Chapters 16 & 17 of MRS
201401-09-26Dimensionality reduction and feature selection Chapter 13 of MRS
211401-09-28 Learning to rank Chapters 1-3 of HNG
221401-10-03 Latent Semantic Indexing Chapter 18 of MRS
23
24
1401-10-05
1401-10-10
Web cralwing and searchChapters 19 & 20 of MRS
251401-10-12 Link Analysis Chapter 21 of MRS
26
27
1401-10-17
1401-10-19
Neural information retrieval
Some other related topics
MC
1401-10-27 Final exam At 9-12