Week1 :Unit 1: Introduction and Course Overview

·      FOA section 1.1
By reading the FOA, I learn that the book is a closer look at the process of finding out about, reserch activities that allow a decision-maker to draw on others' knowledge. And the FOA process of browsing readers can be imagined to involve three phases : 1) asking a question 2) constructing an answer 3) assessing the answer. What's more,  give the schematic of search engine.
·      IES section 1.1 and 1.2
I learned that information retriveval is concerned with representingm searching, and manipulating large collections of electronic text and other human-language data.In detail, I learn about the web search, desktop and file system search and how others IR applications works associated with the storage, manipulation, and retrieval of human-language data.Then, I learn the basic IR system architecture, the components of an IR system. What's more, the update and modify of the documents. And the two principal aspects to measuring IR system performance: efficiency and effectiveness. And the Effectiveness is more difficult to measure thatn efficiency. 
·      MIR section 1.1 - 1.4
By reading the Modern Information Retrieval, firstly, I have a glance at the development of the IR using in the Libraries and Digital Libraries. Then, I learn the IR problem: the primary goal of an IR system is to retrieve all the documents that are relevant to a suer query while retrieving as few nonrelevant document as possible. That is to say, the notion of relevance is of central importance in IR. As metioned on user, their ability is to translate their information need into a query in the language provided by the system, which is to say searching and browsing. What's more, in chapter 1.3, introducing the processes of retrieve and ranking of the documents in response to a user query. In setting up an IR system, we need to assemble the document collection, construct a crawler module. As in the retrieval and ranking processes, when given the documents of the collection, we first apply text operations to them such as eliminating stopwords, stemming, and selecting a subset of all terms for use as indexing terms. Then, the retrieved documents are ranked according to a likelihood of relevance to the user. Then, I learn the creation of web, tge advant of the e-publishing age the web changed search and the practical issues, such as security and copyright.


评论