Introduction to Information Retrieval

Introduction to Information Retrieval
Author :
Publisher : Cambridge University Press
Total Pages :
Release :
ISBN-10 : 9781139472104
ISBN-13 : 1139472100
Rating : 4/5 (04 Downloads)

Synopsis Introduction to Information Retrieval by : Christopher D. Manning

Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. It gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collections. All the important ideas are explained using examples and figures, making it perfect for introductory courses in information retrieval for advanced undergraduates and graduate students in computer science. Based on feedback from extensive classroom experience, the book has been carefully structured in order to make teaching more natural and effective. Slides and additional exercises (with solutions for lecturers) are also available through the book's supporting website to help course instructors prepare their lectures.

Document Retrieval Index

Document Retrieval Index
Author :
Publisher :
Total Pages : 640
Release :
ISBN-10 : UOM:39015055037413
ISBN-13 :
Rating : 4/5 (13 Downloads)

Synopsis Document Retrieval Index by :

Information Retrieval: Uncertainty and Logics

Information Retrieval: Uncertainty and Logics
Author :
Publisher : Springer Science & Business Media
Total Pages : 362
Release :
ISBN-10 : 0792383028
ISBN-13 : 9780792383024
Rating : 4/5 (28 Downloads)

Synopsis Information Retrieval: Uncertainty and Logics by : Fabio Crestani

A collection of papers proposing, developing, and implementing logical IR models. After an introductory chapter on non-classical logic as the appropriate formalism with which to build IR models, papers are divided into groups on three approaches: logical models, uncertainty models, and meta-models. Topics include preferential models of query by navigation, a logic for multimedia information retrieval, logical imaging and probabilistic information retrieval, and an axiomatic aboutness theory for information retrieval. Can be used as a text for a graduate course on information retrieval or database systems, and as a reference for researchers and practitioners in industry. Annotation copyrighted by Book News, Inc., Portland, OR

Indexing and Retrieval of Non-Text Information

Indexing and Retrieval of Non-Text Information
Author :
Publisher : Walter de Gruyter
Total Pages : 440
Release :
ISBN-10 : 9783110260588
ISBN-13 : 3110260581
Rating : 4/5 (88 Downloads)

Synopsis Indexing and Retrieval of Non-Text Information by : Diane Rasmussen Neal

The scope of this volume will encompass a collection of research papers related to indexing and retrieval of online non-text information. In recent years, the Internet has seen an exponential increase in the number of documents placed online that are not in textual format. These documents appear in a variety of contexts, such as user-generated content sharing websites, social networking websites etc. and formats, including photographs, videos, recorded music, data visualizations etc. The prevalence of these contexts and data formats presents a particularly challenging task to information indexing and retrieval research due to many difficulties, such as assigning suitable semantic metadata, processing and extracting non-textual content automatically, and designing retrieval systems that "speak in the native language" of non-text documents.

Information Retrieval

Information Retrieval
Author :
Publisher : MIT Press
Total Pages : 633
Release :
ISBN-10 : 9780262528870
ISBN-13 : 0262528878
Rating : 4/5 (70 Downloads)

Synopsis Information Retrieval by : Stefan Buttcher

An introduction to information retrieval, the foundation for modern search engines, that emphasizes implementation and experimentation. Information retrieval is the foundation for modern search engines. This textbook offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus—a multiuser open-source information retrieval system developed by one of the authors and available online—provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

Clustering and Information Retrieval

Clustering and Information Retrieval
Author :
Publisher : Springer Science & Business Media
Total Pages : 331
Release :
ISBN-10 : 9781461302278
ISBN-13 : 1461302277
Rating : 4/5 (78 Downloads)

Synopsis Clustering and Information Retrieval by : Weili Wu

Clustering is an important technique for discovering relatively dense sub-regions or sub-spaces of a multi-dimension data distribution. Clus tering has been used in information retrieval for many different purposes, such as query expansion, document grouping, document indexing, and visualization of search results. In this book, we address issues of cluster ing algorithms, evaluation methodologies, applications, and architectures for information retrieval. The first two chapters discuss clustering algorithms. The chapter from Baeza-Yates et al. describes a clustering method for a general metric space which is a common model of data relevant to information retrieval. The chapter by Guha, Rastogi, and Shim presents a survey as well as detailed discussion of two clustering algorithms: CURE and ROCK for numeric data and categorical data respectively. Evaluation methodologies are addressed in the next two chapters. Ertoz et al. demonstrate the use of text retrieval benchmarks, such as TRECS, to evaluate clustering algorithms. He et al. provide objective measures of clustering quality in their chapter. Applications of clustering methods to information retrieval is ad dressed in the next four chapters. Chu et al. and Noel et al. explore feature selection using word stems, phrases, and link associations for document clustering and indexing. Wen et al. and Sung et al. discuss applications of clustering to user queries and data cleansing. Finally, we consider the problem of designing architectures for infor mation retrieval. Crichton, Hughes, and Kelly elaborate on the devel opment of a scientific data system architecture for information retrieval.

Learning to Rank for Information Retrieval

Learning to Rank for Information Retrieval
Author :
Publisher : Springer Science & Business Media
Total Pages : 282
Release :
ISBN-10 : 9783642142673
ISBN-13 : 3642142672
Rating : 4/5 (73 Downloads)

Synopsis Learning to Rank for Information Retrieval by : Tie-Yan Liu

Due to the fast growth of the Web and the difficulties in finding desired information, efficient and effective information retrieval systems have become more important than ever, and the search engine has become an essential tool for many people. The ranker, a central component in every search engine, is responsible for the matching between processed queries and indexed documents. Because of its central role, great attention has been paid to the research and development of ranking technologies. In addition, ranking is also pivotal for many other information retrieval applications, such as collaborative filtering, definition ranking, question answering, multimedia retrieval, text summarization, and online advertisement. Leveraging machine learning technologies in the ranking process has led to innovative and more effective ranking models, and eventually to a completely new research area called “learning to rank”. Liu first gives a comprehensive review of the major approaches to learning to rank. For each approach he presents the basic framework, with example algorithms, and he discusses its advantages and disadvantages. He continues with some recent advances in learning to rank that cannot be simply categorized into the three major approaches – these include relational ranking, query-dependent ranking, transfer ranking, and semisupervised ranking. His presentation is completed by several examples that apply these technologies to solve real information retrieval problems, and by theoretical discussions on guarantees for ranking performance. This book is written for researchers and graduate students in both information retrieval and machine learning. They will find here the only comprehensive description of the state of the art in a field that has driven the recent advances in search engine development.

Readings in Information Retrieval

Readings in Information Retrieval
Author :
Publisher : Morgan Kaufmann
Total Pages : 614
Release :
ISBN-10 : 1558604545
ISBN-13 : 9781558604544
Rating : 4/5 (45 Downloads)

Synopsis Readings in Information Retrieval by : Karen Sparck Jones

This compilation of original papers on information retrieval presents an overview, covering both general theory and specific methods, of the development and current status of information retrieval systems. Each chapter contains several papers carefully chosen to represent substantive research work that has been carried out in that area, each is preceded by an introductory overview and followed by supported references for further reading.

Language Modeling for Information Retrieval

Language Modeling for Information Retrieval
Author :
Publisher : Springer Science & Business Media
Total Pages : 253
Release :
ISBN-10 : 9789401701716
ISBN-13 : 9401701717
Rating : 4/5 (16 Downloads)

Synopsis Language Modeling for Information Retrieval by : W. Bruce Croft

A statisticallanguage model, or more simply a language model, is a prob abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how weH simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy liInit in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling.

XML Retrieval

XML Retrieval
Author :
Publisher : Morgan & Claypool Publishers
Total Pages : 112
Release :
ISBN-10 : 9781598297867
ISBN-13 : 1598297864
Rating : 4/5 (67 Downloads)

Synopsis XML Retrieval by : Mounia Lalmas

Documents usually have a content and a structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark-up language. Nowadays, the most widely used mark-up language for representing structure is the eXtensible Mark-up Language (XML). XML can be used to provide a focused access to documents, i.e. returning XML elements, such as sections and paragraphs, instead of whole documents in response to a query. Such focused strategies are of particular benefit for information repositories containing long documents, or documents covering a wide variety of topics, where users are directed to the most relevant content within a document. The increased adoption of XML to represent a document structure requires the development of tools to effectively access documents marked-up in XML. This book provides a detailed description of query languages, indexing strategies, ranking algorithms, presentation scenarios developed to access XML documents. Major advances in XML retrival were seen from 2002 as a result of INEX, the Initiative for Evaluation of XML Retrieval. INEX, also described in this book, provided test sets for evaluating XML retrieval effectiveness. Many of the developments and results described in this book were investigated within INEX. Table of Contents: Introduction / Basic XML Concepts / Historical Perspectives / Query Languages / Indexing Strategies / Ranking Strategies / Presentation strategies / Evaluating XML Retrieval Effectiveness / Conclusions