
《现代信息检索(英文版·第2版)》详细介绍了信息检索的所有主要概念和技术,以及有关信息检索方面的所有新变化,使读者既可以对现代信息检索有一个全面的了解,又可以获取现代信息检索所有关键主题的详细知识。《现代信息融合技术在组合导航中的应用》的主要内容由信息检索领域的代表人物Baeza-Yares和]Ribeiro-Neto编著,对于那些希望深入研究关键领域的读者,书中还提供了由其他主要研究人员编写的关于特殊主题的发展现状。与上一版相比,本版在内容和结构上都有大量调整、更新和充实,其中新增内容在60%到70%左右。具体更新情况如下新增了文本分类、网络信息爬取、结构化文本检索和企业搜索等章节,以及关于开源搜索的一个附录。·全面改写了用户界面、多媒体检索和数字图书馆等内容。拓展了一些章节,介绍了信息检索方面的新的重要进展,如语言模型、新的评价方法、查询的特点、基于聚类和分布式信息检索等。
编辑推荐
《现代信息检索(英文版第2版)》:经典原版书库 作者简介
作者:(西班牙)贝泽-耶茨(Ricardo Baeza-Yates) (巴西)里贝罗-内特(Berthier Ribeiro-Neto)
Ricardo Baeza-Yates,于加拿大滑铁卢大学获得计算机科学博士学位,现为雅虎欧洲和拉丁美洲研究院副总裁,主管雅虎在巴塞罗纳(西班牙)和圣地亚哥(智利)(的研究中心,并监管海法研究中心。他曾担任智利计算机科学学会主席、智利大学计算机科学系Web研究中心主任、ICREA教授,并且他还在巴塞罗纳法布拉大学创立了信息与通信技术系Web研究组。现在他仍是智利大学和法布拉大学的兼职教授。他的主要研究方向为算法与数据结构、信息检索、用户界面以及可视化在数据库中的应用等。
Berthier Ribeiro-Neto,于加利福尼亚大学洛杉矶分校获得计算机科学博士学位,现任巴西Mitqas Gerais联合大学计算机科学系副教授,同时也是ACM、ASIS及IEEE会员。他的主要研究方向是信息检索系统、数字图书馆、Web界面及视频点播。 目录
Preface to the Second Edition
Preface to the First Edition
Authors' Acknowledgements to the Second Edition
Authors' Acknowledgements to the First Edition
Publishers' Acknowledgements
1Introduction
1.1Information Retrieval
1.1.1Early Developments
1.1.2Information Retrieval in Libraries and Digital Libraries
1.1.3IR at the Center of the Stage
1.2The IR Problem
1.2.1The User's Task
1.2.2Information versus Data Retrieval
1.3The IR System
1.3.1Software Architecture of the IR System
1.3.2The Retrieval and Ranking Processes
1.4The Web
1.4.1A Brief History
1.4.2The e-Publishing Era
1.4.3How the Web Changed Search
1.4.4Practical Issues on the Web
1.5Organization of the Book
1.5.1Focus of the Book
1.5.2Book Contents
1.6The Book Web Site: A Teaching Resource
1.7Bibliographic Discussion
User Interfaces for Search
by Marti Hearst
2.1Introduction
2.2How People Search
2.2.1Information Lookup versus Exploratory Search
2.2.2Classic versus Dynamic Model of Information Seeking .
2.2.3Navigation versus Search
2.2.4Observations cf the Search Process
2.3Search Interfaces Today
2.3.1Getting Started
2.3.2Query Specification
2.3.3Query Specification Interfaces
2.3.4 Retrieval Results Display
2.3.5Query Reformulation
2.3.6Organizing Search Results
2.4Visualization in Search Interfaces
2.4.1Visualizing Bcolesn Syntax
2.4.2Visualizing Query Terms within Retrieval Results
2.4.3Visualizing Relationships Among Words and Documents
2.4.4Visualization for Text Mining
2.5Design and Evaluation of Search Interfaces
2.6Trends and Research Issues
2.7Bibliographic Discussion
Modeling
3.1IR Models
3.1.1Modeling and Rankirg
3.1.2Characterization cf an IR Model
3.1.3A Taxonomy of IR Models
3.2Classic Information Retrieval
3.2.1Basic Concepts
3.2.2The Boolean Model
3.2.3Term Weighting
3.2ATF-IDF Weights
3.2.5Document Length Normalization
3.2.6The Vector Model
3.2.7The Probabilistic Mcdel
3.2.8Brief Comparison of Classic Models
3.3Alternative Set Theoretic Models
3.3.1Set-Based Model
3.3.2Extended Boolean Model
3.3.3Fuzzy Set Model
3.4Alternative Algebraic Models
3.4.1Generalized Vector Space Model
3.4.2Latent Semantic Indexing Moo'el
3.4.3Neural Netwozk Model
3.5Alternative Probabilistic Mcdels
3.5.1BM25
3.5.2Language Models
3.5.3Divergence from Randomness
3.5.4Bayesian Network Models
3.6Other Models
……
4Retrieval Evaluation
5Relevance Feedback and Query Expansion
6Documents:Languages Properties
7Queries:Languages Properties
8Text Classiftcation
9Indexiong and Searching
10Parallel and Distributed IR
11Web Retrieval
12Web Crawling
13Structured Text Retrieval
14Multimedia Information retrieval
15Enterprise Search
16Library Systems
17Digital Libraries 文摘
版权页:
插图:
Libraries were among the first institutions to adopt IR systems for retrieving information. Usually, library systems were initially developed by academic institutions and later by commercial vendors. In the first generation, such systems consisted of anautomation of existing processes such as card catalogs searching, restricted to authornames and titles. In the second generation, increased search functionality was added to include subject headings, keywords, and query operators. In the third generation,which is currently being deployed, the focus has been on improved graphical in terfaces,electronic f
rms, hypertext features, and open system architectures.
Traditional library management system vendors include Endeavor InformationSystems Inc., Innovative Interfaces Inc., and EOS International. Among systems developed with a research focus, we distinguish MELVYL developed by the California Digital Library at University of California, and the Cheshire system developed originally at UC Berkeley and lately in cooperation with the University of Liverpool.Further details on these library systems can be found in Chapter 16.1.1.3IR at the Center of the Stage Despite its maturity, until recently, IR was seen as a narrow area of interest restrictedmainly to librarians and infrmation experts.Such a tendentious vision prevailed for many years, despite the rapid dissemination, among users of modern personalcomputers, of IR tools for multimedia and hypertext applications. In the beginning of the 1990s, a single fact changed once and for all these perceptionsthe in troductionof the World Wide Web.
The Web, invented in 1989 by Tim Berners-Lee, has become a universal repository of human knowedge and culture. Its success is based on the conception of a standarduser interface which is always the same, no matter the computational environmentused to run the interface, and which allows any user to create their own documents.As a result, millions of users have created billions of documents that compose the largest human repository of knowledge in history. An immediate consequence is that finding useful information on the Web is not always a simple task and usually requiresposing a query to a search engine, i.e., running a search. And search is all aboutIR and its technologies. Thus, a hnost overnight, IR has gained a place with other technologies at the center of the stage.
ISBN | 9787111331742 |
---|---|
出版社 | 机械工业出版社 |
作者 | 贝泽-耶茨(Ricardo Baeza-Yates) |
尺寸 | 32 |