Dictionary-Based Document Retrieval Project

Collaborative Research
Intelligent Use of Dictionaries for Document Retrieval


This project is sponsored by research grants from the National Science Foundation .

Principal Investigator at University at Illinois at Chicago

PI: Prof. Clement Yu
Department of Computer Science
University at Illinois at Chicago
NSF grant number: IIS-0738652

Principal Investigator at SUNY at Binghamton

PI: Prof. Weiyi Meng
Department of Computer Science
State University of New York at Binghamton
NSF grant number: IIS-0738727

Project Overview

In this project, a dictionary-based approach for document retrieval will be investigated. Recent work indicates that when the dictionaries are used in a special way, retrieval effectiveness can be improved significantly. It is important to understand the exact reasons behind the apparent success of the new approach so that general principles of utilizing dictionaries automatically can be identified and be widely used. This exploratory research project aims to investigate such principles within a given domain and demonstrate the effectiveness of the identified principles. This project is also to seek evidence that certain dictionary constructs (e.g., the frequency of use of a word in different context) can lead to significant gains in effectiveness if they are added to a dictionary. Longer-term goals of the project include the identification of domain independent principles of using different types of dictionaries in the same system, and the development of tools to add useful dictionary constructs across dictionaries and to assist users in query expansion semi-automatically when there are multiple dictionaries.

Any opinions, findings and conclusions or recomendations expressed on this sites are those of the PIs and do not necessarily reflect the views of the National Science Foundation (NSF).

Related Publications

  1. Shuang Liu, Fang Liu, Clement Yu, and Weiyi Meng. An Effective Approach to Document Retrieval via Utilizing WordNet and Recognizing Phrases. Proceedings of the 27th Annual International ACM SIGIR Conference, pp.266-272, Sheffield, UK, July 2004.
  2. Shuang Liu, Clement Yu, and Weiyi Meng. Word Sense Disambiguation in Queries. ACM International Conference on Information and Knowledge Management (CIKM2005), pp.525-532, Bremen, Germany, November 2005.
  3. Wei Zhou, Clement Yu, Neil Smalheiser, Vetle Torvik and Jie Hong. Knowledge Intensive Conceptual Retrieval and Passage Extraction of Biomedical Literature. ACM SIGIR Conference, pp.655-662, July 2007.
  4. Wei Zhang, Shuang Liu, Clement Yu, Chaojing Sun, Fang Liu, Weiyi Meng. Recognition and Classification of Noun Phrases in Queries for Effective Retrieval . ACM Sixteenth Conference on Information and Knowledge Management (CIKM), pp.711-720, Lisboa, Portugal, November 2007.
  5. Wei Zhang, Clement Yu, Weiyi Meng. Opinion Retrieval from Blogs. ACM Sixteenth Conference on Information and Knowledge Management (CIKM), pp.831-840, Lisboa, Portugal, November 2007.
  6. Wei Zhou, Clement Yu, Weiyi Meng. A System for Finding Biological Entities that Satisfy Certain Conditions from Texts. ACM 17th Conference on Information and Knowledge Management (CIKM 2008), Napa Valley, California, October 2008 (to appear).
  7. Wei Zhang, Lifeng Jia, Clement Yu, Weiyi Meng. Improve the Effectiveness of the Opinion Retrieval and Opinion Polarity Classification. ACM 17th Conference on Information and Knowledge Management (CIKM 2008), poster paper, Napa Valley, California, October 2008 (to appear).

Last change: July 15, 2008 / meng@cs.binghamton.edu