期刊论文详细信息
ETRI Journal
Dependency Structure Applied to LanguageModeling for Information Retrieval
关键词: dependency structure;    information retrieval;    term dependency;    Language model;   
Others  :  1185411
DOI  :  10.4218/etrij.06.0105.0020
PDF
【 摘 要 】

In this paper, we propose a new language model, namely, a dependency structure language model, for information retrieval to compensate for the weaknesses of unigram and bigram language models. The dependency structure language model is based on the first-order dependency model and the dependency parse tree generated by a linguistic parser. So, long-distance dependencies can be naturally captured by the dependency structure language model. We carried out extensive experiments to verify the proposed model, where the dependency structure model gives a better performance than recently proposed language models and the Okapi BM25 method, and the dependency structure is more effective than unigram and bigram in language modeling for information retrieval.

【 授权许可】

   

【 预 览 】
附件列表
Files Size Format View
20150520110956964.pdf 1KB PDF download
【 参考文献 】
  • [1]D. Miller, T. Leek, and R. M. Schwartz, "A Hidden Markov Model Information Retrieval System," Proc. 22nd Annual Int’l ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999, pp. 214?222.
  • [2]J. M. Ponte and W. B. Croft, "A Language Modeling Approach to Information Retrieval," Proc. 21st Annual Int’l ACM SIGIR Conf. on Research and Development in Information Retrieval, 1998, pp. 275?281.
  • [3]F. Song and W. B. Croft, "A General Language Model for Information Retrieval (Poster Abstract)," Proc. of the 22nd Annual Int’l ACM SIGIR Conf. on Research and Development in Information Retrieval, 1999, pp. 279?280.
  • [4]C. Zhai and J. Lafferty, "A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval," Research and Development in Information Retrieval, 2001, pp. 334?342.
  • [5]E. Charniak, "Immediate-Head Parsing for Language Models," Proc. Thirty-Ninth Annual Meeting of the Association for Computational Linguistics and Seventeenth Int’l Conf. on Computational Linguistics, 2001, pp. 116?123.
  • [6]C. Chelba and F. Jelinek, "Exploiting Syntactic Structure for Language Modeling," Proc. Thirty-Sixth Annual Meeting of the Association for Computational Linguistics and Seventeenth Int’l Conf. Computational Linguistics, San Francisco, California, 1998,
  • [7]B. Roark, "Probabilistic Top-Down Parsing and Language Modeling," Computational Linguistics, vol. 27, no. 2, June 2001, 249?276.
  • [8]C. V. Rijsbergen, Information retrieval, Butterworths, 1979.
  • [9]J. Gao, J.-Y. Nie, G. Wu, and G. Cao, "Dependence Language Model for Information Retrieval," Proc. 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2004, pp. 170?177.
  • [10]R. Duda and P. Hart, Pattern Classification and Scene Analysis, A Wiley-Interscience Publication, 1973.
  • [11]C. Chow and C. Liu, "Approximating Discrete Probability Distributions with Dependence Trees," IEEE Transactions on Information Theory, vol. IT-14, May 1968, 462?467.
  • [12]D. Hays, "Dependency Theory: Formalism and Some Observations," Language, vol. 40, no. 4, 1964, 511?525.
  • [13]D. Lin, "Principa - An Efficient, Broad-Coverage, Principle Based Parser," Proc. Fifteenth International Conference on Computational Linguistics, COLING-ACL, 1994, pp. 109?126.
  • [14]MINIPAR: 1998. http://www.cs.ualberta.ca/_lindek/minipar.htm.
  • [15]S. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, and M. Gatford, "Okapi at TREC-3," Proc. Second Text Retrieval Conf. (TREC-3), 1995.
  文献评价指标  
  下载次数:9次 浏览次数:22次