In TREC 2008, the team from the State University of New York at Buffalo participated in the Legal track and the Blog track. For the Legal track, we worked on the interactive search task using the Web-based Legacy Tobacco Document Library Boolean search system. Our experiment achieved reasonable precision but suffered significantly from low recall. These results, together with the appeal and adjudication results, suggest that the concept of document relevance in legal e-discovery deserves further investigation. For the Blog distillation task, our official runs were based on a reduced document model in which only text from the several most content-bearing fields was indexed. This approach yielded encouraging retrieval effectiveness while significantly decreasing the index size. We also studied query-independent/query-dependent and link-based features for finding relevant feeds. For the Blog opinion and polarity tasks, we mainly investigated the usefulness of opinionated words contained in the SentiGI lexicon. Our experimental results showed that the effectiveness of the technique is quite
TREC 2008 at the University at Buffalo: Legal and Blog Track