科技报告详细信息
Identification of Threats Using Linguistics-Based Knowledge Extraction.
Chew, P. A.
Technical Information Center Oak Ridge Tennessee
关键词: Linguistics;    Threats;    Hypothesis;    Learning;    Sabotage;   
RP-ID  :  DE2008940522
学科分类:工程和技术(综合)
美国|英语
来源: National Technical Reports Library
PDF
【 摘 要 】

One of the challenges increasingly facing intelligence analysts, along with professionals in many other fields, is the vast amount of data which needs to be reviewed and converted into meaningful information, and ultimately into rational, wise decisions by policy makers. The advent of the world wide web (WWW) has magnified this challenge. A key hypothesis which has guided us is that threats come from ideas (or ideology), and ideas are almost always put into writing before the threats materialize. While in the past the 'writing' might have taken the form of pamphlets or books, today's medium of choice is the WWW, precisely because it is a decentralized, flexible, and low-cost method of reaching a wide audience. However, a factor which complicates matters for the analyst is that material published on the WWW may be in any of a large number of languages. In 'Identification of Threats Using Linguistics-Based Knowledge Extraction', we have sought to use Latent Semantic Analysis (LSA) and other similar text analysis techniques to map documents from the WWW, in whatever language they were originally written, to a common language-independent vector-based representation. This then opens up a number of possibilities.

【 预 览 】
附件列表
Files Size Format View
DE2008940522.pdf 432KB PDF download
  文献评价指标  
  下载次数:13次 浏览次数:32次