Semantic Web and Information Extraction | |
Ontologies as a Source for the AutomaticGeneration of Grammars for InformationExtraction Systems | |
Thierry Declerck ; Paul Buitelaar | |
Others : http://ceur-ws.org/Vol-925/paper_3.pdf PID : 27559 |
|
来源: CEUR | |
【 摘 要 】
Grammars for Natural Language Processing (NLP) applications are generally built either by linguists – on the basis of their language competence, or by automated tools applied to existing large corpora oflanguage data — using either supervised or unsupervised methods (ora combination of both). Domain knowledge usually played just a little role in this process. The increasing availability of extended knowledge representation systems, like taxonomies and ontologies, is giving the opportunity to consider new approaches to the (automated) generation of processing grammars, especially in the field of domain-oriented Information Extraction (IE). The reason for this being that most of the taxonomies and ontologies are equipped with natural language expressions included in ontology elements like labels, comments or definitions. These de facto established relations between (domain) knowledge and natural language expressions can be exploited for the automatic generation ofdomain specific NLP and IE grammars. We describe in this paper steps leading to this automation.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Ontologies as a Source for the AutomaticGeneration of Grammars for InformationExtraction Systems | 172KB | download |