| International Workshop "Advanced Technologies in Material Science, Mechanical and Automation Engineering – MIP: Engineering – 2019" | |
| Genetic algorithm based sentence packaging in natural language text generation | |
| 材料科学;机械制造;原子能学 | |
| Devyatkin, Dmitry^1 ; Isakov, Vadim^1 ; Shvets, Alexander^1 | |
| Federal Research Center Computer Science and Control, Russian Academy of Sciences, Moscow, Russia^1 | |
| 关键词: Community detection; Community detection algorithms; Fitness functions; High potential; Machine learning models; Natural language text; Semantic structures; Short texts; | |
| Others : https://iopscience.iop.org/article/10.1088/1757-899X/537/4/042003/pdf DOI : 10.1088/1757-899X/537/4/042003 |
|
| 学科分类:材料科学(综合) | |
| 来源: IOP | |
PDF
|
|
【 摘 要 】
Sentence packaging is an important task in natural language text generation which could be treated as a particular kind of a community detection problem. We propose an approach based on genetic algorithm and predictive machine learning models to advance it. The approach allows handling large ontological and semantic structures in a form of a graph to produce well-formed sentences. The results of experiments showed that the genetic algorithm optimizing the modularity measure gives comparable results to ones achieved by a traditional community detection algorithm and outperforms it on a collection of relatively short texts. The design of an approach allows for further introducing linguistic characteristics into a fitness function that gives it a high potential to increase the quality of detected packages while taking into account the specificity of the domain.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| Genetic algorithm based sentence packaging in natural language text generation | 778KB |
PDF