Conference Paper Details
2018 2nd International Conference on Artificial Intelligence Applications and Technologies
Convolutional Neural Network Based Structured Data Field Granularity Classification Method
Computer Science
Wu, Pin^1 ; Zhou, Quan^1 ; Wei, Qiu^1 ; Lei, Zhidan^1 ; Li, Xiaoqiang^1
School of Computer Engineering and Science, Shanghai University, No. 99 Shangda Road, Baoshan District, Shanghai, China^1
Keywords: Automated classification; Classification framework; Classification methods; Classification technology; Convolutional neural network; Enterprise decision-making; Heterogeneous sources; Medium sized enterprise
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/435/1/012033/pdf
DOI  :  10.1088/1757-899X/435/1/012033
Subject classification: Computer Science (General)
Source: IOP
【 Abstract 】

A data warehouse provides data support for enterprise decision-making and online analysis. Building one requires integrating data from many heterogeneous source systems, and during integration this data must be classified and assigned to different topics. The diversity of source systems in large and medium-sized enterprises makes automated classification at field-level granularity difficult, and the accuracy of previous methods does not satisfy users. This paper therefore proposes a neural network-based classification technique that classifies data at field granularity. The method uses sampling to construct the features of each field and introduces a classification framework for database fields built on a convolutional neural network (CNN). It achieves 89% accuracy on the dimension tables of TPC-DS and 93% accuracy on real-world data. The method was validated in the actual environments of three banks in China and achieved satisfactory results.
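
The abstract does not say how sampled field values are turned into CNN input, so the following is only a minimal sketch of one plausible approach: sample a fixed number of values from a field and encode each as a row of character indices, giving a fixed-size matrix that a convolutional network could consume. The alphabet, the sample size, the maximum length, and every function name here are assumptions made for illustration, not details taken from the paper.

```python
import random

# Hypothetical settings; the abstract does not specify an alphabet or sizes.
ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789-_.@/: "
CHAR_TO_INDEX = {ch: i + 1 for i, ch in enumerate(ALPHABET)}  # 0 is padding
UNKNOWN_INDEX = len(ALPHABET) + 1  # shared index for characters outside ALPHABET


def sample_field_values(values, sample_size, seed=0):
    """Draw up to sample_size non-empty values from one database field."""
    non_empty = [str(v) for v in values if v is not None and str(v) != ""]
    if len(non_empty) <= sample_size:
        return non_empty
    return random.Random(seed).sample(non_empty, sample_size)


def encode_value(value, max_len):
    """Map one value to max_len character indices, truncated or zero-padded."""
    indices = [CHAR_TO_INDEX.get(ch, UNKNOWN_INDEX) for ch in value.lower()[:max_len]]
    return indices + [0] * (max_len - len(indices))


def build_field_matrix(values, sample_size=8, max_len=16, seed=0):
    """Return a sample_size x max_len matrix of character indices for one field."""
    sampled = sample_field_values(values, sample_size, seed)
    rows = [encode_value(v, max_len) for v in sampled]
    # Pad with all-zero rows so every field yields the same matrix shape.
    while len(rows) < sample_size:
        rows.append([0] * max_len)
    return rows
```

Calling `build_field_matrix(column_values)` on each field would produce equally shaped inputs regardless of how many rows the source table holds, which is the property a fixed-input classifier needs. How the paper's framework actually builds its features may differ.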
