Frontiers in Energy Research
A Power Customer Data Relational Algorithm Based on Magnanimity Fuzzy Address Matching
Xiaoyang Bu [1], Peng Wu [1], Zongwei Wang [1], Jing Yang [1], Peng Jin [2]
[1] State Grid Customer Service Center, Tianjin, China
Keywords: improved simhash; multi-source heterogeneous data; address matching; data associations; digital fingerprint; data of electric client
DOI: 10.3389/fenrg.2021.674865
Source: Frontiers
【 Abstract 】
Customer addresses are short, unstructured texts. To accommodate these characteristics, a data association and fusion method for addresses was proposed. In this method, each address was mapped to a digital fingerprint by an improved Simhash technique, which effectively reduced the dimensionality of massive address data and simplified similarity matching of multi-source heterogeneous addresses. Furthermore, the weighting of the feature vector in the Simhash algorithm was improved by introducing a special weight gain. A two-level index mechanism was established from the characteristics of address divisions and the data structure of the digital fingerprints, which greatly reduced the time spent on fingerprint comparison. The experimental results showed that computational efficiency was greatly improved while the accuracy and coverage of the comparison were maintained. Through address matching across different databases, information fusion can be completed, achieving the goal of connecting power customers' demands to power grid equipment.
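As a rough illustration of the fingerprinting idea described in the abstract, the following is a minimal sketch of a generic weighted Simhash over address text, not the authors' implementation. The character bigram tokenizer, the MD5-based token hash, the 64-bit fingerprint width, and the `gain` dictionary (a hypothetical stand-in for the "special weight gain") are all assumptions made for this example; the two-level index is not shown.

```python
import hashlib
from collections import Counter
from typing import Dict, List, Optional

BITS = 64  # assumed fingerprint width; the abstract does not state one


def tokens(address: str, n: int = 2) -> List[str]:
    """Split an address into overlapping character n-grams (assumed tokenizer)."""
    s = "".join(address.lower().split())  # drop whitespace, ignore case
    if len(s) <= n:
        return [s] if s else []
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def token_hash(token: str) -> int:
    """Map a token to a stable 64-bit integer using the first 8 bytes of MD5."""
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def simhash(address: str, gain: Optional[Dict[str, float]] = None) -> int:
    """Return a 64-bit Simhash fingerprint of the address.

    `gain` is a hypothetical per-token multiplier standing in for the
    paper's weight gain; tokens not listed keep a multiplier of 1.0.
    """
    gain = gain or {}
    acc = [0.0] * BITS
    for tok, count in Counter(tokens(address)).items():
        weight = count * gain.get(tok, 1.0)
        h = token_hash(tok)
        for bit in range(BITS):
            # Each token votes +weight or -weight on every bit position.
            acc[bit] += weight if (h >> bit) & 1 else -weight
    # A bit is set in the fingerprint when its accumulated vote is positive.
    return sum(1 << bit for bit in range(BITS) if acc[bit] > 0)


def hamming(a: int, b: int) -> int:
    """Count the bit positions in which two fingerprints differ."""
    return bin(a ^ b).count("1")
```

Two addresses would then be treated as candidate matches when `hamming(simhash(a), simhash(b))` falls below a chosen threshold; the threshold value is a tuning decision that this sketch does not fix.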
【 License 】
CC BY