LDRD 99-ERI-010 Final Report: Sapphire: Scalable Pattern Recognition for Large-Scale Scientific Data Mining | |
Kamath, C | |
Lawrence Livermore National Laboratory | |
关键词: Pattern Recognition; 99 General And Miscellaneous//Mathematics, Computing, And Information Science; Classification; Computers; Sapphire; | |
DOI : 10.2172/15003138 RP-ID : UCRL-ID-146978 RP-ID : W-7405-ENG-48 RP-ID : 15003138 |
|
美国|英语 | |
来源: UNT Digital Library | |
【 摘 要 】
There is a rapidly widening gap between our ability to collect data and our ability to explore, analyze, and understand the data. As a result, useful information is overlooked, and the potential benefits of increased computational and data gathering capabilities only partially realized. This problem of data overload is becoming a serious impediment to scientific advancement in areas as diverse as counter-proliferation, the Accelerated Strategic Computing Initiative (ASCI), astrophysics, computer security, and climate modeling, where vast amounts of data are collected through observations or simulations. To improve the way in which scientists extract useful information from their data, we are developing a new generation of tools and techniques based on data mining. Data mining is the semi-automated discovery of patterns, associations, anomalies, and statistically significant structures in data. It consists of two steps--in data pre-processing, we extract high-level features from the data, and in pattern recognition, we use the features to identify and characterize patterns in the data. In this project, our focus is on developing scalable algorithms for the pattern recognition task of classification. Our goal is to improve the performance of these algorithms, without sacrificing accuracy. We are demonstrating these techniques using an astronomy application, namely the detection of radio-emitting galaxies with a bent-double morphology in the FIRST survey. Our research has been incorporated into software to make it easily accessible to LLNL scientists. The author describes their accomplishments in each of these three areas.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
15003138.pdf | 1116KB | download |