Financial Innovation
Novel modelling strategies for high-frequency stock trading data
Research
Ke Xu 1, Li Xing 2, Xuekui Zhang 3, Yuying Huang 4
[1] Economics Department at University of Victoria, Victoria, Canada; Mathematics and Statistics Department at University of Saskatchewan, Saskatchewan, Canada; Mathematics and Statistics Department at University of Victoria, Victoria, Canada; Mathematics and Statistics Department at University of Victoria, Victoria, Canada; Statistics and Actuarial Science at University of Waterloo, Waterloo, Canada
Keywords: High-frequency trading; Machine learning; Mid-price prediction strategy; Raw data processing; Multi-class prediction; Ensemble learning
DOI: 10.1186/s40854-022-00431-9
Received 2022-05-19; accepted 2022-11-24; publication year 2022
Source: Springer
【 Abstract 】
Full electronic automation in stock exchanges has recently become popular, generating high-frequency intraday data and motivating the development of near real-time price forecasting methods. Machine learning algorithms are widely applied to stock mid-price prediction. How raw data are processed into inputs for prediction models (e.g., data thinning and feature engineering) can substantially affect the performance of the prediction methods, yet researchers rarely discuss this topic. This motivated us to propose three novel modelling strategies for processing raw data. We illustrate how these strategies improve forecasting performance by analyzing high-frequency data of the Dow Jones 30 component stocks. In these experiments, our strategies often lead to statistically significant improvements in prediction. The three strategies improve the F1 scores of the SVM models by 0.056, 0.087, and 0.016, respectively.
【 License 】
CC BY
© The Author(s) 2023