EN

论文

当前位置: 首页 > 科学研究 > 科研成果 > 论文 > 正文

Aggregating multiple types of complex data in stock market prediction: A model-independent framework

来源: | 发布时间:2021-03-05| 点击:

作者:Wang, HW (Wang, Huiwen)[ 1,2 ] ; Lu, S (Lu, Shan)[ 1,3 ] ; Zhao, JC (Zhao, Jichang)[ 1,2 ]

KNOWLEDGE-BASED SYSTEMS

卷: 164

页: 193-204

DOI: 10.1016/j.knosys.2018.10.035

出版年: JAN 15 2019

文献类型:Article

摘要

The increasing richness in the volume and types of data in the financial domain provides unprecedented opportunities for understanding the stock market more comprehensively and makes price predictions more accurate than before. However, this situation also brings challenges to classic statistical approaches since these models might be constrained to a certain type of data. Aiming to aggregate information from different sources and to offer type-free capability to existing models, a framework for predicting the stock market in scenarios with mixed data, including scalar data, compositional data (pie-like) and functional data (curve-like), is established. The presented framework is model-independent because it serves as an interface to multiple types of data and can be combined with various prediction models. Moreover, the framework is proven to be effective through numerical simulations. For price prediction, we incorporate the trading volume (scalar data), intraday return series (functional data), and investors' emotions from social media (compositional data) through the framework to competently forecast the market trend at opening on the next day. The strong explanatory power of the framework is further demonstrated. Specifically, the intraday returns are found to impact the following opening prices differently between a bearish market and a bullish market. Additionally, it is not at the beginning of the bearish market but rather the subsequent period in which the investors' "fear" becomes indicative. This framework would help to easily extend existing prediction models to scenarios with multiple types of data and to provide a more systemic understanding of the stock market. (C) 2018 Elsevier B.V. All rights reserved.

关键词

作者关键词:Stock market; Machine learning; Sentiment analysis; Complex data; Heterogeneous data; Data aggregation

KeyWords Plus:REFINEMENT DOMAIN ADAPTATION; FUZZY TIME-SERIES; COMPOSITIONAL ANALYSIS; LINEAR-MODEL; TRANSFORMATIONS; REGRESSION; SELECTION; PATTERNS; IMPACT; NEWS

通讯作者地址:

Beihang University Beihang Univ, Sch Econ & Management, Beijing, Peoples R China.

通讯作者地址: Zhao, JC (通讯作者)