Enhancing Financial Market Predictions: Causality-Driven Feature Selection

Wenhao Liang, Zhengyang Li, Weitong Chen

Published: 01 Jan 2024, Last Modified: 06 Feb 2025ADMA (1) 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: This paper introduces FinSen dataset that revolutionizes financial market analysis by integrating economic and financial news articles from 197 countries with stock market data. The dataset’s extensive coverage spans 15 years from 2007 to 2023 with temporal information, offering a rich, global perspective 160,000 records on financial market news. Our study leverages causally validated sentiment scores and LSTM models to enhance market forecast accuracy and reliability. Utilizing the FinSen dataset, we introduce an innovative Focal Calibration Loss, reducing Expected Calibration Error (ECE) to 3.34% with the DAN 3 model. This is not only improves prediction accuracy but also aligns probabilistic forecasts closely with real outcomes, crucial for financial sector where predicted probability is paramount. Our approach demonstrates the effectiveness of combining sentiment analysis with precise calibration techniques for trustworthy financial forecasting where the cost of misinterpretation can be high.
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview