Author ORCID Identifier
0000-0002-9829-8049
Document Type
Conference Paper
Disciplines
Statistics
Abstract
This paper investigates the determinants of option bid–ask spreads using machine learning techniques. We analyze a cross-sectional dataset of Apple Inc. (AAPL) call options, focusing on the relative bid–ask spread as the target variable. By comparing linear models with ensemble methods such as Random Forests and XGBoost, we find that nonlinear machine learning methods significantly outperform traditional OLS regression. The most influential factors are moneyness, implied volatility, and time to expiration, while volume and open interest have limited predictive power. Results suggest that spreads are driven by a mix of market microstructure dynamics, capital constraints, and regulatory requirements such as FINRA Rule 4210. Our findings highlight the value of ML in financial microstructure modeling and offer insights relevant to traders, regulators, and liquidity providers.
DOI
https://doi.org/10.21427/wsje-3269
Recommended Citation
Byrne, Brian and Shang, Qianru, "Analyzing Option Chain Bid–Ask Spreads with Machine Learning" (2025). SAML-25 Workshop on Statistical and Machine Learning. 3.
https://arrow.tudublin.ie/saml/3
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 International License.
Publication Details
Statistical and Machine Learning: Methods and Applications (SAML-25) on June 5th and 6th, 2025 at TU Dublin, Ireland.
doi:10.21427/wsje-3269