Author ORCID Identifier

0000-0002-9829-8049

Document Type

Conference Paper

Disciplines

Statistics

Publication Details

Statistical and Machine Learning: Methods and Applications (SAML-25) on June 5th and 6th, 2025 at TU Dublin, Ireland.

doi:10.21427/wsje-3269

Abstract

This paper investigates the determinants of option bid–ask spreads using machine learning techniques. We analyze a cross-sectional dataset of Apple Inc. (AAPL) call options, focusing on the relative bid–ask spread as the target variable. By comparing linear models with ensemble methods such as Random Forests and XGBoost, we find that nonlinear machine learning methods significantly outperform traditional OLS regression. The most influential factors are moneyness, implied volatility, and time to expiration, while volume and open interest have limited predictive power. Results suggest that spreads are driven by a mix of market microstructure dynamics, capital constraints, and regulatory requirements such as FINRA Rule 4210. Our findings highlight the value of ML in financial microstructure modeling and offer insights relevant to traders, regulators, and liquidity providers.

DOI

https://doi.org/10.21427/wsje-3269

Creative Commons License

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 International License.


Share

COinS