Exploratory Analysis of Crash Determinants
This study presents an exploratory analysis of the key factors contributing to fatal and severe crashes on German motorways. We employ Poisson and Negative Binomial regression models, combined with Lasso regularization and stability selection, to explore model specifications incorporating potentially many interaction terms and polynomials. Utilizing an extensive data set including rich geo-spatial characteristics for 500-meter segments covering large parts of the German motorway network, key variables influencing crash frequency are uncovered. To obtain correct standard errors post variable selection, we split the data into separate samples for model selection and parameter estimation. Our results indicate that the inclusion of a limited number of higher-order terms significantly improves the regression formulation. Robustness checks confirm the stability of these findings. The results offer clearer insights into the key crash determinants and are more computationally feasible than simulation-based methods commonly used in accident research.