Research Paper
Predicting Motor Carrier Crashes: A Temporal Validation Study
Publication Date: February 2026
Dataset: 259,726 crashes, 6.5M violations, 8.2M inspections across 243,333 motor carriers
Validation Method: Temporal split (Year 1: 2024 → Year 2: 2025)
Abstract
This study investigates whether publicly available FMCSA safety data can reliably predict which motor carriers will experience crashes in subsequent years. Using a temporal validation approach—training on 2024 data and testing on 2025 outcomes—we demonstrate that crash prediction is not only possible but achieves strong discriminatory power (AUC 0.848, Gini 0.696). The top 10% of carriers identified as high-risk account for 58% of the following year's crashes, representing a 5.8x lift over random selection.
Critically, we find that behavioral violations (speeding, reckless driving, substance-related offenses) predict crashes more effectively than equipment violations, and this predictive power varies significantly by fleet size. Small carriers with substance violations show 3.1x relative risk, while large carriers with speeding/reckless violations show 3.6x relative risk. The number of distinct behavioral violation categories is itself a powerful predictor: carriers with three or more behavioral violation types have a 65% probability of crashing the following year.
Key Results
0.848
AUC
Excellent discrimination
5.8x
Lift
Top 10% vs random
58%
Capture Rate
Y2 crashes in top decile
0.696
Gini
Strong separation
1. Introduction
Motor carrier safety is a critical concern for insurers, regulators, and the public. The Federal Motor Carrier Safety Administration (FMCSA) collects extensive data on carrier operations, including crash records, roadside inspections, and violations. A fundamental question for the industry is: can this data predict which carriers will crash in the future?
Previous research has shown correlations between safety metrics and crash history, but correlation is not prediction. A carrier's past crashes may simply reflect its size (more miles = more opportunity for crashes) rather than underlying risk. To be useful for underwriting, loss prevention, or regulatory targeting, a predictive model must demonstrate temporal validity—the ability to identify future crashes, not just explain past ones.
This study addresses this question using a rigorous temporal split methodology: we use only 2024 data to score carriers, then measure how well those scores predict 2025 crash outcomes. This approach ensures we're testing genuine predictive power, not hindsight.
Research Questions
- Can publicly available FMCSA data predict which carriers will crash in subsequent years?
- Which data sources provide the strongest predictive signal?
- Do behavioral violations (driver decisions) predict crashes better than equipment violations (vehicle condition)?
- Does predictive power vary by carrier size?
2. Data and Methods
2.1 Data Sources
We obtained the following datasets from FMCSA's Safety Measurement System (SMS):
| Dataset | Records | Key Fields |
|---|---|---|
| Crashes | 259,726 | DOT number, date, fatalities, injuries |
| Violations | 6,548,807 | DOT number, date, violation type, severity |
| Inspections | 8,215,916 | DOT number, date, OOS findings |
| Census | 4,381,921 | DOT number, mileage, power units |
2.2 Study Population
To ensure statistical reliability, we restricted the analysis to carriers with:
- At least 50,000 annual miles (sufficient exposure for rate calculation)
- At least 1 inspection in 2024 (observable activity)
- Active census registration
This yielded a study population of 243,333 carriers.
2.3 Temporal Split Design
Training Period (Year 1): January 1, 2024 – December 31, 2024
Validation Period (Year 2): January 1, 2025 – December 31, 2025
All predictor variables (crash counts, violation counts, rates) were calculated using only Year 1 data. The outcome variable—whether a carrier crashed in Year 2—was completely withheld during model development.
2.4 Violation Categorization
We classified violations into two categories based on the FMCSA Group_Desc field:
Behavioral Violations
Driver decisions and actions
- Speeding (all categories)
- Reckless/Dangerous Driving
- Drugs & Alcohol
- Distracted Driving (phone/text)
- Seat Belt violations
- HOS Fraud (false logs)
Equipment Violations
Vehicle condition and maintenance
- Brake defects
- Tire defects
- Lighting violations
- Suspension issues
- Coupling devices
- Other vehicle defects
2.5 Evaluation Metrics
- AUC (Area Under ROC Curve): Measures discriminatory power. 0.5 = random, 1.0 = perfect.
- Gini Coefficient: 2 × (AUC - 0.5). Standard insurance industry metric.
- Lift: Ratio of crash rate in top decile vs overall rate.
- Capture Rate: Percentage of Year 2 crashes found in top X% of risky carriers.
- Relative Risk (RR): Crash rate with factor / crash rate without factor.
3. Results
3.1 Overall Predictive Performance
The combined model achieved strong predictive performance on the held-out 2025 data:
| Metric | Value | Interpretation |
|---|---|---|
| AUC | 0.848 | Excellent (>0.80 is considered strong) |
| Gini Coefficient | 0.696 | Strong (insurance benchmark: >0.30) |
| Top 10% Lift | 5.79x | Top decile crashes 5.8x more than average |
| Spearman Correlation | 0.896 | Strong rank correlation |
3.2 Crash Capture by Risk Decile
Sorting carriers by their 2024 risk score, we measured how 2025 crashes distributed across deciles:
| Risk Decile | 2025 Crashes | % of Total | Cumulative % |
|---|---|---|---|
| Top 10% (highest risk) | 59,686 | 57.9% | 57.9% |
| Top 20% | 12,160 | 11.8% | 69.7% |
| Top 30% | 8,337 | 8.1% | 77.8% |
| Top 50% | 6,991 | 6.8% | 86.5% |
| Bottom 50% | 13,860 | 13.5% | 100% |
Key finding: The top 10% of carriers by 2024 risk score account for 58% of 2025 crashes. The top 20% account for 70%. This demonstrates strong, actionable predictive power.
3.3 Predictor Comparison: What Predicts Best?
We compared the predictive power of different data sources using relative risk (RR) and lift metrics:
| 2024 Predictor | Carriers | Relative Risk | % Crashed in 2025 | Lift |
|---|---|---|---|---|
| Had 2024 Crash | 45,088 | 1.77x | 40.7% | 3.57x |
| Tire Defects | 59,176 | 1.46x | 30.3% | 2.43x |
| Reckless/Dangerous Driving | 35,770 | 1.42x | 36.4% | 2.70x |
| Speeding Violations | 45,642 | 1.35x | 35.2% | 2.81x |
| Distracted Driving | 10,549 | 1.35x | 41.6% | 2.65x |
| Seat Belt Violations | 24,040 | 1.34x | 35.4% | 2.39x |
| HOS Fraud (False Logs) | 23,729 | 1.18x | 31.9% | 2.10x |
| Brake Defects | 90,303 | 1.16x | 25.7% | 2.22x |
| Drugs & Alcohol | 4,635 | 1.11x | 36.5% | 2.22x |
| No 2024 Crash (baseline) | 198,245 | 1.00x | 11.4% | 1.00x |
3.4 Size-Stratified Analysis: Hidden Predictive Power
A critical finding is that violation predictive power varies dramatically by fleet size. When we stratify by annual mileage, behavioral violations become much stronger predictors:
| Fleet Size Band | Speeding RR | Reckless RR | Drugs/Alcohol RR |
|---|---|---|---|
| Small (50k-100k miles) | 1.75x | 1.92x | 3.09x |
| Medium (100k-500k miles) | 1.47x | 1.53x | 1.64x |
| Large (500k-2M miles) | 1.44x | 1.39x | 1.43x |
| XLarge (2M+ miles) | 3.56x | 3.59x | 1.58x |
Small Carriers
Drugs/Alcohol: 3.09x RR
A small carrier with substance violations likely has pervasive safety culture problems. This single factor triples crash risk.
XLarge Carriers
Speeding: 3.56x | Reckless: 3.59x
For large fleets, behavioral violations indicate systemic driver management failures. The signal is 3.5x stronger than the overall average.
3.5 Cumulative Effect: Multiple Behavioral Violation Types
Perhaps the most striking finding is the cumulative effect of having multiple types of behavioral violations. The count of distinct violation categories is itself a powerful predictor:
| # Behavioral Violation Types | Carriers | Crash Rate | % Crashed in 2025 | RR vs Zero |
|---|---|---|---|---|
| 0 (no behavioral violations) | 160,320 | 0.024 | 10.8% | 1.00x |
| ≥1 type | 83,013 | 0.038 | 28.5% | 1.54x |
| ≥2 types | 26,547 | 0.039 | 46.5% | 1.59x |
| ≥3 types | 8,291 | 0.041 | 64.9% | 1.67x |
| ≥4 types | 2,414 | 0.039 | 81.5% | 1.61x |
| ≥5 types | 371 | 0.040 | 91.1% | 1.62x |
Critical finding: A carrier with 3+ behavioral violation types in 2024 has a 65% probability of crashing in 2025. With 5+ types, it's 91%. This suggests that counting distinct violation categories—not just total violations—captures something fundamental about organizational safety culture.
4. Discussion
4.1 Answer to Research Questions
Q1: Can FMCSA data predict future crashes?
Yes. With AUC 0.848 and 5.8x lift, prediction is not only possible but achieves performance comparable to mature insurance pricing models. The top 10% of risky carriers capture 58% of next year's crashes.
Q2: Which data sources provide the strongest signal?
Past crashes (rate-adjusted) are the single best predictor (1.77x RR), followed by behavioral violations analyzed by type rather than in aggregate. Raw violation data, when properly categorized, outperforms summary scores.
Q3: Do behavioral violations outperform equipment violations?
Yes, significantly. Reckless driving (1.42x RR) and speeding (1.35x RR) substantially outperform brake defects (1.16x RR) and lighting violations (~1.10x RR). Driver decisions matter more than vehicle condition.
Q4: Does predictive power vary by fleet size?
Dramatically. Behavioral violations that show 1.35x RR overall show 3.5x+ RR within specific size bands. This "hidden" predictive power is revealed only through stratified analysis. Size-specific models would significantly outperform one-size-fits-all approaches.
4.2 Why Behavioral Violations Predict Better
We hypothesize several mechanisms:
- Driver behavior is causal: Speeding, distraction, and impairment directly cause crashes. Equipment defects are often incidental findings during inspections.
- Behavioral violations reflect culture: A carrier with multiple behavioral violation types likely has weak safety management, poor driver selection, or inadequate supervision—systemic factors that persist over time.
- Equipment can be fixed quickly: A brake defect found today may be repaired tomorrow. A driver who speeds today will likely speed tomorrow.
4.3 Practical Applications
Insurance Underwriting
Use behavioral violation types and crash history for risk selection. Consider fleet-size-specific rating factors.
Regulatory Targeting
Prioritize carriers with 3+ behavioral violation types for compliance reviews—65% will crash next year.
Loss Prevention
Focus interventions on speeding and reckless driving. Equipment inspections are less impactful.
4.4 Limitations
- Selection bias: Only carriers with inspections are observable. Carriers avoiding inspections may differ systematically.
- Mileage accuracy: MCS-150 mileage is self-reported and may be unreliable for some carriers.
- Crash reporting: Minor crashes may be underreported, especially for small carriers.
- Two-year window: With only 2024-2025 data, we cannot assess longer-term stability or multiple validation periods.
5. Conclusion
This study demonstrates that motor carrier crashes can be predicted with strong accuracy using publicly available FMCSA data. The key findings are:
- Prediction is possible and practical. AUC 0.848 and 5.8x lift indicate that risk scoring can meaningfully differentiate carriers. The top 10% of risky carriers account for 58% of future crashes.
- Behavioral violations are the key signal. Driver decisions (speeding, reckless driving, distraction) predict crashes far better than equipment conditions. A carrier's behavioral violation profile reveals its safety culture.
- Size stratification unlocks hidden power. Violations that appear weakly predictive overall become 3x+ predictive within fleet size bands. One-size-fits-all models leave significant value on the table.
- Multiple violation types compound risk. The number of distinct behavioral violation categories is itself a powerful predictor. Carriers with 3+ types have 65% crash probability; 5+ types have 91%.
- Past crashes remain the best single predictor. But they are size-biased. Combining crash history with behavioral violations and size stratification yields the strongest models.
Bottom line: Organizations seeking to predict motor carrier crashes should prioritize behavioral violation analysis over aggregate metrics, stratify by fleet size, and count distinct violation categories—not just total violations. The data is publicly available; the predictive power is real.
Data Availability
The underlying data for this study is publicly available from FMCSA's Safety Measurement System (SMS) data downloads. The carrier-level dataset used in this analysis (243,333 carriers with Year 1 predictors and Year 2 outcomes) is available upon request for research purposes.
For methodology details on specific violation categorizations and rate calculations, see our Violation Type Study.