Fleetidy Logo

Research Paper

Predicting Motor Carrier Crashes: A Temporal Validation Study

Publication Date: February 2026

Dataset: 259,726 crashes, 6.5M violations, 8.2M inspections across 243,333 motor carriers

Validation Method: Temporal split (Year 1: 2024 → Year 2: 2025)

Abstract

This study investigates whether publicly available FMCSA safety data can reliably predict which motor carriers will experience crashes in subsequent years. Using a temporal validation approach—training on 2024 data and testing on 2025 outcomes—we demonstrate that crash prediction is not only possible but achieves strong discriminatory power (AUC 0.848, Gini 0.696). The top 10% of carriers identified as high-risk account for 58% of the following year's crashes, representing a 5.8x lift over random selection.

Critically, we find that behavioral violations (speeding, reckless driving, substance-related offenses) predict crashes more effectively than equipment violations, and this predictive power varies significantly by fleet size. Small carriers with substance violations show 3.1x relative risk, while large carriers with speeding/reckless violations show 3.6x relative risk. The number of distinct behavioral violation categories is itself a powerful predictor: carriers with three or more behavioral violation types have a 65% probability of crashing the following year.

Key Results

0.848

AUC

Excellent discrimination

5.8x

Lift

Top 10% vs random

58%

Capture Rate

Y2 crashes in top decile

0.696

Gini

Strong separation

1. Introduction

Motor carrier safety is a critical concern for insurers, regulators, and the public. The Federal Motor Carrier Safety Administration (FMCSA) collects extensive data on carrier operations, including crash records, roadside inspections, and violations. A fundamental question for the industry is: can this data predict which carriers will crash in the future?

Previous research has shown correlations between safety metrics and crash history, but correlation is not prediction. A carrier's past crashes may simply reflect its size (more miles = more opportunity for crashes) rather than underlying risk. To be useful for underwriting, loss prevention, or regulatory targeting, a predictive model must demonstrate temporal validity—the ability to identify future crashes, not just explain past ones.

This study addresses this question using a rigorous temporal split methodology: we use only 2024 data to score carriers, then measure how well those scores predict 2025 crash outcomes. This approach ensures we're testing genuine predictive power, not hindsight.

Research Questions

  1. Can publicly available FMCSA data predict which carriers will crash in subsequent years?
  2. Which data sources provide the strongest predictive signal?
  3. Do behavioral violations (driver decisions) predict crashes better than equipment violations (vehicle condition)?
  4. Does predictive power vary by carrier size?

2. Data and Methods

2.1 Data Sources

We obtained the following datasets from FMCSA's Safety Measurement System (SMS):

Dataset Records Key Fields
Crashes 259,726 DOT number, date, fatalities, injuries
Violations 6,548,807 DOT number, date, violation type, severity
Inspections 8,215,916 DOT number, date, OOS findings
Census 4,381,921 DOT number, mileage, power units

2.2 Study Population

To ensure statistical reliability, we restricted the analysis to carriers with:

This yielded a study population of 243,333 carriers.

2.3 Temporal Split Design

Training Period (Year 1): January 1, 2024 – December 31, 2024
Validation Period (Year 2): January 1, 2025 – December 31, 2025

All predictor variables (crash counts, violation counts, rates) were calculated using only Year 1 data. The outcome variable—whether a carrier crashed in Year 2—was completely withheld during model development.

2.4 Violation Categorization

We classified violations into two categories based on the FMCSA Group_Desc field:

Behavioral Violations

Driver decisions and actions

  • Speeding (all categories)
  • Reckless/Dangerous Driving
  • Drugs & Alcohol
  • Distracted Driving (phone/text)
  • Seat Belt violations
  • HOS Fraud (false logs)

Equipment Violations

Vehicle condition and maintenance

  • Brake defects
  • Tire defects
  • Lighting violations
  • Suspension issues
  • Coupling devices
  • Other vehicle defects

2.5 Evaluation Metrics

3. Results

3.1 Overall Predictive Performance

The combined model achieved strong predictive performance on the held-out 2025 data:

Metric Value Interpretation
AUC 0.848 Excellent (>0.80 is considered strong)
Gini Coefficient 0.696 Strong (insurance benchmark: >0.30)
Top 10% Lift 5.79x Top decile crashes 5.8x more than average
Spearman Correlation 0.896 Strong rank correlation

3.2 Crash Capture by Risk Decile

Sorting carriers by their 2024 risk score, we measured how 2025 crashes distributed across deciles:

Risk Decile 2025 Crashes % of Total Cumulative %
Top 10% (highest risk) 59,686 57.9% 57.9%
Top 20% 12,160 11.8% 69.7%
Top 30% 8,337 8.1% 77.8%
Top 50% 6,991 6.8% 86.5%
Bottom 50% 13,860 13.5% 100%

Key finding: The top 10% of carriers by 2024 risk score account for 58% of 2025 crashes. The top 20% account for 70%. This demonstrates strong, actionable predictive power.

3.3 Predictor Comparison: What Predicts Best?

We compared the predictive power of different data sources using relative risk (RR) and lift metrics:

2024 Predictor Carriers Relative Risk % Crashed in 2025 Lift
Had 2024 Crash 45,088 1.77x 40.7% 3.57x
Tire Defects 59,176 1.46x 30.3% 2.43x
Reckless/Dangerous Driving 35,770 1.42x 36.4% 2.70x
Speeding Violations 45,642 1.35x 35.2% 2.81x
Distracted Driving 10,549 1.35x 41.6% 2.65x
Seat Belt Violations 24,040 1.34x 35.4% 2.39x
HOS Fraud (False Logs) 23,729 1.18x 31.9% 2.10x
Brake Defects 90,303 1.16x 25.7% 2.22x
Drugs & Alcohol 4,635 1.11x 36.5% 2.22x
No 2024 Crash (baseline) 198,245 1.00x 11.4% 1.00x

3.4 Size-Stratified Analysis: Hidden Predictive Power

A critical finding is that violation predictive power varies dramatically by fleet size. When we stratify by annual mileage, behavioral violations become much stronger predictors:

Fleet Size Band Speeding RR Reckless RR Drugs/Alcohol RR
Small (50k-100k miles) 1.75x 1.92x 3.09x
Medium (100k-500k miles) 1.47x 1.53x 1.64x
Large (500k-2M miles) 1.44x 1.39x 1.43x
XLarge (2M+ miles) 3.56x 3.59x 1.58x

Small Carriers

Drugs/Alcohol: 3.09x RR

A small carrier with substance violations likely has pervasive safety culture problems. This single factor triples crash risk.

XLarge Carriers

Speeding: 3.56x | Reckless: 3.59x

For large fleets, behavioral violations indicate systemic driver management failures. The signal is 3.5x stronger than the overall average.

3.5 Cumulative Effect: Multiple Behavioral Violation Types

Perhaps the most striking finding is the cumulative effect of having multiple types of behavioral violations. The count of distinct violation categories is itself a powerful predictor:

# Behavioral Violation Types Carriers Crash Rate % Crashed in 2025 RR vs Zero
0 (no behavioral violations) 160,320 0.024 10.8% 1.00x
≥1 type 83,013 0.038 28.5% 1.54x
≥2 types 26,547 0.039 46.5% 1.59x
≥3 types 8,291 0.041 64.9% 1.67x
≥4 types 2,414 0.039 81.5% 1.61x
≥5 types 371 0.040 91.1% 1.62x

Critical finding: A carrier with 3+ behavioral violation types in 2024 has a 65% probability of crashing in 2025. With 5+ types, it's 91%. This suggests that counting distinct violation categories—not just total violations—captures something fundamental about organizational safety culture.

4. Discussion

4.1 Answer to Research Questions

Q1: Can FMCSA data predict future crashes?

Yes. With AUC 0.848 and 5.8x lift, prediction is not only possible but achieves performance comparable to mature insurance pricing models. The top 10% of risky carriers capture 58% of next year's crashes.

Q2: Which data sources provide the strongest signal?

Past crashes (rate-adjusted) are the single best predictor (1.77x RR), followed by behavioral violations analyzed by type rather than in aggregate. Raw violation data, when properly categorized, outperforms summary scores.

Q3: Do behavioral violations outperform equipment violations?

Yes, significantly. Reckless driving (1.42x RR) and speeding (1.35x RR) substantially outperform brake defects (1.16x RR) and lighting violations (~1.10x RR). Driver decisions matter more than vehicle condition.

Q4: Does predictive power vary by fleet size?

Dramatically. Behavioral violations that show 1.35x RR overall show 3.5x+ RR within specific size bands. This "hidden" predictive power is revealed only through stratified analysis. Size-specific models would significantly outperform one-size-fits-all approaches.

4.2 Why Behavioral Violations Predict Better

We hypothesize several mechanisms:

4.3 Practical Applications

Insurance Underwriting

Use behavioral violation types and crash history for risk selection. Consider fleet-size-specific rating factors.

Regulatory Targeting

Prioritize carriers with 3+ behavioral violation types for compliance reviews—65% will crash next year.

Loss Prevention

Focus interventions on speeding and reckless driving. Equipment inspections are less impactful.

4.4 Limitations

5. Conclusion

This study demonstrates that motor carrier crashes can be predicted with strong accuracy using publicly available FMCSA data. The key findings are:

  1. Prediction is possible and practical. AUC 0.848 and 5.8x lift indicate that risk scoring can meaningfully differentiate carriers. The top 10% of risky carriers account for 58% of future crashes.
  2. Behavioral violations are the key signal. Driver decisions (speeding, reckless driving, distraction) predict crashes far better than equipment conditions. A carrier's behavioral violation profile reveals its safety culture.
  3. Size stratification unlocks hidden power. Violations that appear weakly predictive overall become 3x+ predictive within fleet size bands. One-size-fits-all models leave significant value on the table.
  4. Multiple violation types compound risk. The number of distinct behavioral violation categories is itself a powerful predictor. Carriers with 3+ types have 65% crash probability; 5+ types have 91%.
  5. Past crashes remain the best single predictor. But they are size-biased. Combining crash history with behavioral violations and size stratification yields the strongest models.

Bottom line: Organizations seeking to predict motor carrier crashes should prioritize behavioral violation analysis over aggregate metrics, stratify by fleet size, and count distinct violation categories—not just total violations. The data is publicly available; the predictive power is real.

Data Availability

The underlying data for this study is publicly available from FMCSA's Safety Measurement System (SMS) data downloads. The carrier-level dataset used in this analysis (243,333 carriers with Year 1 predictors and Year 2 outcomes) is available upon request for research purposes.

For methodology details on specific violation categorizations and rate calculations, see our Violation Type Study.