U.S. Department of Transportation
Federal Highway Administration
1200 New Jersey Avenue, SE
Washington, DC 20590
202-366-4000


Skip to content
Facebook iconYouTube iconTwitter iconFlickr iconLinkedInInstagram

Federal Highway Administration Research and Technology
Coordinating, Developing, and Delivering Highway Transportation Innovations

Report
This report is an archived publication and may contain dated technical, contact, and link information
Publication Number: FHWA-HRT-05-042
Date: October 2005

Safety Effects of Differential Speed Limits

PDF Version (960 KB)

PDF files can be viewed with the Acrobat® Reader®

APPENDIX C. CONFIRMATION OF THE NEGATIVE BINOMIAL DISTRIBUTION TO CRASH DATA

As stated in the literature, the assumption in the application of the empirical Bayes formulation as done herein is that for a particular site i, the distribution of the number of crashes Ki,y over the years y obeys the Poisson distribution. Further, for a particular year y, the distribution of the number of crashes Ki,y between different i sites follows the negative binomial distribution. Based on these two assumptions, the expected number of crashes of a group mi1 are Gamma distributed.10,11,12,13 Figure 33 illustrates these concepts, where Ki,y is the actual crash count for site i and year y and mi,y is the expected crash counts for site i and year y.

View Alternative Text

Figure 33. Chart. Relationship between the Poisson and Negative Binomial Distributions for Crash Frequencies.

The Poisson and negative binomial distributions were tested with the data sets for selected states as described herein.

Verification of the Poisson Distribution

Using data from Virginia and Arizona, two techniques were used to verify that the Poisson distribution is appropriate. Firstly, theoretical versus actual frequencies were compared graphically. Secondly, the chi-square test was used to determine whether a statistically significant difference existed between the actual and theoretical distributions for the Kiy over time.

Figure 34 compares the actual crash frequency distribution and the Poisson distribution using one site on Interstate 85 in Virginia between milepost 19.52 and milepost 24.73, looking at the annual number of crashes between 1991 and 1999.

View Alternative Text

Figure 34. Chart. Comparison of Poisson Distribution and Actual Crash Distribution.

Table 27 shows that the calculated Χ2 value is less than the critical (tabulated) Χ2 value, which means that the assumed distribution is accepted. Theoretically, the computed chi-square value (which represents error in, or divergence from, the Poisson distribution) is less than the tabulated chi-square value; therefore, the hypothesis that the distributions are different cannot be proven at the 5 percent confidence level.

Table 27. Poisson validation description and results using the total crashes at four test sites.

State Test Sites Sample Size Data Χ2 calculated Χ2 sta, 0.05 Χ2 sta, 0.01
Arizona I-8 mp 42.06 to 54.96101991-20009.53014.0718.47
Arizona I-10 mp 19.79 to 26.65101991-20016.73815.5120.08
Virginia I-85 mp 19.52 to 24.7351995-19998.76411.0715.09
Virginia I-81 mp 206.04 to 213.4861995-20006.4687.8211.33

Verification of the Negative Binomial Distribution

A similar procedure was used to test the validity of the negative binomial distribution, except that crash rates as defined in figure 3 rather than the total number of crashes, was used to as the variable of interest. Table 28 highlights the result of the chi-square test and visual inspection of figure 35 suggests that the negative binomial distribution is appropriate for these data. (Crash rates rather than the number of crashes was used because of variation in the section lengths.)

Table 28. Negative binomial validation description and results.

StateYearSample SizeTest SiteCrash TypeΧ2calΧ2sta,0.05Χ2sta,0.01Result at 5% LevelResult at 1% Level
VA 199191 85n,95n,81n total42.326 44.8 60.1 Yes yes
VA 199290 85n,95n,81n total14.516 19.68 24.75 Yes yes
VA 199391 85n,95n,81n total10.730 20.08 14.07 Yes yes
VA 1995117 85,95,81n total21.386 22.37 27.72 Yes yes
VA 1996116 85,95,81n total18.153 18.31 23.91 Yes yes
VA 1997116 85,95,81n total14.135 19.68 24.76 Yes yes
VA 199984 85n,85s,81n total29.852 27.59 33.44 no(close)* yes
NC 199326 40,95,77,85 total4.005 6 9.22 Yes yes
NC 199925 40,95,77 total7.385 11.07 15.09 Yes yes
ID 199232 84,86,90,15 total28.327 38.89 45.67 Yes yes
ID 199432 84,86,90,16 total28.305 37.66 44.34 Yes yes
ID 199932 84,86,90,16 total36.322 42.57 49.61 Yes yes
ID 200032 84,86,90,16 total26.209 32.68 38.96 Yes yes
AZ 1991277 8,10,15,17,19,40 total24.151 21.03 26.25 no(close)* yes
AZ 1993277 8,10,15,17,19,40 total20.392 23.37 27.71 Yes yes
AZ 1994278 8,10,15,17,19,40 total17.697 26.3 32.03 Yes yes
AZ 1995277 8,10,15,17,19,40 total21.232 22.37 27.71 Yes yes
AZ 1996278 8,10,15,17,19,40 total19.977 22.37 27.71 Yes yes
AZ 1998279 8,10,15,17,19,41 total21.164 22.37 27.71 Yes yes
AZ 1999280 8,10,15,17,19,42 total20.149 23.69 29.17 yes yes
AZ 2000281 8,10,15,17,19,43 total24.187 27.59 33.44 yes yes

*The significance level of a chi-square test is actually a proof that a theoretical distribution does not fit the data. Thus, if a calculated chi-square value is sufficiently large such that it exceeds the 5 percent chi-square value, then it can be said that "researchers are 95 percent certain that the two distributions are different." In the two rows with asterisks, there is a 95 percent certainty that the two distributions are different but not 99 percent certain. In all other cases, it cannot be proved at the 95 percent level that the theoretical and actual distributions are different; therefore, it is presumed they are the same.

View Alternative Text

Figure 35. Chart. Comparison of Negative Binomial Distribution and Actual Crash Distribution (Probability Density Function).

 

Previous | Table of Contents | Next

Federal Highway Administration | 1200 New Jersey Avenue, SE | Washington, DC 20590 | 202-366-4000
Turner-Fairbank Highway Research Center | 6300 Georgetown Pike | McLean, VA | 22101