This report is an archived publication and may contain dated technical, contact, and link information

Publication Number: FHWA-RD-03-037
Date: May 2005

Validation of Accident Models for Intersections

FHWA Contact: John Doremi,
HRDI-10, (202) 493-3052, John.doremi@dot.gov

PDF files can be viewed with the Acrobat® Reader®

2. VALIDATION OF ACCIDENT MODELS (Continuation)

2.5.3 Model III

The summary statistics in the original report and Georgia data are given in table 62, which reveals that the Georgia sample had on average fewer accidents per year than the original data. This implies that either the Georgia sites were relatively more safe than the sites selected for the original model, or that the passage of time between the period for the original calibration (1993-95) and that for the validation data (1996-97) had resulted in an overall improvement in safety (due to many factors including improved roadway design, improved vehicles, emergency response services, etc.). The Georgia sites may also be safer because they have, on average, wider medians on major roads and fewer numbers of driveways than the original intersections. In addition, more than 50 percent of the sites in the original data had no median, while only 5 percent of sites in Georgia were without a median.

Table 62. Summary Statistics of Georgia Data: Type III Sites

Variable and Abbreviation		N	Mean	Median	Minimum	Maximum	Freq.	% Zero
No. of Crashes (TOTACC)	Original Data	84	3.88	2	0	19	326	21.4
	Georgia (0.05 Mile)	52	2.4	1.5	0	12	124	21.2
	Georgia (0.04 Mile)	52	2.2	1	0	12	116	25.0
No. of Intersection-Type Crashes (TOTACCI)	Original Data	84	2.62	1	0	13	135	34.5
	Georgia (0.05 Mile)	52	1.6	1	0	11	85	32.7
	Georgia (0.04 Mile)	52	1.5	1	0	11	80	36.5
	Georgia (0.05 Mile)	52	1.08	1	0	8	56	44.2
	Georgia (0.04 Mile)	52	1.02	1	0	8	53	48.1
	Georgia (0.05 Mile)	52	0.81	0	0	8	42	57.7
	Georgia (0.04 Mile)	52	0.77	0	0	8	40	61.5
Median Width on Major Road (MEDWIDTH1)	Original Data	84	3.74	0	0	36	N/A¹	53.6
Median Width on Major Road (MEDWIDTH1)	Georgia	52	27.0	20	0	63	N/A¹	5.8
No. of Driveways on Major Road (DRWY1)	Original Data	84	3.1	1	0	15	259	42.9
No. of Driveways on Major Road (DRWY1)	Georgia	52	1.5	1	0	9	77	42.3
AADT1 on Major Road	Original Data	84	12870	12050	2367	33058	N/A¹	N/A¹
AADT1 on Major Road	Georgia	52	13100	12200	6500	28600	N/A¹	N/A¹
AADT2 on Minor Road	Original Data	84	596	349	15	3001	N/A¹	N/A¹
AADT2 on Minor Road	Georgia	52	892	430	80	9490	N/A¹	N/A¹

¹ N/A: not available

Total Accident Models (TOTACC)

The model was recalibrated using the Georgia data. The parameter estimates, their standard errors, and p-values are shown in table 63, which reveals that the constant term, AADT1 and AADT2, were estimated with the same sign but with large differences in magnitude. MEDWDTH2 and DRWY1 were estimated with an opposite sign, although they were not statistically significant. AADT1 was also estimated as insignificant. The overdispersion parameters were lower than that for the original model, but the difference was not great.

Table 63. Parameter Estimates for TOTACC Type III Model Using Georgia Data

Variable	Original Estimate¹ (s.e., p-value)	Georgia Data 0.04 Mile (s.e, p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-12.2196 (2.3575, 0.0001)	-8.690 (4.945, 0.1059)	-8.857 (4.585, 0.0750)
Log of AADT1	1.1479 (0.2527, 0.0001)	0.536 (0.459, 0.2434)	0.580 (0.426, 0.1737)
Log of AADT2	0.2624 (0.0866, 0.0024)	0.551 (0.179, 0.0021)	0.536 (0.163, 0.0010)
MEDWIDTH1	-0.0546 (0.0249, 0.0285)	0.004 (0.013, 0.7748)	0.0002 (0.012, 0.9894)
DRWY1	0.0391 (0.0239, 0.1023)	-0.009 (0.088, 0.9156)	0.011 (0.094, 0.9101)
K²	0.3893	0.374	0.300

¹ Vogt, 1999, (p. 111)

² K: Overdispersion value

Table 64 shows the prediction performance statistics for Model III for TOTACC. Low Pearson product-moment correlation coefficients with the Georgia data indicate that the accident predictions by the original model are marginally correlated with the observed number of accidents in the Georgia data. Other validation statistics also suggest a poor fit of the original model to the Georgia data. The MPB and MAD per year were larger than those for the original model. The MSPE per year squared was almost twice as high as the MSE per year squared.

Figure 5 depicts the prediction performance of the original model for individual sites in the Georgia 0.05-mile data. It is quite evident that the original model does not do a good job of predicting accidents at the Georgia intersections; this finding was expected on the basis of the low Pearson product-moment coefficients for the Georgia data.

Table 64. Validation Statistics for TOTACC Type III Model Using Georgia Data

Measure	Original Data	Georgia Data
Measure	Original Data	0.04 Mile	0.05 Mile
Years used for validation	1993 to 1995	1996 to 1997	1996 to 1997
Number of sites	84	52	52
Pearson product-moment correlation coefficients	0.66	0.03	0.09
MPB	-0.01	-1.34	-1.49
MPB/yr	0.00	-0.45	-0.50
MAD	2.26	5.93	6.14
MAD/yr	0.75	1.98	2.05
MSE	11.01	N/A¹	N/A¹
MSE/yr²	1.22	N/A¹	N/A¹
MSPE	N/A¹	9.36	9.64
MSPE/yr²	N/A¹	2.34	2.41

¹ N/A: not available

Figure 5. Observed versus Predicted Accident Frequency: TOTACC Type III

Intersection Related Total Accident Model (TOTACCI)

The parameter estimates, their standard errors, and p-values are given in table 65. Similar to the model of TOTACC, the variables AADT1, MEDWITH1, and DRWY1 were estimated as statistically insignificant. The constant term and AADT1, AADT2, and DRWY1 were estimated with the same direction of effect but with large differences in magnitude. The overdispersion parameters, K, were lower than that for the original model.

Table 65. Parameter Estimates for TOTACCI Type III Model Using Georgia Data

Variable	Original Estimate¹ (s.e., p-value)	Georgia Data 0.04 Mile (s.e, p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-15.4661 (3.4685, 0.0001)	-7.774 (4.511, 0.1165)	-8.163 (4.097, 0.0683)
Log of AADT1	1.4331 (0.3608, 0.0001)	0.232 (0.366, 0.5264)	0.301 (0.339, 0.3747)
Log of AADT2	0.2686 (0.0988, 0.0065)	0.764 (0.262, 0.0035)	0.740 (0.229, 0.0012)
MEDWIDTH1	-0.0612 (0.0360, 0.0888)	0.004 (0.013, 0.7719)	0.002 (0.012, 0.8682)
DRWY1	0.0560 (0.0289, 0.0525)	0.090 (0.133, 0.4980)	0.095 (0.126, 0.4504)
K²	0.5118	0.352	0.272

¹ Vogt, 1999, (p. 112)

² K: Overdispersion value

Table 66 shows the GOF statistics for Model III for TOTACCI. Low Pearson product-moment correlation coefficients with the Georgia data indicate that the accident predictions by the original model are marginally correlated with the observed number of accidents in the Georgia data. Other validation statistics also suggest lack-of-fit to the Georgia data. The MPB and MAD per year were larger than those for the original model. The MSPE per year squared was almost twice as high as the MSE per year squared, indicating a general lack-of-fit to the Georgia data.

A plot of the predicted versus actual accidents using Georgia data will help to understand prediction performances of the original model for the Georgia data. As shown in figure 6, it is quite evident that the original model does not do a good job of predicting accidents at the Georgia intersections; this finding was expected on the basis of the low Pearson product-moment coefficients for the Georgia data.

Table 66. Validation Statistics for TOTACCI Type III Model Using Georgia Data

Measure	Original Data	Georgia Data
Measure	Original Data	0.04 Mile	0.05 Mile
Years used for validation	1993 to 1995	1996 to 1997	1996 to 1997
Number of sites	84	52	52
Pearson product-moment correlation coefficients	0.67	0.08	0.10
MPB	-0.005	-1.03	-1.13
MPB/yr	-0.002	-0.52	-0.56
MAD	1.76	1.95	1.95
MAD/yr	0.59	0.97	0.97
MSE	6.50	N/A¹	N/A¹
MSE/yr²	0.72	N/A¹	N/A¹
MSPE	N/A¹	5.93	6.14
MSPE/yr²	N/A¹	1.48	1.54

¹ N/A: not available

Figure 6. Observed versus Predicted Accident Frequency: TOTACCI Type III

Injury Accident Model (INJACC)

The two original variants for model III were validated.

Variant 1

The parameter estimates, their standard errors, and p-values are given in table 67, which reveals that the constant term and all of the variables were estimated with the same sign as in the original model, but there were large differences in their magnitudes. The constant term, AADT1, and HAU became insignificant with the Georgia data. The overdispersion parameters, K, were higher than that for the original model.

Table 68 shows the GOF measures for the original injury accident model (Variant 1) in the Vogt report applied to the Georgia data.⁽²⁾

The Pearson product-moment correlation coefficients were similar to those for the TOTACC model. However, the MPB, MAD, and MSPE per year squared were smaller than those for the TOTACC model.

Table 67. Parameter Estimates for INJACC Type III Model Using Georgia Data: Variant 1

Variable	Report Estimate¹ (s.e., p-value)	Georgia Data 0.04 Mile (s.e, p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-12.3246 (2.8076, 0.0001)	-7.642 (6.397, 0.2774)	-6.958 (5.949, 0.2923)
Log of AADT1	1.1436 (0.2763, 0.0001)	0.423 (0.573, 0.4602)	0.381 (0.531, 0.4730)
Log of AADT2	0.1357 (0.1029, 0.1872)	0.454 (0.255, 0.0752)	0.420 (0.243, 0.0838)
HAU	0.0230 (0.0131, 0.0790)	0.001 (0.010, 0.8886)	0.000 (0.009, 0.9743)
K²	0.3787	0.682	0.553

¹ Vogt, 1999, (p. 113)

² K: Overdispersion value

Table 68. Validation Statistics for INJACC Type III Model Using Georgia Data: Variant 1

Measure	Georgia Data
Measure	0.04 Mile	0.05 Mile
Years used for validation	1996 to 1997	1996 to 1997
Number of sites	52	52
Pearson product-moment correlation coefficients	0.09	0.08
MPB	0.23	0.19
MPB/yr	0.11	0.10
MAD	0.78	0.78
MAD/yr	0.39	0.39
MSPE	2.30	2.30
MSPE/yr²	0.58	0.58

Figure 7 depicts the prediction performance of the original model for individual sites in the Georgia 0.05-mile data. It is quite evident that the original model does not do a good job of predicting accidents at the Georgia intersections, a finding that would have been expected on the basis of the low Pearson product-moment coefficients for the Georgia data.

Figure 7. Observed versus Predicted Accident Frequency: Injury Variant 1

Variant 2

The parameter estimates, their standard errors, and p-values are given in table 69, which reveals that the constant term and all of the variables were estimated with the same sign as in the original model. However, all of the variables except AADT2 became insignificant, and there were large differences in the magnitudes of the parameters. The overdispersion parameter, K, was almost twice as high as for the original model.

Table 70 shows the GOF measures for the original injury accident model (Variant 2) in the Vogt report applied to the Georgia data.⁽²⁾

The Pearson product-moment correlation coefficient was similar to that for TOTACC. However, the MPB, MAD, and MSPE per year squared were smaller than those for TOTACC.

Figure 8 depicts the prediction performance of the original model for individual sites in the Georgia 0.05-mile data. It is quite evident that the original model performs poorly when applied to the Georgia data, a finding that would have been expected on the basis of the low Pearson product-moment coefficients for the Georgia data.

Table 69. Parameter Estimates for INJACC Type III Model Using Georgia Data: Variant 2

Variable	Report Estimate¹ (s.e., p-value)	Georgia Data 0.04 Mile (s.e, p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-11.0061 (2.6937, 0.0001)	-8.238 (7.223, 0.2962)	-7.786 (6.571, 0.2803)
Log of AADT1	0.9526 (0.2843, 0.0008)	0.457 (0.627, 0.4663)	0.410 (0.565, 0.4678)
Log of AADT2	0.1499 (0.0916, 0.1018)	0.468 (0.278, 0.0920)	0.457 (0.258, 0.0771)
HAU	0.0289 (0.0105, 0.0061)	0.002 (0.010, 0.8764)	0.001 (0.010, 0.9046)
DRWY1	0.0481 (0.0262, 0.0664)	0.038 (0.120, 0.7488)	0.085 (0.151, 0.5734)
ABSGRD1	0.1838 (0.1130, 0.1038)	0.167 (0.439, 0.7042)	0.225 (0.415, 0.5871)
K²	0.2588	0.666	0.501

¹ Vogt, 1999, (p. 113)

² K: Overdispersion value

Table 70. Validation Statistics for INJACC Type III Model Using Georgia Data: Variant 2

Measure	Georgia Data
Measure	0.04 Mile	0.05 Mile
Years used for validation	1996 to 1997	1996 to 1997
Number of sites	52	52
Pearson product-moment correlation coefficients	0.05	0.04
MPB	0.15	0.11
MPB/yr	0.08	0.06
MAD	0.77	0.77
MAD/yr	0.39	0.39
MSPE	2.61	2.61
MSPE/yr²	0.65	0.65

Figure 8. Observed versus Predicted Accident Frequency: Injury Variant 2

2.5.4 Model IV

The summary statistics are provided in table 71. Peak left-turn percentage on major road was not available in the Georgia data, since this variable would be too costly to collect in the field. Since the variable was not present in the Georgia data, modifications to the validation procedure had to be performed. The variable was removed from the original model by dividing both sides of the model equation by the exponential value of the coefficient of the variable times its average effect (the average effect of PKLEFT1 is the average value of PKLEFT1 in the calibration data).

The summary statistics showed that about 31 percent of the sites in the original data had no left-turn lane, while 17 percent in the Georgia data were without a left-turn lane. The summary statistics for all of the three States (California, Michigan, and Georgia) were also compared (refer to table 72). All of the sites in Michigan had no LTLN1S, while frequencies of TOTACC and TOTACCI for Georgia were higher than for the California data.

Pearson correlations of the original data, Georgia, and California are given in table 73. The observation that the coefficients for AADT 2 and LTLN1S estimated using Georgia data resulted in opposite signs than the original model required further investigation. Pearson correlations for these variables with the response (accident frequency) were computed for all three States-California, Michigan, and Georgia. Recall that the Pearson correlation reflects the degree to which the two variables are linearly related. Unlike the original data, AADT2 in Georgia is estimated as negative linearly related with TOTACC and TOTACCI, but these correlations are marginal and statistically insignificant. The variable LTLN1 is positively related with TOTACC and TOTACCI in Georgia and California (not significant), but is negative and significant for the Michigan data.

Table 71. Summary Statistics of Georgia Data: Type IV

Variable and Abbreviation		N	Mean	Median	Minimum	Maximum	Freq.	% Zero
No. of Crashes (TOTACC)	Original Data	72	5.5	3.5	0	38	398	12.5
	Georgia (0.05 Mile)	52	4.27	4.0	0	13	222	13.5
	Georgia (0.04 Mile)	52	4.17	3.0	0	13	217	13.5
No. of Intersection-Type Crashes (TOTACCI)	Original Data	72	4.1	2	0	27	297	22.2
	Georgia (0.05 Mile)	52	3.08	3.0	0	11	160	26.9
	Georgia (0.04 Mile)	52	3.06	3.0	0	11	159	36.5
No. of Injury Crashes (INJACC)	Georgia (0.05 Mile)	52	2.06	2.0	0	9.0	107	32.7
No. of Injury Crashes (INJACC)	Georgia (0.04 Mile)	52	2.0	2.0	0	9.0	104	32.7
No. of Intersection-Type Injury Crashes (INJACCI)	Georgia (0.05 Mile)	52	1.67	1.0	0	9.0	87	38.5
No. of Intersection-Type Injury Crashes (INJACCI)	Georgia (0.04 Mile)	52	1.67	1	0	9	87	38.5
Left-Turn Lanes on Major Road (LTLN1S)	Original Data	72	0.7	1	0	1	N/A¹	30.6
Left-Turn Lanes on Major Road (LTLN1S)	Georgia	52	0.8	1	0	1	N/A¹	17.3
Peak Left-Turn Percentage on Major Road (PKLEFT1)	Original Data	72	2.8	1.51	0	13.96	N/A¹	5.6
Peak Left-Turn Percentage on Major Road (PKLEFT1)	Georgia	N/A¹
AADT1 on Major Road	Original Data	72	13018	11166	3350	73000	N/A¹	N/A¹
AADT1 on Major Road	Georgia	52	13100	12200	6500	28600	N/A¹	N/A¹
AADT2 on Minor Road	Original Data	72	559	410	21	2018	N/A¹	N/A¹
AADT2 on Minor Road	Georgia	52	892	430	80	9490	N/A¹	N/A¹

¹ N/A: not available

Table 72. Summary Statistics of California, Michigan, and Georgia

Variable	California (N=54)¹				Michigan (N=18)¹				Georgia (N=52)²
Variable	Mean	Median	Min.	Max.	Mean	Median	Min.	Max.	Mean	Median	Min.	Max.
TOTACC	4.2	3	0	22	9.4	8.5	0	38	4.3	4	0	13
TOTACCI	3.5	2	0	21	6	4.5	0	27	3.1	3	0	11
AADT1	13788	11250	3350	73000	10707	10550	5967	19383	12631	12831	5300	25800
AADT2	441	301	21	1850	913	733	254	2018	706	463	300	2990
PKLEFT1	2.25	1	0	14	4.4	3.1	0.8	11.6	N/A	N/A	N/A	N/A
LTLNS	0.93	1	0	1	0	0	0	0	0.9	1	0	1

¹ Summary Statistics for California and Michigan were produced using the obtained original data

² Used TOTACC and TOTACCI for 0.05 mile

Table 73. Pearson Correlations: Original, Georgia, and California

The original data (N=72)

Variable		TOTACC	TOTACCI	AADT1	AADT2
TOTACC	Pearson Correlation	1.000	0.961	0.152	0.480
TOTACC	Sig. (2-tailed)	N/A¹	0.000	0.203	0.000
TOTACCI	Pearson Correlation	0.961	1.000	0.164	0.461
TOTACCI	Sig. (2-tailed)	0.000	N/A¹	0.168	0.000
AADT1	Pearson Correlation	0.152	0.164	1.000	-0.108
AADT1	Sig. (2-tailed)	0.203	0.168	N/A¹	0.365
AADT2	Pearson Correlation	0.480	0.461	-0.108	1.000
AADT2	Sig. (2-tailed)	0.000	0.000	0.365	N/A¹
LTLN1	Pearson Correlation	-0.279	-0.169	0.210	-0.219
LTLN1	Sig. (2-tailed)	0.018	0.156	0.077	0.064

Georgia (N= 52): 0.05 mile

Variable		TOTACC	TOTACCI	AADT1	AADT2
TOTACC	Pearson Correlation	1.000	0.934	0.294	-0.096
TOTACC	Sig. (2-tailed)	N/A¹	0.000	0.035	0.501
TOTACCI	Pearson Correlation	0.934	1.000	0.247	-0.096
TOTACCI	Sig. (2-tailed)	0.000	N/A¹	0.077	0.499
AADT1	Pearson Correlation	0.294	0.247	1.000	-0.008
AADT1	Sig. (2-tailed)	0.035	0.077	N/A¹	0.956
AADT2	Pearson Correlation	-0.096	-0.096	-0.008	1.000
AADT2	Sig. (2-tailed)	0.501	0.499	0.956	N/A¹
LTLN1	Pearson Correlation	0.293	0.250	0.083	0.143
LTLN1	Sig. (2-tailed)	0.035	0.074	0.558	0.313

¹ N/A: not available

Table 73 . Pearson Correlations: Original, Georgia, and California (Continued)

California (N=54)

Variable		TOTACC	TOTACCI	AADT1	AADT2
TOTACC	Pearson Correlation	1	0.987	0.215	0.494
TOTACC	Sig. (2-tailed)	N/A¹	0	0.118	0
TOTACCI	Pearson Correlation	0.987	1	0.201	0.508
TOTACCI	Sig. (2-tailed)	0	N/A¹	0.146	0
AADT1	Pearson Correlation	0.215	0.201	1	-0.035
AADT1	Sig. (2-tailed)	0.118	0.146	N/A¹	0.804
AADT2	Pearson Correlation	0.494	0.508	-0.035	1
AADT2	Sig. (2-tailed)	0	0	0.804	N/A¹
LTLN1	Pearson Correlation	0.074	0.046	0.138	0.266
LTLN1	Sig. (2-tailed)	0.596	0.742	0.319	0.052

¹ N/A: not available

Total Accident Models (TOTACC)

The parameter estimates, their standard errors, and p-values are given in table 74. Since the variable PKLEFT1 (peak left-turn percentage on major road) is not present in the Georgia data, modifications to the validation procedure had to be performed as described earlier. In the validation, the same parameter estimates in the originally published report were used, and the parameter estimates were also reproduced without PKLEFT1 for the revised original model ("Revised Estimates" in table 74).

In the revised original model, all of the variables were estimated with the same sign but with large differences in magnitude. The effect of AADT1 became smaller, while that of AADT2 became larger. The overdispersion values with the Georgia data were higher than for the original models, but the difference was not great.

For the Georgia data, the constant term and AADT1 were estimated with the same sign as for the original models. However, AADT2 and LTLN1S were estimated with an opposite sign to the original model, although AADT2 was insignificant. The values of the overdispersion parameter K for the Georgia data were lower than those for the original data.

Table 74. Parameter Estimates for TOTACC Type IV Model Using Georgia Data

Variable	Original Estimates¹ (s.e., p-value)	Revised Estimates² (s.e, p-value)	Georgia Data 0.04 Mile (s.e., p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-9.4631 (2.5991, 0.0003)	-6.705 (2.373, 0.0181)	-5.599 (3.977, 0.2174)	-5.764 (4.110, 0.2173)
Log of AADT1	0.8503 (0.2779, 0.0022 )	0.501 (0.231, 0.0301)	0.624 (0.365, 0.0875)	0.653 (0.380, 0.0860)
Log of AADT2	0.3294 (0.1255, 0.0087)	0.478 (0.097, 0.0000)	-0.112 (0.229, 0.6253)	-0.097 (0.241, 0.6867)
PKLEFT1	0.1100 (0.0412, 0.0076)	N/A⁴	N/A⁴	N/A⁴
LTLN1S	-0.4841 (0.2311, 0.0362)	-0.504 (0.245, 0.0393)	1.273 (0.432, 0.0032)	1.085 (0.377, 0.0040)
K³	0.4578	0.553	0.382	0.417

¹ Vogt, 1999, (p. 116)

² Coefficient estimates of the variables were reproduced without PKLEFT1 using the original data

³ K: Overdispersion value

⁴ N/A: not available

Since PKLEFT1 was not available in the Georgia data, two models (original model and revised original model) were used for the validation activity to determine GOF measures. For the original model, the same parameter estimates in the report were used. For the revised original model, since PKLEFT1 was not available, PKLEFT1 was removed from the original model by dividing by the exponential value of the coefficient of this variable times its average effect, i.e., the average value of PKLEFT1.

GOF measures of the revised original model, shown in table 75, indicate that it could be a good alternative to the original model. Pearson product-moment correlation coefficients, MAD per year, and MSE per year squared were similar to those for the original model. The MPB per year was higher than that for the original model, but the difference was not great.

Values of 0.05 and 0.08 of the Pearson product-moment correlation coefficient indicate that the accidents in the Georgia data are not linearly related with the model-predicted values. This could be the result of a significant nonlinearity in the data and original model. The MPB and MAD per year for the Georgia data were larger than those for the original year data. The MSPEs per year squared were also higher than the MSEs per year squared.

Figure 9 depicts the prediction performance of the original model for individual sites in the Georgia 0.05-mile data. It is quite evident that the original model does not fit the Georgia data well, a finding that would have been expected on the basis of the low Pearson product-moment coefficients for the Georgia data.

Table 75. Validation Statistics for TOTACC Type IV Model Using Georgia Data

Measure	Original Model¹	Revised Original Model²	Georgia³ (0.04 Mile)	Georgia³ (0.05 Mile)
Years used for validation	1993-1995	1993-1995	1996-1997	1996-1997
Number of sites	72	72	51	51
Pearson product-moment correlation coefficients	0.56	0.56	0.05	0.08
MPB	-0.07	1.41	2.25	2.27
MPB/yr	-0.02	0.47	1.12	1.13
MAD	3.38	3.49	3.09	3.11
MAD/yr	1.13	1.16	1.54	1.55
MSE	30.62	32.66	N/A⁴	N/A⁴
MSE/yr²	3.40	3.63	N/A⁴	N/A⁴
MSPE	N/A⁴	N/A⁴	17.86	18.51
MSPE/yr²	N/A⁴	N/A⁴	4.47	4.63

¹ Used the original main model in the report. This model includes PKLEFT1

² Used the same coefficients in the original model, but PKLEFT1 was removed from the model by dividing by the exponential value of the coefficient of this variable times its average effect

³ Used the revised original model

⁴ N/A: not available

Figure 9. Observed versus Predicted Accident Frequency: TOTACC

Intersection Related Total Accident Model (TOTACCI)

The parameter estimates, their standard errors, and p-values are given in table 76. As before, the two models (original model and revised original model) were used for the validation. For the original model, the same parameter estimates in the report were used. Since the report also developed a model with AADT1 and AADT2 only, which model ("Revised Estimates" in table 76) was included for the validation.

In the alternative original model the constant term and parameter estimates of AADT1 and AADT2 were estimated with the same sign but with some difference in magnitude. The effect of AADT1 became smaller, while that of AADT2 became larger. The overdispersion value was slightly higher than for the original model.

For the Georgia data, AADT2 was estimated with an opposite sign to that of the original models. However, it was statistically insignificant, and the impact of the variable on the accident prediction was marginal. The constant term and AADT1 were also estimated as insignificant for the Georgia data. The overdispersion values for the Georgia data were similar to that for the revised original model.

The prediction performance measures are shown in table 77. As was the case for the TOTACC models, the revised model showed similar prediction performance measures to the original model.

Table 76. Parameter Estimates for TOTACCI Type IV Model Using Georgia Data

Variable	Original Estimates¹ (s.e., p-value)	Revised Estimates² (s.e, p-value)	Georgia Data 0.04 Mile (s.e., p-value)	Georgia Data 0.05 Mile (s.e., p-value)
Constant	-11.1096 (3.3345, 0.0008)	-7.2501 (2.9094, .0130)	-4.604 (5.482, 0.4755)	-4.603 (5.498, 0.4770)
Log of AADT1	0.9299 (0.3433, 0.0067 )	0.4582 (0.2844, 0.1071)	0.562 (0.495, 0.2564)	0.563 (0.497, 0.2568)
Log of AADT2	0.3536 (0.1163, 0.0024)	0.5311 (0.0996, .0001)	-0.041 (0.325, 0.8996)	-0.043 (0.326, 0.8957)
PKLEFT1	0.1491 (0.0586, 0.0110)	N/A⁴	N/A⁴	N/A⁴
K³	0.7096	0.8814	0.857	0.857

¹ Vogt, 1999, (p. 117)

² The report presents this model developed with AADT1 and AADT2 only

³ K: Overdispersion value

⁴ N/A: not available

Table 77. Validation Statistics for TOTACCI Type IV Model Using Georgia Data

Measure	Original Model¹	Revised Original Model²	Georgia³ (0.04 Mile)	Georgia³ (0.05 Mile)
Years used for validation	1993-1995	1993-1995	1996-1997	1996-1997
Number of sites	72	72	51	51
Pearson product-moment correlation coefficients	0.47	0.47	0.16	0.17
MPB	-0.17	1.28	1.81	1.76
MPB/yr	-0.06	0.43	0.90	0.88
MAD	3.00	3.00	2.59	2.54
MAD/yr	1.00	1.00	1.29	1.27
MSE	24.92	24.85	N/A⁴	N/A⁴
MSE/yr²	2.77	2.76	N/A⁴	N/A⁴
MSPE	N/A⁴	N/A⁴	12.32	12.18
MSPE/yr²	N/A⁴	N/A⁴	3.08	3.05

¹ Used the original main model in the report. This model includes PKLEFT1

² Used the same coefficients in the original model, but PKLEFT1 was removed from the model by dividing by the exponential value of the coefficient of this variable times its average effect

³ Used the revised original model

⁴ N/A: not available

Values of 0.16 and 0.l7 of the Pearson product-moment correlation coefficients indicate that the accident predictions by the original models are not strongly linearly correlated with the observed number of accidents in the Georgia data. Again, there are several possible explanations for this. The MPBs and MAD per year was larger than those for the original models. The MSPEs per year squared were also slightly higher than the MSEs per year squared.

Figure 10 depicts the prediction performance of the original model for individual sites in the Georgia 0.05-mile data. It is quite evident that the original model does not fit the Georgia data well, a finding that would have been expected on the basis of the low Pearson product-moment coefficients.

Figure 10. Observed versus Predicted Accident Frequency: TOTACCI

Injury Accident Model (INJACC)

The parameter estimates, their standard errors, and p-values are given in table 78. Again, all of the variables including the constant term were insignificant for the Georgia data, and AADT2 was estimated with an opposite sign to that of the original model. The overdispersion values for the Georgia data were higher than that for the original model.

Table 78. Parameter Estimates for INJACC Type IV Model Using Georgia Data

Variable	Original Estimates¹ (s.e., p-value)	Georgia Data 0.04 Mile² (s.e., p-value)	Georgia Data 0.05 Mile² (s.e., p-value)
Constant	-12.5296 (2.9908, 0.0001)	-4.811 (4.912, 0.4018)	-5.260 (4.778, 0.3392)
Log of AADT1	0.9505 (0.3284, 0.0038 )	0.599 (0.467, 0.1990 )	0.652 (0.457, 0.1543 )
Log of AADT2	0.3237 (0.1645, 0.0491)	-0.191 (0.374, 0.6100)	-0.162 (0.367, 0.6586)
PKLEFT1	0.0994 (0.0433, 0.0216)	N/A⁴	N/A⁴
SPD2	0.0339 (0.0179, 0.0577)	0.010 (0.031, 0.7379)	0.005 (0.031, 0.8732)
K³	0.4308	0.649	0.645

¹ Vogt, 1999, (p. 118)

² PKLEFT1 was not included in the model

³ K: Overdispersion value

⁴ N/A: not available

Previous | Table of Contents | Next

Page Owner: Office of Research, Development, and Technology, Office of Safety, RDT

Topics: research, safety, intersection safety
Keywords: research, safety, Accident modification factors, Traffic safety, Signalized intersections, Crash models, Crash model validation, Interactive highway safety design model
TRT Terms: Traffic accidents–United States–Forecasting, Roads–United States–Interchanges and intersections–Mathematical models, Rural roads–United States, Low-volume roads–United States, signalized intersections
Scheduled Update: Archive - No Update needed

This page last modified on 03/08/2016