This report is an archived publication and may contain dated technical, contact, and link information

Publication Number: FHWA-HRT-05-048
Date: April 2005

Safety Evaluation of Red-Light Cameras

PDF files can be viewed with the Acrobat® Reader®

X. Data and SPF Preparation

Before the actual data analyses, preliminary efforts involving file merging and data quality checks were completed.

Crash Data Linkage to Intersections

In most State DOT crash and inventory files, as is the case with the State data in FHWA's Highway Safety Information System (HSIS), crashes can be computer-linked to inventory and traffic volume data for roadway segments and intersections using location reference variables such as route or milepost on each record. This was possible in Howard and Montgomery Counties, where project staff were provided an electronic file of the milepost book used by police officers in the field. The treatment and reference intersections were identified and matched to crashes based on this milepost information.

This was not the case for the intersections analyzed in other jurisdictions. In Charlotte, NC, the 1997 and later data included intersection control numbers for all intersections and crashes that could be used for file linkage. There were also control numbers in the pre-1997 data, but they differed from the later data. The Charlotte staff provided us with conversions between the new and old systems.

For all data for the three California jurisdictions and Baltimore, MD, no such location system existed, and the crashes had to be manually linked to pertinent intersections based on the names of the crossing streets. The crashes were sorted by street names and an analyst matched the crash-report streets with the street names from the treated and comparison intersection file. All combinations of crash-report street names were checked to pick up possible misspellings by the investigating officer or coder.

The project team was able to conduct a limited verification of both the completeness of the State CODES data files and the manual linkage procedures using El Cajon, CA, data. The local traffic engineer sent the project team crash summary reports for one treatment, one signalized reference, and one unsignalized control intersection. These summary reports contain a listing of all cases that have been coded to an intersection by city staff, using their own coding scheme. The comparison of these crashes to those identified and linked by the project team indicated that use of the State data resulted in minor differences with the local crash summaries. It was thus concluded that the State data is of sufficient quality.

Defining Red-Light-Running Crashes

As indicated earlier, the basic analyses were to be focused on target crashes, those red-light-running crashes that could be affected by the RLC treatment. The analysis would also examine other intersection crashes to confirm that unanticipated effects were not present. Because there is no "red-light-running" crash category on most police crash forms, these target crashes must be defined based on variables on the form. Definitions could range from only crashes in which a citation for a traffic signal violation was given to all right-angle crashes and rear end crashes at or near the intersection. One could choose to include rear end crashes that were noted by the officer as "intersection-related" (where this variable was present), or to include all rear end crashes approaching the intersection within a specified distance of the intersection. Depending on the distance (X) chosen, the assumption would be that the RLC would affect behavior of the lead vehicle or vehicles, which could result in rear end crashes X distance back in the approaching queue of vehicles. One could also choose to include left-turn opposite-approach crashes because some of these would be red-light-running crashes if a protected signal phase existed. To further complicate matters, the different jurisdictions use slightly different definitions of right-angle crashes on the report form.

Based on definitions used in previous studies, available data variables in the current files, and project team discussions, the following general decisions were made by the project team:

In general, "RLC-related crashes" would include crashes in the intersection itself where one vehicle is "running the light," plus intersection-related rear end crashes that could be affected by RLC systems, including those rear end crashes occurring in the approach queue. Clearly, neither of these two types of crashes is explicitly defined in crash data. Thus, the following definitions were used.
"Red-light-running" crashes at the intersection proper were defined as "angle," "broadside," or "right- or left-turning" crashes involving two vehicles, with the vehicles entering the intersection from perpendicular approaches. "Perpendicular approaches" was defined using the compass directions of travel for each involved vehicle, a variable that was present in the data for all seven jurisdictions. In most jurisdictions, all crashes meeting these "crossing" criteria and occurring at or within 6.096 m (20 ft) of the intersection were captured. (A second definition of these RLR crashes includes crashes involving a left-turning and a through vehicle from opposite approaches on the same roadway. This would capture those vehicles running the red signal either during or before or after a protected signal phase.)
Rear end crashes used in the analyses were those defined as "rear end" by the crash type and occurring on any approach within 45.72 m (150 ft) of the intersection. Total intersection-related crashes were also analyzed. The definitions for each jurisdiction appear in table 15.

As could be expected, available crash variables and codes differed between cities, making it impossible to have totally consistent definitions across all seven jurisdictions. For example, only the three Maryland jurisdictions had an "intersection-related" code that can be used to further screen rear end crashes occurring within 45.72 m (150 ft) of the intersection. Thus, all rear end crashes within 45.72 m (150 ft) were used in Charlotte, NC, and all three California databases.

In addition, we encountered significant problems with the distance-from-intersection data in Baltimore, MD. Approximately 10 percent to 15 percent of the data appear to have questionable distances such as distances of 0.03 m (0.1 ft) and 0.30 m (1.0 ft) from the intersection. The project team attempted to verify these distances by obtaining hardcopies, but found that the accident case numbers in the computerized CODES data were not the same as the Baltimore Police Department case numbers, and only Baltimore has hardcopies of the reports. Thus, in the Baltimore analyses, two sets of data were used, a first set containing only rear end crashes within 45.72 m (150 ft) where the distance data were believed to be accurate, and a fuller set that also included crashes coded as within 45.72 m (150 ft), where the distance measurements were questionable. The analyses of these two sets of data revealed no significant differences; therefore the full set including the questionable distances was used for the final analysis.

The final set of criteria for each RLC-related crash type for each jurisdiction is listed in table 11.

Table 11. Definitions of crash types used in the analyses for each jurisdiction.

El Cajon, San Diego, San Francisco, CA

Intersection-related -All crashes at or within 6.096 m (20 ft) of intersection; rear end crashes within 45.72 m (50 ft).

Right-angle 1 (RA1)-Broadside, head-on, or sideswipe where vehicles approach intersection from perpendicular directions. (California does not have "left-turn" or "right-turn" as a crash type. Because there could be crashes from perpendicular directions where one of the vehicles is turning, it is assumed that all turning crashes are coded as either broadside, head-on, or sideswipe.)

Right-angle 2 (RA2)-Crashes in RA1 plus opposite direction left-turn. This may not be as precise a definition as RA1 because it could include non-RLR crashes in which the oncoming vehicle and the turning vehicle both had a green signal. That is, these are not restricted to locations with protected left-turn phases only. However, opposite direction left-turn crashes do include RLR crashes in which a vehicle turning left at the end of a green phase (referred to as a "sneaker" in traffic engineering terminology) is broadsided by a vehicle from the opposing direction that is technically running a red light.

Rear end-All rear end crashes within 45.72 m (150 ft) of intersection.

Charlotte, NC

Intersection related-All crashes at or within 6.096 m (20 ft) of intersection; rear end crashes within 45.72 m (150 ft) of intersection.

Right-angle 1 (RA1)-Angle, head-on, sideswipe, left-turn different roadways, right-turn different roadways where vehicles approach intersection from perpendicular directions.

Right-angle 2 (RA2)-Those in RA1 plus opposite direction left-turn.

Rear end -All rear end crashes within 45.72 m (150 ft) of intersection.

Howard County, Montgomery County, MD

Intersection-related-All crashes within 48.158 m (158 ft) and identified as "intersection" or "intersection-related."

Right-angle 1 (RA1)-Vehicles approach intersection from perpendicular directions, in any category of head-on, head-on left-turn, opposite direction sideswipe, straight movement angle, angle meets right-turn, angle meets left-turn, or angle meets left head-on.

Right-angle 2 (RA2)-Those in RA1 plus opposite direction left-turn.

Rear end-All rear end crashes within 48.158 m (158 ft) and identified as "intersection" or "intersection-related."

Baltimore, MD

Intersection-related-All crashes within 48.158 m (158 ft) and identified as "intersection" or "intersection-related."

Right-angle 2 (RA2)-Those in RA1 plus opposite direction left-turn.

Rear end-All rear end crashes within 48.158 m (158 ft) and identified as "intersection" or "intersection-related."

Development of Safety Performance Functions

As indicated earlier, the study required the development of safety performance functions (SPFs) for signalized and stop-controlled intersections. A reference group of untreated signalized intersections was used to develop SPFs to account for traffic volume changes and regression to the mean using the empirical Bayes procedure. The unsignalized intersection SPFs were used to account in that procedure for time trends in crash counts unrelated to the RLC installation. Therefore, it was necessary to first ensure that the comparison group used to calibrate the SPFs was suitable for this purpose, that is, that it had similar crash trends to the treatment group over the years before RLC installation. To this end a comparability test as outlined in Hauer was performed.⁽⁴⁾ This test confirmed the suitability of the comparison group.

To build the strongest possible SPFs, reference group data (i.e., data from the untreated signalized intersections) were combined for sets of jurisdictions, considering proximity and similarity in crash reporting practices. To this end, the three California cities of El Cajon, San Diego, and San Francisco were combined. Not only are these three cities in proximity, but they also do not have full reporting of PDO crashes, and the crash data all came from the State database maintained by the CHP. Howard and Montgomery Counties, MD, reference group data were combined because of their proximity and similarity in reporting practices. Baltimore, MD, and Charlotte, NC, were combined because of their high reporting of non-injury crashes. In each case where jurisdictions were combined, a jurisdiction-specific multiplier was calibrated and applied to account for any remaining differences in crash reporting.

Development of the SPFs involved determining which explanatory variables should be used, whether and how variables should be grouped, and how variables should enter into the model, in other words, the best model form. Generalized linear modeling was used to estimate model coefficients using the software package GENSTAT and assuming a negative binomial error distribution, all consistent with the common recent research practice in developing these models.⁽²⁹⁾

In specifying a negative binomial error structure, the dispersion parameter, k, which relates the mean and variance of the regression estimate, is iteratively estimated from the model and the data. The value of k is such that the smaller its value, the better a model is for a given set of data.

For specific crash types at signalized intersections, a multiplier is applied to the model that is equal to the proportion of total crashes that each crash type makes up. A value of k was calculated for each crash type using a maximum likelihood process, as explained earlier. Similarly, although data for groups of jurisdictions were combined for SPF calibration, separate multipliers and k values were calculated for each jurisdiction.

The inclusion of variables such as number of lanes rarely significantly affected the fit. This is not surprising because, as previous research has shown, much of the variation in crash experience is explained by the volume of traffic entering an intersection. The results of the SPF calibration for the signalized reference group are presented in table 12. The model forms used are tried and tested and, because of the limited datasets available, options on model forms and variables to include were so limited that a trial and error modeling approach, using published models as a guide, was realistic. In addition, fine tuning the model is not critical in EB analysis, especially because by weighting the observed count, one is accounting for omitted variables that may affect crash frequency.

Table 12. Safety performance functions for the signalized intersections reference group.

3-legged
		El Cajon	San Diego	San Francisco	Howard Co.	Montgomery Co.	Baltimore		Charlotte
Model form crashes/year		α(F1+F2)^b			α(F1+F2)^bexp(minllane*e)		α(F1)^c(F2)^dexp(majllane*f)
Ln(α) (s.e.)		-5.240 (2.21)	-5.651 (2.22)	-5.240 (2.21)	-6.970 (1.800)	-6.970 (1.800)	-3.100 (1.240)		-3.100 (1.240)
B (s.e.)		0.580 (0.218)	0.580 (0.218)	0.580 (0.218)	0.709 (0.183)	0.709 (0.183)	-		-
C (s.e.)		-	-	-	-	-	0.374 (0.119)		0.374 (0.119)
D (s.e.)		-	-	-	-	-	0.136 (0.080)		0.136 (0.080)
E (s.e.)		-	-	-	0.964 (0.297)	0.964 (0.297)	-		-
F (s.e.)		-	-	-	-	-	0.264 (0.075)		0.264 (0.075)
Total α, k		1.00, 0.18	1.00, 0.28	1.00, 0.28	1.00, 0.30	1.00, 0.30	1.00, 0.56		1.00, 0.28
Injury α, k		0.28, 0.13	0.31, 0.26	0.26, 0.26	0.12, 0.30	0.24, 0.21	0.15, 0.91		0.07, 0.24
Right-angle α, k		0.40, 0.67	0.35, 0.91	0.55, 0.91	0.35, 0.37	0.28, 0.14	0.44, 1.0		0.25, 0.45
Rear end α, k		0.41, 0.18	0.43, 0.25	0.22, 0.25	0.39, 0.63	0.44, 0.03	0.18, 1.1		0.61, 0.45
4-legged
	El Cajon		San Diego	San Francisco	Howard County	Montgomery County	Baltimore	Charlotte
Model form crashes/yr	α(F1+F2)^bexp(minrlane*e)				α(F1)^c(F2)^d		α(F1)^c(F2)^dexp(majllane*f)
Ln(α) (s.e.)	-3.950 (2.010)		-4.624 (2.021)	-4.477 (2.021)	-8.370 (1.090)	-8.370 (1.090)	-3.100 (1.240)	-3.100 (1.240)
B (s.e.)	0.530 (0.197)		0.530 (0.197)	0.530 (0.197)
C (s.e.)	-		-	-	0.703 (0.103)	0.703 (0.103)	0.374 (0.119)	0.374 (0.119)
D (s.e.)	-		-	-	0.335 (0.075)	0.335 (0.075)	0.136 (0.080)	0.136 (0.080)
E (s.e.)	-0.279 (0.129)		-0.279 (0.129)	-0.279 (0.129)	-	-	-	-
F (s.e.)	-		-	-	-	-	0.264 (0.075)	0.264 (0.075)
Total α, k	1.00, 0.19		1.00, 0.24	1.00, 0.24	1.00, 0.20	1.00, 0.20	1.00, 0.56	1.00, 0.28
Injury α, k	0.26, 0.14		0.29, 0.10	0.26, 0.10	0.16, 0.20	0.25, 0.25	0.15, 0.91	0.07, 0.24
Right-angle α, k	0.48, 0.34		0.42, 0.38	0.55, 0.38	0.38, 0.36	0.48, 0.45	0.44, 1.0	0.25, 0.45
Rear end α, k	0.32, 0.33		0.39, 0.48	0.22, 0.48	0.40, 0.45	0.32, 0.24	0.18, 0.9	0.61, 0.45

Table Legend:

F1 = entering AADT on major road, F2 = entering AADT on minor road; minllane = number of left-turn lanes on the minor road;

majllane = number of left-turn lanes on the major road; minrlane = number of right-turn lanes on the minor road; (s.e.) = standard error of the estimate;

k is a calibrated parameter relating the mean and variance used in the empirical Bayes estimation procedure

Previous | Table of Contents | Next

Page Owner: Office of Research, Development, and Technology, Office of Safety, RDT

Topics: research, safety, intersection safety, Stop Red Light Running Program
Keywords: research, safety, red light camera, Empirical Bayes, crash evaluation, economic analysis, signalized intersection
TRT Terms: Electronic traffic controls--Evaluation, Photography in traffic engineering, Cameras, Roads--Interchanges and intersections--Safety measures, Traffic safety--United States, Red light running, Cameras, Before and after studies, Economic analysis, Accident analysis, Accident characteristics
Scheduled Update: Archive - No Update needed

This page last modified on 03/08/2016