U.S. Department of Transportation
Federal Highway Administration
1200 New Jersey Avenue, SE
Washington, DC 20590
202-366-4000
Federal Highway Administration Research and Technology
Coordinating, Developing, and Delivering Highway Transportation Innovations
REPORT |
This report is an archived publication and may contain dated technical, contact, and link information |
|
![]() |
Publication Number: FHWA-HRT-17-070 Date: August 2017 |
Publication Number: FHWA-HRT-17-070 Date: August 2017 |
When planning a before–after safety evaluation study, it is vital to ensure that enough data are included to detect the expected change in safety with statistical confidence. Even though those designing the study do not know the expected change in safety in the planning stage, it is still possible to make a rough determination of how many sites are required based on the best available information about the expected change in safety. Alternatively, one could estimate the statistically detectable change in safety for the number of available sites. For a detailed explanation of sample size considerations, as well as estimation methods, see chapter 9 of Hauer.(10) The sample size analysis cases presented in this report address (1) how large a sample would be required to statistically detect an expected change in safety and (2) the change in safety that could be detected with available sample sizes.
For this analysis, the research team assumed that a conventional before–after study with comparison group design would be used because available sample size estimation methods were based on this assumption. The sample size estimates from this method would be conservative in that the EB methodology proposed would likely require fewer sites. To facilitate the analysis, the research team also assumed that the number of comparison sites was equal to the number of treatment sites, which again, was a conservative assumption.
As discussed earlier in the literature review, the crash types that cable median barriers and rumble strips on the inside shoulder would most affect would be cross-median crashes. For the study design, the research team assumed that crashes that were coded as head-on and opposite-direction sideswipe were cross-median. The research team used crash rates from the reference groups for total crashes and injury and fatal crashes; they used crash rates from the treatment group after reducing by 25 percent for cross-median crashes. (The team chose this reduction percentage based on the work by Bahar, which indicated that possible bias due to regression-to-the-mean was not likely to be higher than 25 percent.)(11)Table 2 provides the crash rate assumptions.
Crash Type |
Kentucky (A) |
Missouri (B) |
Illinois |
---|---|---|---|
Total |
8.25 |
1.91 |
6.95 |
Injury and fatal |
1.90 |
0.62 |
1.53 |
Cross-median |
0.27 |
0.03 |
0.07 |
Table 3 provides estimates of the required number of before- and after-period mile-years for statistical significance at both a 90- and 95-percent confidence level for both crash rate assumptions. The minimum sample indicates the level at which a study would seem to be worthwhile; that is, it would be feasible to detect with the desired level of confidence the largest effect that might reasonably be expected based on what was currently known about the strategy. The research team based these sample size calculations on the methodology in Hauer and on specific assumptions regarding the number of crashes per mile and years of available data.(10) Mile-years are the number of miles of highway on which the strategy was implemented multiplied by the number of years of data before or after implementation. For example, if a strategy was implemented at a 10-mi segment and data were available so far for 4 years since implementation, then there would be a total of 40 mi-year of after-period data available for the study.
Note: Assumes equal number of mile-years for treatment and comparison sites and equal length of before and after periods.The sample size values recommended in this study are indicated with an asterisk in table 3. These values are recommended based on the likeliness of obtaining the estimated sample size as well as the anticipated effects of the treatment. As noted, the sample size estimates provided were conservative in that the state-of-the-art EB methodology proposed for the evaluations would require fewer sites than the less robust conventional before–after study with a comparison group that had to be assumed for the calculations. Estimates can be predicted with greater confidence, or a smaller reduction in crashes would be detectable, if there were more site-years of data available in the after period. The same holds true if the actual data used for the analysis had a higher crash rate for the before period than was assumed.
The standard deviations of the estimated percent change in safety describe the statistical accuracy attainable for a given sample size. From this, one can estimate p-values for various sample sizes and expected change in safety for a given crash history based on the method in Hauer.(10)
For the available data in the three States in this evaluation, the research team estimated the minimum percentage changes in crash frequency that could be statistically detectable at 5- and 10-percent significance levels, as shown in table 4. For these calculations, the research team assumed that data would be available until the end of 2012. The results indicated that the data should be able to detect the recommended crash reduction values from table 3 if such an effect were present. Using these results, the authors made a decision to proceed with the evaluation using the data available at that time.
Note: Results are to nearest 1 percent.