U.S. Department of Transportation
Federal Highway Administration
1200 New Jersey Avenue, SE
Washington, DC 20590
202-366-4000
Federal Highway Administration Research and Technology
Coordinating, Developing, and Delivering Highway Transportation Innovations
REPORT |
This magazine is an archived publication and may contain dated technical, contact, and link information |
|
Publication Number: FHWA-HRT-15-048 Date: June 2015 |
Publication Number: FHWA-HRT-15-048 Date: June 2015 |
The study design involved a sample size analysis and prescription of needed data elements. The sample size analysis assessed the size of a sample required to statistically detect an expected change in safety and also determined what changes in safety can be detected with likely available sample sizes.
Sample Size Estimation Overview
Sample size estimations require assumptions of the expected treatment effect and the average crash rate at treatment sites prior to treatment. Minimum and desired sample sizes were calculated assuming a conventional before-after with comparison group (C-G) study design, as described in Hauer and a literature review of likely safety effects.(6) The sample size analysis undertaken for this study addressed the size of sample required to statistically detect an expected change in safety. The sample size estimates are conservative because the more robust EB methodology is actually used in the before-after analysis rather than the C-G methodology.
Sample sizes were estimated for various assumptions of the likely annual crash rate in the before period and likely safety effects of the strategy. Annual crash rates were assumed for five crash types (i.e., total, injury, run-off-road, head-on plus sideswipe-opposite-direction, and all target crashes (run-off-road plus head-on plus sideswipe-opposite-direction)), as shown in table 3. Intersection-related and animal crashes are not included in these crash rates. These crash rates, which were obtained from preliminary data for the untreated reference group data collected for the EB analysis, represent a range of mean crash rates. Only crash rates from Pennsylvania and Kentucky were used at the time of the sample size analysis. The study design assumed that the number of comparison sites would be equal to the number of treatment sites for a C-G study.
Crash Type | Pennsylvania (Crashes/Mi/Year) | Kentucky (Crashes/Mi/Year) |
---|---|---|
All | 0.96 | 1.21 |
Injury1 | 0.51 | 0.38 |
Run-Off-Road | 0.15 | 0.19 |
Head-On+Sideswipe-Opposite-Direction | 0.06 | 0.08 |
All Target Crashes | 0.21 | 0.27 |
1Non-injury crash type proportions assumed to be same as Pennsylvania for run-off-road, head-on, and sideswipe-opposite-direction.
Table 4 provides estimates of the required number of before and after period mile-years for statistical significance at both a 90- and 95-percent confidence level for both crash rate assumptions. The minimum sample indicates the level for which a study seems worthwhile; that is, it is feasible to detect with the level of confidence the largest effect that may reasonably be expected based on what is currently known about the strategy. These sample size calculations were based on specific assumptions regarding the number of crashes per mile and years of available data. Mile-years are the number of miles where the strategy was implemented multiplied by the number of years of data before or after implementation. For example, if a strategy was implemented at a 9-mi segment and data are available so far for 3 years since implementation, then there are a total of 27 mi-year of after period data available for the study.
Expected Percent Reduction in Crashes | 95-Percent Confidence Level | 90-Percent | |||
---|---|---|---|---|---|
PA Rate |
KY Rate |
PA Rate |
KY Rate | ||
Total |
5 |
1,057 |
444 |
740 |
311 |
10 |
216 |
91 |
151 |
64 | |
20 |
77 |
32 |
54 |
23 | |
30 |
33 |
14 |
23 |
10 | |
Injury |
10 |
2,508 |
3,366 |
1,756 |
2,357 |
20 |
512 |
687 |
359 |
481 | |
30 |
182 |
244 |
127 |
171 | |
40 |
79 |
106 |
55 |
74 | |
Run-Off-Road |
10 |
8,528 |
6,733 |
5,971 |
4,714 |
20 |
1,742 |
1,375 |
1,219 |
963 | |
30 |
618 |
488 |
432 |
341 | |
40 |
269 |
212 |
188 |
149 | |
Head- On+Sideswipe- Opposite-Direction |
10 |
21,321 |
15,991 |
14,927 |
11,195 |
20 |
4,354 |
3,265 |
3,048 |
2,286 | |
30 |
1,544 |
1,158 |
1,081 |
811 | |
40 |
672 |
504 |
471 |
353 | |
All Target Crashes |
10 |
6,092 |
4,738 |
4,265 |
3,317 |
20 |
1244 |
968 |
871 |
677 | |
30 |
441 |
343 |
309 |
240 | |
40 |
192 |
149 |
134 |
105 |
1Assumes equal number of mile-years for treatment and comparison sites and equal length of before and after periods.
2Bold indicates sample size values recommended in this study.
The sample size values recommended in this study are highlighted in bold in table 4. These values are recommended based on the likeliness of obtaining the estimated sample size as well as the anticipated effects of the treatment. As noted, the sample size estimates provided are conservative in that the state-of-the-art EB methodology proposed for the evaluations would require fewer sites than the less robust conventional before-after study with a comparison group that had to be assumed for the calculations. Estimates may be predicted with greater confidence or a smaller reduction in crashes will be detectable if there are more site-years of data available in the after period. The same holds true if the actual data used for the analysis had a higher crash rate for the before period than was assumed.
Following the data collection for both the before and after periods, the total mile-years of data available was 6,392 for the before period and 2,623 for the after period. For the available data, the minimum percentage change in crash frequency that could be statistically detectable at 95- and 90-percent confidence levels was estimated using the same crash rates found in table 3. The calculations are based on methodology in Hauer.(6) The results, which are shown in table 5, indicate that the data should be able to detect the recommended crash reduction values from table 4, if such an effect were present. Using these results, a decision was made to proceed with the evaluation using the data available at the time.
Crash Type | Mile-Years in Before Period | Mile-Years in After Period | 90-Percent Confidence Level1 | 95-Percent Confidence Level1 |
---|---|---|---|---|
Total | 6,392 | 2,623 |
5.0 |
7.5 |
Injury | 7.5 |
10.0 | ||
Run-Off-Road | 10.0 |
12.5 | ||
Head-On+Sideswipe-Opposite-Direction | 17.5 |
20.0 | ||
All Target Crashes | 7.5 |
10.0 |
1Minimum percent reduction detectable for crash rate assumption. Minimum percent reduction is rounded to nearest 2.5 percent.