U.S. Department of Transportation
Federal Highway Administration
1200 New Jersey Avenue, SE
Washington, DC 20590
202-366-4000
Federal Highway Administration Research and Technology
Coordinating, Developing, and Delivering Highway Transportation Innovations
REPORT |
This report is an archived publication and may contain dated technical, contact, and link information |
|
Publication Number: FHWA-HRT-13-098 Date: January 2014 |
Publication Number: FHWA-HRT-13-098 Date: January 2014 |
Figure 61 through figure 66 depict the breakdown of the total observed crossings across all 20 locations. Because all of the crossings are combined across locations, the proportions of each type of crossing may not be typical of all locations. Because Location 3 was the only location that included crossings at an unmarked intersection, the sum values are not the same in every figure.
Figure 61. Chart. Distribution of all crossings observed across the 20 different locations by the area in which they took place.
Figure 62. Chart. Distribution of crossings observed across all 20 locations in the marked intersections by circumstances of the crossing.
Figure 63. Chart. Distribution of crossings observed across all 20 locations in the unmarked non-intersections by the circumstances of the crossing.
Figure 64. Chart. Distribution of crossings observed across all 20 locations for all rule-breaking crossings by the circumstances of the crossing (including those in the unmarked non-intersections).
Figure 65. Chart. Distribution of crossings observed across all 20 locations for crossings made entirely during the walk light phase in the marked intersections, by the circumstances of the crossing.
Figure 66. Chart. Distribution of crossings observed across all 20 locations for rule-breaking crossings in marked intersections (i.e., crossings that took place at least partially during the don’t walk light phase) by the circumstances of the crossing.
Using data from all 20 locations, a model was successfully created in SAS® to predict whether pedestrians would cross at the marked intersection (and not the unmarked non-intersections). There were 68,056 pedestrian crossings among the 20 locations, 62,838 of which occurred at marked intersections. (Note that crossings at the unmarked intersection at Location 3 were not included in the model.) In other words, approximately 92 percent of the crossings occurred at the marked intersections.
First, a factor analysis with an orthogonal varimax rotation was used to describe the underlying relationships among the 16 environmental variables (see table 46). Based on the greater-than-one rule for the eigenvalues, five factors were kept for the rotation. The factor loadings are included. Combined, these factors accounted for approximately 74 percent of the standardized variance in the data.
Table 46. Environmental factors and their labels used to calculate the crossing location prediction model.
Label |
Environmental Variable |
Coding |
A |
Distance to the next marked crosswalk |
Distance in ft |
B |
AADT |
Expressed in thousands and rounded to the nearest 100 |
C |
One-way or two-way street |
1 or 2 |
D |
Presence of physical barriers that might prevent a pedestrian from crossing the roadway |
No barrier (0), partial barrier (1), or mostly blocked/large barrier (2) |
E |
Presence of a bus stop |
None (0), bus exit near marked intersection (1), bus exit at Non-Intersection (2) |
F |
Range of the number of trip originators/ destinations |
Range from very few (1) to a lot (5) |
G |
Presence of parking along the roadway |
Yes (1) or no (0) |
H |
Presence of a center turning lane |
Yes (1) or no (0) |
I |
Presence of a right turn only turning lane |
Yes (1) or no (0) |
J |
Length of walk phase |
Time in s |
K |
Length of don’t walk phase |
Time in s |
L |
Curb-to-curb distance |
Distance in ft |
M |
Presence and type of median |
No median (0), soft (1), hard (2), soft, not raised median |
N |
Presence of cross streets between marked crosswalks. |
No cross street (0), light-controlled cross street (1), not light-controlled cross street (2) |
O |
Far marked crosswalk light controlled |
Yes (1) or no (0) |
P |
Travel pace |
ft/s |
Note: Values in parentheses are the values assigned to categorical variables.
Table 47. Factor loadings for the 16 environmental variables.
Variable |
Factor 1 |
Factor 2 |
Factor 3 |
Factor 4 |
Factor 5 |
A |
-23 |
-4 |
74 |
13 |
21 |
B |
24 |
86 |
11 |
-9 |
-6 |
C |
11 |
86 |
13 |
19 |
-3 |
D |
2 |
6 |
-9 |
82 |
-14 |
E |
7 |
-39 |
17 |
51 |
16 |
F |
-40 |
4 |
-4 |
-36 |
51 |
G |
10 |
-19 |
-7 |
-7 |
79 |
H |
-2 |
6 |
10 |
86 |
-5 |
I |
-25 |
-3 |
29 |
-30 |
65 |
J |
-92 |
-5 |
6 |
-2 |
9 |
K |
78 |
33 |
-3 |
13 |
-2 |
L |
28 |
75 |
34 |
-13 |
21 |
M |
-2 |
38 |
76 |
3 |
-13 |
N |
23 |
16 |
84 |
-3 |
-12 |
O |
2 |
29 |
-12 |
29 |
75 |
P |
94 |
19 |
8 |
-8 |
2 |
Indicates a loading of 60 or greater in absolute value.
Each of the five factors is assigned a descriptive label based on its correlations with relative variables:
· Factor 1, Travel Pace and Phasing, is negatively correlated with the length of the walk phase, positively correlated with the length of the do not walk phase, and positively correlated with the travel pace.
· Factor 2, Traffic Throughput, is positively correlated with the AADT, traffic directionality (one- or two-way street), and the curb-to-curb distance (i.e., the width of the street).
· Factor 3, Distance to Safety, is positively correlated with the distance to the next marked crosswalk, the presence and type of median, and the presence of cross streets between marked crosswalks.
· Factor 4, External Objects (Barriers/Vehicles) in Center of Road, is positively correlated with the presence of physical barriers that might prevent a pedestrian from crossing the roadway and the presence of a center turning lane.
· Factor 5, External Objects (Vehicles) on Sides of Road, is positively correlated with the presence of parking along the roadway, the presence of a right turn only lane, and whether or not the far marked crosswalk is light controlled.
Scores for each of the five factors were generated using the standardized scoring coefficients listed in table 48. These coefficients can only be used with the standardized data and not with the raw, observed values. Each of the five factor scores is graphed against the percentage of intersection crossings for each location in figure 67.
Table 48. Standardized scoring coefficients for the five rotated factors.
Variable |
Factor 1 |
Factor 2 |
Factor 3 |
Factor 4 |
Factor 5 |
A |
-0.05047 |
-0.10759 |
0.38127 |
0.07187 |
0.08558 |
B |
-0.04080 |
0.36229 |
-0.07173 |
-0.04503 |
-0.03821 |
C |
-0.09979 |
0.38416 |
-0.07152 |
0.10124 |
-0.01225 |
D |
-0.04177 |
0.05360 |
-0.06575 |
0.39620 |
-0.01480 |
E |
0.09552 |
-0.22544 |
0.14359 |
0.25108 |
0.12869 |
F |
-0.12612 |
0.07613 |
-0.04661 |
-0.12670 |
0.20952 |
G |
0.12938 |
-0.11604 |
-0.01154 |
0.01230 |
0.41212 |
H |
-0.04871 |
0.03214 |
0.02615 |
0.41759 |
0.02701 |
I |
-0.03224 |
-0.03729 |
0.13973 |
-0.10183 |
0.29012 |
J |
-0.36946 |
0.11821 |
0.00329 |
0.02802 |
-0.01769 |
K |
0.26616 |
0.03607 |
-0.04015 |
0.04254 |
0.04615 |
L |
0.02736 |
0.25549 |
0.06317 |
-0.05440 |
0.09691 |
M |
-0.05098 |
0.06517 |
0.33786 |
-0.00163 |
-0.08623 |
N |
0.09721 |
-0.09803 |
0.42527 |
-0.04528 |
-0.06540 |
O |
-0.00290 |
0.14773 |
-0.12528 |
0.19947 |
0.39702 |
P |
0.37272 |
-0.08072 |
0.04904 |
-0.06944 |
0.06522 |
Notes: The dots remain in consistent locations from box to box.
solid line = regression line, shaded area = 95-percent confidence limits of the regression line).
Figure 67. Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location.
Next, the scores for the five factors were regressed on the probability of an intersection crossing through logistic regression using forward selection. The model iterated three times after the entrance of an intercept term. The scores for Factor 1, (1) = 176.6, p < 0.01, Factor 2, (1) = 41.6, p < 0.01, and Factor 3, (1) = 402.3, p < 0.01, were statistically significant. The resultant model is shown in figure 68 where π represents the probability of crossing in the marked intersection and x is the vector of factor scores for a given location. When using a logit (log odds) model, represents the odds of success when all predictors equal zero, and represents the multiplicative effect of a one-unit increase in the corresponding x on the odds of success, when the other predictors are held fixed.
Figure 68. Equation. The logit of crossing at the intersection.
A good probability cutoff point for this data is 0.9, meaning that any future predictions using the model that are 0.9 or greater should be deemed an intersection crossing. With this cutoff, the model can successfully predict approximately 90 percent of the crossings overall. The resultant sensitivity and specificity are about 96 percent and 10 percent, respectively. Sensitivity measures how many events (intersection crossings) were successfully predicted, and specificity measures how many non-events (non-intersection crossings) were successfully predicted. The predicted probabilities of crossing at the intersection for each location are shown in table 49.
Table 49. Probabilities of correctly predicting a crossing at the marked intersection or the unmarked non-intersection by location and the corresponding upper and lower 95-percent confidence limits.
Location |
Total Crossings |
Estimated Probability |
Lower 95 Percent Confidence Limit |
Upper 95 Percent Confidence Limit |
1 |
1,110 |
0.91203 |
0.90715 |
0.91668 |
2 |
4,631 |
0.90319 |
0.89779 |
0.90834 |
3 |
2,878 |
0.90348 |
0.89874 |
0.90802 |
4 |
13,199 |
0.91365 |
0.90995 |
0.91721 |
5 |
10,635 |
0.94112 |
0.93862 |
0.94353 |
6 |
16,418 |
0.93063 |
0.92710 |
0.93400 |
7 |
12,958 |
0.94419 |
0.94152 |
0.94675 |
8 |
1,574 |
0.80553 |
0.79210 |
0.81829 |
9 |
805 |
0.80876 |
0.79562 |
0.82123 |
10 |
528 |
0.91159 |
0.90585 |
0.91701 |
11 |
17 |
0.89962 |
0.89213 |
0.90664 |
12 |
185 |
0.90793 |
0.90141 |
0.91406 |
13 |
609 |
0.85845 |
0.84056 |
0.87463 |
14 |
205 |
0.92484 |
0.92110 |
0.92842 |
15 |
840 |
0.93348 |
0.93106 |
0.93583 |
16 |
280 |
0.93036 |
0.92719 |
0.93340 |
17 |
225 |
0.94899 |
0.94636 |
0.95150 |
18 |
786 |
0.95218 |
0.94816 |
0.95590 |
19 |
84 |
0.90816 |
0.90462 |
0.91158 |
20 |
89 |
0.92066 |
0.91693 |
0.92424 |
A second model was developed to predict crossings that occurred entirely during the walk phase in the marked intersection. Crossings were divided into two categories—crossings that occurred within the marked intersection entirely during the walk light phase and all other crossings (e.g., unmarked non-intersection, marked intersection during the don’t walk light phase, etc.). Using this classification system, there were 70,378 pedestrian crossings among the 20 locations, 55,040 of which occurred entirely within the walk phase in the marked intersection. In other words, approximately 78 percent of the crossings were rule-following crossings. Using the factor analysis and standardized scoring coefficients (table 48), each of the five factor scores is graphed against the percentage of rule-following crossings for each location in figure 69.
Notes: The dots remain in consistent locations from box to box.
solid line = regression line, shaded area = 95-percent confidence limits of the regression line).
Figure 69. Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location.
Next, the scores for the five factors were regressed on the probability of a rule-following crossing through logistic regression using forward selection (see figure 70). The model iterated five times after the entrance of an intercept term. The scores for Factor 1, (1) = 607.6, p < 0.01, Factor 2, (1) = 1224.5, p < 0.01, Factor 3, (1) = 187.4, Factor 4, (1) = 5783.2, p < 0.01, and Factor 5, (1) = 388.4, p < 0.01, were all statistically significant.
Figure 70. Equation. The logit of crossing at the intersection during the walk phase.
A good probability cutoff point for this data is 0.6, meaning that any future predictions using the model that are 0.6 or greater should be deemed a rule-following crossing. With this cutoff, the model can successfully predict approximately 79 percent of the crossings overall. The resultant sensitivity and specificity are about 94 percent and 23 percent, respectively. The predicted probabilities of crossing during the walk phase at the intersection for each location are shown in table 50.
Table 50. Probabilities of correctly predicting a crossing at the marked intersection entirely during the walk phase by location and the corresponding upper and lower 95‑percent confidence limits.
Location |
Total Crossings |
Estimated Probability |
Lower 95 Percent Confidence Limit |
Upper 95 Percent Confidence Limit |
1 |
1,110 |
0.7514 |
0.7251 |
0.7759 |
2 |
4,631 |
0.8791 |
0.8694 |
0.8882 |
3 |
2,878 |
0.4456 |
0.4321 |
0.4591 |
4 |
13,199 |
0.6798 |
0.6718 |
0.6877 |
5 |
10,635 |
0.9343 |
0.9295 |
0.9389 |
6 |
16,418 |
0.9062 |
0.9017 |
0.9106 |
7 |
12,958 |
0.7546 |
0.7471 |
0.7619 |
8 |
1,574 |
0.7421 |
0.7199 |
0.7631 |
9 |
805 |
0.7764 |
0.7463 |
0.8039 |
10 |
528 |
0.8431 |
0.8106 |
0.8710 |
11 |
17 |
0.6428 |
0.5546 |
0.7223 |
12 |
185 |
0.6431 |
0.5752 |
0.7057 |
13 |
609 |
0.2855 |
0.2530 |
0.3204 |
14 |
205 |
0.8054 |
0.7656 |
0.8399 |
15 |
840 |
0.5515 |
0.5181 |
0.5845 |
16 |
280 |
0.8286 |
0.7798 |
0.8683 |
17 |
225 |
0.7344 |
0.6730 |
0.7879 |
18 |
786 |
0.7304 |
0.6983 |
0.7603 |
19 |
84 |
0.8690 |
0.7788 |
0.9260 |
20 |
89 |
0.4227 |
0.3397 |
0.5103 |