U.S. Department of Transportation
Federal Highway Administration
1200 New Jersey Avenue, SE
Washington, DC 20590
202-366-4000


Skip to content
Facebook iconYouTube iconTwitter iconFlickr iconLinkedInInstagram

Federal Highway Administration Research and Technology
Coordinating, Developing, and Delivering Highway Transportation Innovations

 
REPORT
This report is an archived publication and may contain dated technical, contact, and link information
Back to Publication List        
Publication Number:  FHWA-HRT-13-098    Date:  January 2014
Publication Number: FHWA-HRT-13-098
Date: January 2014

 

Human Factors Assessment of Pedestrian Roadway Crossing Behavior

Full Dataset

Overview of Crossings

Figure 61 through figure 66 depict the breakdown of the total observed crossings across all 20 locations. Because all of the crossings are combined across locations, the proportions of each type of crossing may not be typical of all locations. Because Location 3 was the only location that included crossings at an unmarked intersection, the sum values are not the same in every figure.

Chart. Distribution of all crossings observed across the 20 different locations by the area in which they took place. This pie chart divides the total number of crossings as follows: 55,040 marked intersection—rule-following crossings; 7,798 marked intersection—rule-breaking crossings; 5,218 unmarked non-intersection crossings; and 2,322 unmarked intersection crossings.

Figure 61. Chart. Distribution of all crossings observed across the 20 different locations by the area in which they took place.

Chart. Distribution of crossings observed across all 20 locations in the marked intersections by the circumstances of the crossing. Circumstances include unimpeded crossings, yielding, and evasive actions 51,489; unimpeded crossings with vehicle yielding 9,627; unimpeded crossing with pedestrian evasive action 1,547; unimpeded crossings with pedestrian yielding 163; and unimpeded crossings with vehicle evasive action 12.

Figure 62. Chart. Distribution of crossings observed across all 20 locations in the marked intersections by circumstances of the crossing.

Chart. Distribution of crossings observed across all 20 locations in the unmarked non-intersections by the circumstances of the crossing. Circumstances include unimpeded crossings, yielding, and evasive actions 4,707; unimpeded crossings with pedestrian evasive action 341; unimpeded crossings with pedestrian yielding 97; unimpeded crossings with vehicle yielding 70; unimpeded crossings with vehicle evasive action 3.

Figure 63. Chart. Distribution of crossings observed across all 20 locations in the unmarked non-intersections by the circumstances of the crossing.

Chart. Distribution crossings observed across all 20 locations for all rule-breaking crossings by circumstances of crossing (including those in the unmarked non-intersections). Circumstances include unimpeded crossings, yielding, and evasive actions 13,911; unimpeded crossings with pedestrian evasive action 1,063; unimpeded crossings with pedestrian yielding 205; unimpeded crossings with vehicle yielding 155; and unimpeded crossings with vehicle evasive action 4.

Figure 64. Chart. Distribution of crossings observed across all 20 locations for all rule-breaking crossings by the circumstances of the crossing (including those in the unmarked non-intersections).

Chart. Distribution of crossings observed across all 20 locations for crossings made entirely during the walk light phase in the marked intersections by the circumstances of the crossing. Circumstances include unimpeded crossings, yielding, and evasive actions 44,448; unimpeded crossings with vehicle yielding 9,559; unimpeded crossings with pedestrian evasive action 891; unimpeded crossings with pedestrian yielding 131; and unimpeded crossings with vehicle evasive action 11.

Figure 65. Chart. Distribution of crossings observed across all 20 locations for crossings made entirely during the walk light phase in the marked intersections, by the circumstances of the crossing.

Chart. Distribution of crossings observed across all 20 locations for rule-breaking crossings in marked intersections (i.e., crossings that took place at least partially during the don’t walk light phase) by the circumstances of the crossing. Circumstances included unimpeded crossings, yielding, and evasive actions 7,041; unimpeded crossings with pedestrian evasive action 656; unimpeded crossings with vehicle yielding 68; unimpeded crossings with pedestrian yielding 32; and unimpeded crossings with vehicle evasive action 1.

Figure 66. Chart. Distribution of crossings observed across all 20 locations for rule-breaking crossings in marked intersections (i.e., crossings that took place at least partially during the don’t walk light phase) by the circumstances of the crossing.

Prediction Model—Crossing Area

Using data from all 20 locations, a model was successfully created in SAS® to predict whether pedestrians would cross at the marked intersection (and not the unmarked non-intersections). There were 68,056 pedestrian crossings among the 20 locations, 62,838 of which occurred at marked intersections. (Note that crossings at the unmarked intersection at Location 3 were not included in the model.) In other words, approximately 92 percent of the crossings occurred at the marked intersections.

First, a factor analysis with an orthogonal varimax rotation was used to describe the underlying relationships among the 16 environmental variables (see table 46). Based on the greater-than-one rule for the eigenvalues, five factors were kept for the rotation. The factor loadings are included. Combined, these factors accounted for approximately 74 percent of the standardized variance in the data.

Table 46. Environmental factors and their labels used to calculate the crossing location prediction model.

Label

Environmental Variable

Coding

A

Distance to the next marked crosswalk

Distance in ft

B

AADT

Expressed in thousands and rounded to the nearest 100

C

One-way or two-way street

1 or 2

D

Presence of physical barriers that might prevent a pedestrian from crossing the roadway

No barrier (0), partial barrier (1), or mostly blocked/large barrier (2)

E

Presence of a bus stop

None (0), bus exit near marked intersection (1), bus exit at Non-Intersection (2)

F

Range of the number of trip originators/ destinations

Range from very few (1) to a lot (5)

G

Presence of parking along the roadway

Yes (1) or no (0)

H

Presence of a center turning lane

Yes (1) or no (0)

I

Presence of a right turn only turning lane

Yes (1) or no (0)

J

Length of walk phase

Time in s

K

Length of don’t walk phase

Time in s

L

Curb-to-curb distance

Distance in ft

M

Presence and type of median

No median (0), soft (1), hard (2), soft, not raised median

N

Presence of cross streets between marked crosswalks.

No cross street (0), light-controlled cross street (1), not light-controlled cross street (2)

O

Far marked crosswalk light controlled

Yes (1) or no (0)

P

Travel pace

ft/s

Note: Values in parentheses are the values assigned to categorical variables.

 

Table 47. Factor loadings for the 16 environmental variables.

Variable

Factor 1

Factor 2

Factor 3

Factor 4

Factor 5

A

-23

-4

74

13

21

B

24

86

11

-9

-6

C

11

86

13

19

-3

D

2

6

-9

82

-14

E

7

-39

17

51

16

F

-40

4

-4

-36

51

G

10

-19

-7

-7

79

H

-2

6

10

86

-5

I

-25

-3

29

-30

65

J

-92

-5

6

-2

9

K

78

33

-3

13

-2

L

28

75

34

-13

21

M

-2

38

76

3

-13

N

23

16

84

-3

-12

O

2

29

-12

29

75

P

94

19

8

-8

2

Indicates a loading of 60 or greater in absolute value.

 

Each of the five factors is assigned a descriptive label based on its correlations with relative variables:

·         Factor 1, Travel Pace and Phasing, is negatively correlated with the length of the walk phase, positively correlated with the length of the do not walk phase, and positively correlated with the travel pace.

·         Factor 2, Traffic Throughput, is positively correlated with the AADT, traffic directionality (one- or two-way street), and the curb-to-curb distance (i.e., the width of the street).

·         Factor 3, Distance to Safety, is positively correlated with the distance to the next marked crosswalk, the presence and type of median, and the presence of cross streets between marked crosswalks.

·         Factor 4, External Objects (Barriers/Vehicles) in Center of Road, is positively correlated with the presence of physical barriers that might prevent a pedestrian from crossing the roadway and the presence of a center turning lane.

·         Factor 5, External Objects (Vehicles) on Sides of Road, is positively correlated with the presence of parking along the roadway, the presence of a right turn only lane, and whether or not the far marked crosswalk is light controlled.

Scores for each of the five factors were generated using the standardized scoring coefficients listed in table 48. These coefficients can only be used with the standardized data and not with the raw, observed values. Each of the five factor scores is graphed against the percentage of intersection crossings for each location in figure 67.

Table 48. Standardized scoring coefficients for the five rotated factors.

Variable

Factor 1

Factor 2

Factor 3

Factor 4

Factor 5

A

-0.05047

-0.10759

0.38127

0.07187

0.08558

B

-0.04080

0.36229

-0.07173

-0.04503

-0.03821

C

-0.09979

0.38416

-0.07152

0.10124

-0.01225

D

-0.04177

0.05360

-0.06575

0.39620

-0.01480

E

0.09552

-0.22544

0.14359

0.25108

0.12869

F

-0.12612

0.07613

-0.04661

-0.12670

0.20952

G

0.12938

-0.11604

-0.01154

0.01230

0.41212

H

-0.04871

0.03214

0.02615

0.41759

0.02701

I

-0.03224

-0.03729

0.13973

-0.10183

0.29012

J

-0.36946

0.11821

0.00329

0.02802

-0.01769

K

0.26616

0.03607

-0.04015

0.04254

0.04615

L

0.02736

0.25549

0.06317

-0.05440

0.09691

M

-0.05098

0.06517

0.33786

-0.00163

-0.08623

N

0.09721

-0.09803

0.42527

-0.04528

-0.06540

O

-0.00290

0.14773

-0.12528

0.19947

0.39702

P

0.37272

-0.08072

0.04904

-0.06944

0.06522

 

Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location. The dots remain in consistent locations from box to box. The solid line = regression line and the blue area = 95% confidence limits of the regression line. Factor 1 is negatively skewed, factor 2 is slightly positively skewed, factor 3 is slightly positively skewed, factor 4 is positively skewed, and factor 5 is positively skewed,

Notes: The dots remain in consistent locations from box to box.

solid line = regression line, shaded area = 95-percent confidence limits of the regression line).

Figure 67. Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location.

 

Next, the scores for the five factors were regressed on the probability of an intersection crossing through logistic regression using forward selection. The model iterated three times after the entrance of an intercept term. The scores for Factor 1, (1) = 176.6, p < 0.01, Factor 2, (1) = 41.6, p < 0.01, and Factor 3, (1) = 402.3, p < 0.01, were statistically significant. The resultant model is shown in figure 68 where π represents the probability of crossing in the marked intersection and x is the vector of factor scores for a given location. When using a logit (log odds) model, represents the odds of success when all predictors equal zero, and represents the multiplicative effect of a one-unit increase in the corresponding x on the odds of success, when the other predictors are held fixed.

Equation. The logit of crossing at the intersection. The equation shows logit is equal to 2.36 minus .031 times factor1, minus .08 times factor 2, minus .27 times factor 3.

Figure 68. Equation. The logit of crossing at the intersection.

A good probability cutoff point for this data is 0.9, meaning that any future predictions using the model that are 0.9 or greater should be deemed an intersection crossing. With this cutoff, the model can successfully predict approximately 90 percent of the crossings overall. The resultant sensitivity and specificity are about 96 percent and 10 percent, respectively. Sensitivity measures how many events (intersection crossings) were successfully predicted, and specificity measures how many non-events (non-intersection crossings) were successfully predicted. The predicted probabilities of crossing at the intersection for each location are shown in table 49.

Table 49. Probabilities of correctly predicting a crossing at the marked intersection or the unmarked non-intersection by location and the corresponding upper and lower 95-percent confidence limits.

Location

Total Crossings

Estimated Probability

Lower 95 Percent Confidence Limit

Upper 95 Percent Confidence Limit

1

1,110

0.91203

0.90715

0.91668

2

4,631

0.90319

0.89779

0.90834

3

2,878

0.90348

0.89874

0.90802

4

13,199

0.91365

0.90995

0.91721

5

10,635

0.94112

0.93862

0.94353

6

16,418

0.93063

0.92710

0.93400

7

12,958

0.94419

0.94152

0.94675

8

1,574

0.80553

0.79210

0.81829

9

805

0.80876

0.79562

0.82123

10

528

0.91159

0.90585

0.91701

11

17

0.89962

0.89213

0.90664

12

185

0.90793

0.90141

0.91406

13

609

0.85845

0.84056

0.87463

14

205

0.92484

0.92110

0.92842

15

840

0.93348

0.93106

0.93583

16

280

0.93036

0.92719

0.93340

17

225

0.94899

0.94636

0.95150

18

786

0.95218

0.94816

0.95590

19

84

0.90816

0.90462

0.91158

20

89

0.92066

0.91693

0.92424

 

Prediction Model—Walk Versus All Other Crossings

A second model was developed to predict crossings that occurred entirely during the walk phase in the marked intersection. Crossings were divided into two categories—crossings that occurred within the marked intersection entirely during the walk light phase and all other crossings (e.g., unmarked non-intersection, marked intersection during the don’t walk light phase, etc.). Using this classification system, there were 70,378 pedestrian crossings among the 20 locations, 55,040 of which occurred entirely within the walk phase in the marked intersection. In other words, approximately 78 percent of the crossings were rule-following crossings. Using the factor analysis and standardized scoring coefficients (table 48), each of the five factor scores is graphed against the percentage of rule-following crossings for each location in figure 69.

Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location. The dots remain in consistent locations from box to box. The solid line is the regression line and the blue area is the 95-percent confidence limits of the regression line. Factor 1 is strongly negatively skewed, factor 2 is strongly positively skewed, factor 3 has no skew, factor 4 is slightly negatively skewed, and factor 5 is not skewed.

Notes: The dots remain in consistent locations from box to box.

solid line = regression line, shaded area = 95-percent confidence limits of the regression line).

Figure 69. Chart. Scatterplot of each location’s score for each of the five factors against the percentage of intersection crossings at that location.

 

Next, the scores for the five factors were regressed on the probability of a rule-following crossing through logistic regression using forward selection (see figure 70). The model iterated five times after the entrance of an intercept term. The scores for Factor 1, (1) = 607.6, p < 0.01, Factor 2, (1) = 1224.5, p < 0.01, Factor 3, (1) = 187.4, Factor 4, (1) = 5783.2, p < 0.01, and Factor 5, (1) = 388.4, p < 0.01, were all statistically significant.

Equation. The logit of crossing at the intersection during the walk phase. The equation shows logit is equal to 1.33 minus .37 times factor 1, plus .31 times factor 2, 
minus .23 times factor 3, minus .54 times factor 4, plus .24 times factor 5.

Figure 70. Equation. The logit of crossing at the intersection during the walk phase.

A good probability cutoff point for this data is 0.6, meaning that any future predictions using the model that are 0.6 or greater should be deemed a rule-following crossing. With this cutoff, the model can successfully predict approximately 79 percent of the crossings overall. The resultant sensitivity and specificity are about 94 percent and 23 percent, respectively. The predicted probabilities of crossing during the walk phase at the intersection for each location are shown in table 50.

Table 50. Probabilities of correctly predicting a crossing at the marked intersection entirely during the walk phase by location and the corresponding upper and lower 95‑percent confidence limits.

Location

Total Crossings

Estimated Probability

Lower 95 Percent Confidence Limit

Upper 95 Percent Confidence Limit

1

1,110

0.7514

0.7251

0.7759

2

4,631

0.8791

0.8694

0.8882

3

2,878

0.4456

0.4321

0.4591

4

13,199

0.6798

0.6718

0.6877

5

10,635

0.9343

0.9295

0.9389

6

16,418

0.9062

0.9017

0.9106

7

12,958

0.7546

0.7471

0.7619

8

1,574

0.7421

0.7199

0.7631

9

805

0.7764

0.7463

0.8039

10

528

0.8431

0.8106

0.8710

11

17

0.6428

0.5546

0.7223

12

185

0.6431

0.5752

0.7057

13

609

0.2855

0.2530

0.3204

14

205

0.8054

0.7656

0.8399

15

840

0.5515

0.5181

0.5845

16

280

0.8286

0.7798

0.8683

17

225

0.7344

0.6730

0.7879

18

786

0.7304

0.6983

0.7603

19

84

0.8690

0.7788

0.9260

20

89

0.4227

0.3397

0.5103

 

Federal Highway Administration | 1200 New Jersey Avenue, SE | Washington, DC 20590 | 202-366-4000
Turner-Fairbank Highway Research Center | 6300 Georgetown Pike | McLean, VA | 22101