DOI: 10.14714/CP96.1631

© by the author(s). This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0.

User-centered Design and Evaluation of a Geovisualization Application Leveraging Aggregated Quantified-Self Data

Jonathan K. Nelson, The Pennsylvania State University | jknelson@psu.edu

Alan M. MacEachren, The Pennsylvania State University | maceachren@psu.edu

Individual movement traces recorded by users of activity tracking applications such as Strava provide opportunities that extend beyond delivering personal value or insight to the individual who engages in these “quantified-self” (QS) activities. The large volumes of data generated by these individuals, when aggregated and anonymized, can be used by city planners, Departments of Transportation, advocacy groups, and researchers to help make cities safer and more efficient. This opportunity, however, is constrained by the technical skills and resources available to those tasked with assessing bicycling behavior in urban centers. This paper aims to address the question of how to design cartographic interfaces to serve as mediated platforms for making large amounts of individual bicycling data more accessible, usable, and actionable. Principles of cartographic representation, geovisual analytics techniques, and best practices in user interface/experience design are employed to arrive at an effective visualization tool for a broad urban planning audience. We use scenario-based design methods to encapsulate knowledge of map use practice gleaned from the development process, and conduct a post-implementation, two-part user study with seven domain experts to further assess the usability and utility of the interactive mapping tool.

KEYWORDS: cartographic design study; user-centered design; movement data; quantified-self (QS); urban planning

INTRODUCTION

Advancements in global navigation satellite and positioning systems, together with the subsequent increase in use of geo-enabled tracking devices, have resulted in unprecedented amounts of individual movement data (Swan 2012; Laube 2015). These data are increasingly being generated using personal devices, such as smartphones and other wearables (e.g., augmented eyewear, pedometers, smartwatches, textiles, wristbands, etc.). At the most ambitious end of the self-tracking spectrum is the Quantified Self (QS) community, which is composed of individuals who believe in “self-knowledge through numbers” and who use these devices to track biological, physical, behavioral, environmental, and/or other information about themselves (Swan 2013).

The movement data generated by quantified-selfers can offer more than just direct personal insight to the individual who engages in QS activities. The large volumes of data generated by these individuals, when aggregated and anonymized, can also be used to inform city safety (Zeile et al. 2015), preferential route choices (Baker et al. 2017), and air pollution exposure (Sun and Mobasheri 2017). Ubiquitous computing and Internet of Things (IoT) technologies create further potential for integrating individual movement data into smart city initiatives, such as traffic congestion monitoring (Zanella et al. 2014). This can result in a more humanized, bottom-up approach to city planning (Smyth et al. 2013). The opportunity, however, is constrained by the technical skills and resources available to those tasked with assessing bicyclist or pedestrian behavior in urban centers. This paper aims to address the question of how to design interactive mapping tools that can help urban and transportation planners leverage personal fitness data to better inform infrastructure decisions that aid in the safe and efficient movement of bicyclists?

More specifically, in this problem-driven research, we examine the design of a commercial interactive cartographic application intended to support urban and regional planning. The interactive mapping tools considered in this design study are focused on utilizing large volumes of individual movement data contributed voluntarily by users of an activity tracking application, Strava. Metro is a small division of Strava (the company, which shares a name with its application) that aggregates and anonymizes data on bicycling activities recorded on the platform, and licenses these data to a variety of organizations that are taking data-driven approaches to city planning. Based on a needs assessment and work domain analysis carried out by the first author while employed by Strava, the following insights emerged:

There existed a disconnect between what Strava Metro assumed data users were capable of (in terms of big data management, analysis, and visualization), and the reality of their time, resources, and abilities.

Data users faced significant challenges with data size/complexity, resulting in frustration, confusion, and limited ability to extract value from the data.

Most data users aimed to accomplish some variant of the following tasks: quantify/qualify ridership; distinguish between commute and recreational bicycling corridors; and identify candidate areas for modifying or creating new bicycle facilities.

Thus, we set out to develop a partnership with data users to alleviate their frustration and help address their needs. More specifically, we initiated a multi-year, user-centered design study to develop a geovisualization tool to make Metro data more accessible to stakeholders in the city planning process. These stakeholders may possess limited, or even no experience with geographic information systems (GIS) or spatial analysis. However, the tool should also support more advanced analysts, and offer immediate insights into the fundamental tasks outlined above. A major contribution of our work is the methodological framework we present for evaluating the extent to which the proposed design solution addresses the needs outlined above.

In the following sections, we first ground our design framework in relevant literature on cartographic approaches to mapping movement, geovisual analytics techniques for interacting with complex representations of movement, and urban interfaces (or city dashboards). Next, we introduce Metro DataView, an interactive network flow map designed to make data on bicycle activity more accessible, usable, and useful to decision makers, stakeholders, and researchers in the urban planning domain. A simplified approach to visualizing individual movement traces is then considered, followed by a presentation of a three-stage user-centered design and evaluation model employed to both formatively and summatively assess the cartographic tools presented in this work. That model combines a hypothetical use case scenario and claims analysis with a post-implementation user study to (1) characterize the domain problem, (2) synthesize knowledge of map use practice gleaned from the development process, (3) assess the extent to which the geovisualization tools support insight discovery, and (4) evaluate interface usability and utility. The paper concludes with a discussion of tool adoption and impact, major contributions, design limitations, and opportunities for future research and development.

BACKGROUND

This section provides background on cartographic approaches relevant to mapping individual activity traces; highlights ongoing research in the geovisual analytics community focused on exploring and making sense of complex movement data with an emphasis on transportation applications; and concludes with an overview on urban interfaces and their relevance to the visual communication of movement for an urban planning audience.

CARTOGRAPHIC APPROACHES TO FLOW MAPPING

Individual movement data are commonly structured as series (or trajectories) of point records, each composed of a latitude/longitude coordinate pair and a timestamp. Mapping these points directly is the simplest approach to visualizing and attempting to make sense of data in this form (Andrienko et al. 2008). Mapped depictions of raw GPS point data, however, become less feasible if the number of trajectories is large, if they reveal personally identifiable information, or if the analytical goal is to detect group—rather than individual—movement behavior. One, if not all, of these conditions will likely be met when engaging with individual movement data. In these cases, data aggregation becomes a viable strategy for preserving individual privacy while also making analytical tasks computationally tractable and mapped results visually meaningful (Rinzivillo et al. 2008).

Approaches to representing the spatial flow of objects in aggregate form have existed since at least 1845 (Robinson 1967). Effective flow maps are difficult to create, and have traditionally been drawn carefully by hand. Tobler (1987) introduced the challenges of migration flow mapping by computer, and, despite decades of advancement in GIS technology, creating effective digital flow maps remains difficult today (Rae 2011; Zhu and Guo 2014; Zhou et al. 2019). Connecting straight lines between locations on a map is inadequate, because the map quickly becomes cluttered and illegible. More sophisticated approaches to creating flow maps involve algorithm-driven line bundling and aggregation (e.g., Phan et al. 2005; Buchin, Speckmann, and Verbeek 2011; Debiasi et al. 2014). Such approaches have the advantage of reducing visual clutter through the offsetting or merging of lines. However, they become increasingly ineffective as data size and dimensionality increase, as well as when locations for origins and destinations and the arbitrary lines between them disguise the underlying patterns of the spatial phenomenon under study (Guo 2009).

GEOVISUAL ANALYTICS APPROACHES TO MAKING SENSE OF FLOWS

Interactive geovisual analytics approaches allow us to move beyond messy cartographic representations of movement into an environment that lets users filter, analyze, and more effectively make sense of large, complex flow data. These approaches oftentimes leverage computational data summarization, pattern extraction techniques, and linked views (Andrienko et al. 2008). For example, Boyandin et al. (2011) proposed linking two separated origin and destination maps with a non-spatial temporal heatmap to represent change in movement flows over time more effectively. Other notable works include those of Wood, Dykes, and Slingsby (2010), and Wood, Slingsby, and Dykes (2011). These authors explored the benefits and shortcomings of representing flows with curved symbols in comparison to other, more novel, approaches such as gridded views and origin-destination (OD) maps (i.e., an origin-destination matrix overlaid on geography). With respect to computational pattern extraction, Guo (2009) introduced a methodological framework that combined hierarchical and multivariate clustering, together with interactive flow maps, and demonstrated the value of that framework in the context of migration mapping. This work was later extended to more effectively support multi-resolution flow clustering on large datasets (Zhu and Guo 2014), and to develop a multi-scale flow density estimation and selection methodology for visualizing patterns in complex OD pairs (Zhu et al. 2019).

Transportation, in particular, is an active research area in the geovisual analytics community, because professionals in this domain rely on interactive, visual tools to support their analyses of vehicular and pedestrian movement (Andrienko et al. 2017). Many data types, derivatives, and visual representations have been developed to inform the spatial, temporal, and contextual properties of aircraft, automobiles, pedestrians, ships, trains, and other moving entities. We refer readers to Chen, Guo, and Wang (2015), and Andrienko et al. (2017) for comprehensive surveys of data visualization frameworks and techniques relevant to traffic and transportation data, but we will highlight two particular works that emphasize the diversity of research being conducted in this application area. Zeng et al. (2013), for example, considered both data aggregation and pattern extraction techniques in their exploratory analysis of passenger re-distribution patterns at intersections in traffic networks. The authors further adapted a variant of the circos figure (i.e., a circular plot for exploring relationships among objects and positions; see Krzywinski et al. 2009) to visualize the flow of travelers through interchanges. More recently, Zhou et al. (2019) proposed a visual abstraction approach that leverages a Natural Language Processing word embedding framework, together with adaptive sampling, to make sense of large amounts of OD data. The authors demonstrated how their visualization system reduced visual clutter and highlighted human mobility patterns using bicycle share and mobile phone location datasets.

The work we present in this paper contributes to the growing body of research on the representation and analysis of movement data by introducing and evaluating an integrated network and OD flow map constructed from the aggregation of individual bicycling traces. Network flow maps have received less attention than their radial and distributive counterparts due to limited access to trajectory data, particularly individual trajectory data, as well as challenges surrounding data size and complexity. Through processes of data abstraction and novel techniques for rendering large geospatial data in a web browser, this paper offers an effective framework for making large amounts of individual movement data more usable, useful, and accessible, while also preserving the privacy of the data creators.

URBAN INTERFACES

Beyond the research on mapping and analysis of flow data outlined above, the system presented here also draws upon research on urban data interfaces, often termed dashboards (Few 2004). These dashboards provide analysts, decision makers, and the public with access to data about the city, and are increasingly being used for urban planning and real-time city monitoring. For example, Maynooth University, in partnership with Dublin City Council recently undertook an extensive Building City Dashboards (BCD) project. The BCD project focuses on expanding the Dublin Dashboard, one of the most comprehensive urban interfaces to date (Kitchin, Maalsen, and McArdle 2016).

On one hand, urban interfaces provide citizens and planners alike with a multifaceted, data-driven perspective on their city. On the other, this perspective is biased as a result of data cleansing, data that are (un)consciously not included, a lack of information on how the data were sourced and/or derived, and the varying abilities that users have to make sense of the data. Mattern (2014) cautions against the “instrumented” rationalization of the city mediated through an inevitably incomplete interface that lacks in affect and civic collaboration. Mattern provides guidelines for urban interfaces and asks designers to consider composition and framing of screen elements and how they interact over time and space, scale of context (entire city vs. street corner), intended audience, and the types of information about the city that cannot (or should not) be represented by data visualizations. As big social media data are increasingly leveraged to feed urban dashboards and geovisualization tools more broadly, the proper design and use of those tools must reflect the (semi-)volunteered nature of data collection; respect the privacy of the data creators in the collection, storage, analysis and visualization of the data; and consider the uneven density and representativeness of social media data across space and time (Martin and Schuurman 2020).

As humans, our experiences in and understanding of cities are bound in place, space, and time. Urban interfaces undoubtedly benefit from, if not require, a cartographic component. In many cases, the map may be the central element of the interface, thus conceptualizing and creating effective urban interfaces benefits from expertise not only in user interface (UI) and user experience (UX) design, but also map design (Roth 2017). Further, the design process gains from balancing interaction design frameworks relevant to data visualization and cartography (e.g., Shneiderman’s [1996] Information Seeking Mantra or Roth’s [2013] taxonomy of cartographic interaction primitives) with an iterative, user-centered approach to defining (or refining) interaction flows in response to feedback provided by those who will use the interface. In the following section, we introduce a map-centric urban interface designed to provide city planning professionals, local advocacy groups, and researchers with insights into how bicyclists move across urban networks.

METRO DATAVIEW

Metro DataView is an interactive flow map that depicts volumes of unique bike trips, commute-designated trips, and bicyclists across an urban network. In addition, the cartographic tool provides an option to view a rasterized heatmap of the GPS points that define the activity traces used in the creation of the other views.

DataView differs from many urban bike maps due to its ability to relay objective information on how bicyclists are moving across a street network. Oftentimes, bike map design in urban planning contexts is based on subjective input provided by the “average” bicyclist. Wessel and Widener (2015) surveyed dozens of urban bike maps and found that Departments of Transportation and planning agencies in cities across the United States were publishing maps that assigned bike routes to ill-defined classifications, such as “preferred,” “use with caution,” or “not recommended.” In some cases, subjective context was also used to represent gradient (e.g., “steep hill” vs. “very steep hill”) and safety (e.g., “difficult intersection”). This subjective design approach can be attributed to a lack of data on infrastructure and ridership, as well as to the Federal Highway Administration’s “bicycle level of service” initiative that aims to evaluate the suitability of roadways for bicycle activity based on “comfortability” ratings provided by a subset of bicyclists for a sample of road segments (Harkey, Reinfurt, and Sorton 1998). Ratings can be correlated with road characteristics (e.g., pavement condition, shoulder width, speed limit, etc.) to extrapolate level-of-comfort designations across the entire network (Landis, Vattikuti, and Brannick 1997).

However, defining the “average” bicyclist is problematic, and as a result, many urban bike maps leave much to be desired with respect to objectively informing a diverse audience of bicyclists on how to successfully navigate the city. There are, though, some noteworthy exceptions to this subjective design approach. Wessel and Widener (2015) designed a printed bike map of Cincinnati, Ohio that intentionally did not include any unquantifiable information on roadway or terrain characteristics, with the intent that bicyclists of all types could make more objective wayfinding decisions. Similarly, but more narrowly focused, Brügger, Fabrikant, and Çöltekin (2017) conducted an empirical study to comparatively evaluate three linear elevation change symbolization methods (variation in color hue, color-coded arrows, and elevation profiles) to gain insight into how to design static maps to better facilitate bicycle route planning. Most similar to the design solution presented in our work is the Madrid Cycle Track initiative, in which Romanillos and Austwick (2016) developed network flow and heat maps from volunteer bicyclists to reveal mobility patterns across Madrid, Spain based on journey purpose (casual vs. messenger) and sociodemographic characteristics (age and gender). Bike maps, such as these examples and DataView, which quantify and effectively communicate ridership across the network, as well as other characteristics of the urban environment, can help bicyclists to choose safe and personally appropriate routes and assist city planning professionals in making strategic infrastructure decisions that promote bicycling as a recognized mode of active transportation (Su et al. 2010).

In the following subsections, we formally introduce Strava, the activity tracking platform by which bicycling trips were collected, aggregated, and made available for Metro DataView. Next, we provide a high-level summary on how the bicycling trip data are processed to support effective and efficient visualization and interaction. Lastly, system design decisions are described in detail.

CASE STUDY DATA PLATFORM

Strava is a social network for athletes that provides a platform for application users to record, analyze, and share their fitness-related activities. Tens of millions of activities are uploaded to the platform daily from users all over the world, and over two billion activities had been recorded in total between the company’s inception in 2009 and December 2019 (Strava 2019).

Activities recorded by platform users provide opportunities that extend beyond just delivering personal value or insight to the individual who engages in quantified-self activities (Lee and Sener 2020). The large volumes of data generated voluntarily by users of these types of applications, when aggregated and anonymized, can be used by city planners, Departments of Transportation, advocacy groups, and researchers to help make cities safer and more efficient for bike and pedestrian activity (DiGioia et al. 2017). This requires transforming large numbers of activity traces into actionable insights for a variety of stakeholders in the urban planning space.

Metro is a small division of Strava that licenses aggregated and anonymized activity data to Departments of Transportation and other city planning organizations. Researchers are leveraging Metro data to better understand spatial patterns in bicycling and pedestrian behavior across many different application (and geographic) areas. For example, Griffin and Jiao (2015) evaluated the relationship between place-based/road network variables and the geography of bicycling-for-fitness in Travis County, Texas. Metro data have been used to facilitate smarter mobility planning in Johannesburg, South Africa (Selala and Musakwa 2016), and to model the relationship between bicycling trip purpose and air pollution exposure in Glasgow, UK (Sun and Mobasheri 2017).

A consideration when using Metro data is that the Strava user group reflects only a subset of the bicyclist population. Many of the users engage primarily in sports and training activities, not necessarily in everyday commuting trips. Recognizing this, prior studies have aimed to evaluate the representativeness of Metro data. For example, the Centers for Disease Control and Prevention reported a strong association (ρ = 0.60) between the number of Strava-tracked commuters and the number of active commuters sampled by the US Census Bureau’s American Community Survey (ACS) in four major cities (Whitfield 2016). However, intracity correlation may vary as result of population density, social (dis)advantage, and overall ridership in the area (Conrow et al. 2018). At the very least, crowdsourced fitness data can complement and extend traditional active transportation surveillance and analysis despite sample and other biases inherent in user-generated data sources (Jestico, Nelson, and Winters 2016; Ferster et al. 2017; Lee and Sener 2020). Moreover, these data exhibit unprecedented spatial and temporal resolution, allowing for new approaches to measuring changes in bicycling behavior across an urban network as a result of infrastructure change and implementation (Boss et al. 2018).

However, the success of such approaches hinges on data accessibility and utility. In the following subsection, we explain the data abstraction process by which streams of individual movement traces recorded on Strava are transformed into summarized data views that a relevant audience can visualize and interact with using Metro DataView.

DATA PROCESSING & ABSTRACTION

Bicycling activities (i.e., streams of GPS points) recorded on Strava that are made publicly available by platform users are first queried from a PostGIS database based on a geographic area of interest and specified timeframe. Next, a map-matching process (White, Bernstein, and Kornhauser 2000) is performed to identify street network geometry traversed by bicyclists. More specifically, GPS points are aggregated against a vector street network (e.g., OpenStreetMap extract, TIGER network, etc.), intersection nodes (derived based on where the street network breaks), and arbitrarily defined hexagons with a 350-meter diameter. For street and intersection aggregations, the first and last 500 meters of each activity trace are cropped to preserve user privacy. Activity start and endpoints are only used in the hexagonal aggregation process, because these are created for the purpose of exploring origin-destination patterns in bicycling behavior and the size of the areal unit preserves user privacy. The aggregation processes output counts of unique individuals, activities, and commute-designated trips appended to all three types of spatial geometry. If a trip begins and ends at different locations, it is designated as a commute. Median interchange crossing times are also derived at the intersection level. This results in trajectory-oriented views of movement from both origin-destination and route-based perspectives (Andrienko and Andrienko 2010). The combined approach of map matching and aggregation has been identified as an effective strategy for outputting a useful dataset for transportation planning while also maximizing geoprivacy (Sila-Nowicka and Thakuriah 2016).

The three spatial datasets are output in GeoJSON format and converted into vector map tiles. Rendering GeoJSON directly on the client is not computationally practical because it requires downloading the entire data file on every map load. Vector tiles reduce the amount of data transferred to the client by returning vector representations only of features visible within the current map bounds and zoom level (Eriksson and Rydkvist 2015). In comparison to raster tiles, feature attribution persists through the GeoJSON-to-vector-tile transformation. As a result, features can be dynamically styled, manipulated, and interacted with on the client in real-time.

FEATURES & FUNCTIONALITY

As noted above, Metro DataView is an interactive mapping tool that depicts aggregate patterns of bicycling behavior across a road network. The intent of the tool is to provide city planning professionals and stakeholders, particularly those who possess limited or no GIS expertise, with a simple interface for: easily distinguishing commute from recreation bicycle corridors; identifying candidate areas for fixing or creating new bicycle facilities; and quantifying ridership pre- and/or post infrastructure change. Visual representations of counts of unique bike trips are displayed by default. An interactive tutorial is initiated when the application is loaded, to introduce and familiarize users with the interface and functionality. Learnability is one of the fundamental components of system usability, because a user’s initial experience with an interface involves making sense of how it works and what it depicts (Nielsen 1993). Interactive tutorials are effective strategies for conveying short, chunked sequences of syntactic knowledge to novice map users (Roth, MacEachren, and McCabe 2009; Mead 2014).

System features can be accessed in the control panel, which is in the upper left corner of the map interface. Area-of-interest, timeframe, and global statistics on the total number of activities and bicyclists being represented in the interface are specified at the top of the control panel. Below this information are buttons that allow users to switch between the following unique data views: Rides, Commutes, Cyclists, or Heat. Only one of these views can be selected at a time. Figure 1 provides an overview of the layout of Metro DataView’s various features. Intersections, origin-destination polygons, and destination-origin polygons can be toggled on or off. When on, these layers are overlaid on top of the street network and correspond to whichever aggregate data view is enabled. For example, if the “Commutes” view is selected, the intersection layer will depict counts of commutes at interchanges across the network (see Figure 2). Similarly, if the “Cyclists” view is selected, the origin-destination layers will depict the number of bike riders starting or ending at each of the arbitrarily-defined hexagonal areas across the network (see Figure 3). Additionally, users can view a non-aggregated, rasterized heatmap of GPS points from the activities used to create the other views (see Figure 4). Satellite imagery is also available as an alternative to the default dark base map.

Figure 1. Metro DataView: (a) network view options; (b) intersection toggle; (c) origin/destination toggle; (d) basemap selection; (e) network legend; (f) navigation and help.

Figure 1. Metro DataView: (a) network view options; (b) intersection toggle; (c) origin/destination toggle; (d) basemap selection; (e) network legend; (f) navigation and help.

The network flow map is the primary layer within the tool and is symbolized using both variations in line width and a diverging blue-red color scheme to represent the volume of counts across road segments. Counts on streets are binned into six classes based on a variant of the Jenks (1967) natural breaks method, and colors were selected using ColorBrewer, an online resource for selecting logical color schemes for thematic data (Harrower and Brewer 2003). Wider lines and darker shades of red signify road segments with higher counts. Narrower lines and darker shades of blue (with dark blue having the lowest visual contrast with the black map background) denote road segments with lower counts. The intent of this symbolization is to show the volume of bicycling behavior, highlight key corridors of activity, as well as identify prominent areas of inactivity.

VIDEO: Watch a short demonstration of DataView

A complimentary intersection layer can be toggled on and off atop the network flow map to provide additional perspective on whichever network view is selected (Figure 2). The size of intersection point symbols is scaled based on the number of unique bike trips, commute-designated trips, or bicyclists aggregated to the points. Higher counts are represented by larger points. The opacity of intersection point symbols is varied based on median crossing times through the intersections. More opaque point symbols depict longer intersection crossing times, helping to draw attention to potentially problematic interchanges atop a dark base map.

Figure 2. Intersection points representing the aggregate number of bicyclists crossing an interchange; count and median crossing time statistics shown on hover.

Figure 2. Intersection points representing the aggregate number of bicyclists crossing an interchange; count and median crossing time statistics shown on hover.

Additionally, a view of origin-destination polygons can also be toggled. These are based on the previously mentioned layer of contiguous 350-meter hexagonal bins. Metro DataView can support both origin- and destination-first views (i.e., users can select an origin and see all destinations associated with that origin or users can select a destination and find all origins associated with the destination). Polygon color value is varied to reflect the number of unique bike trips, commute-designated trips, or bicyclists that started or ended within the area. Darker shades of grey denote lower counts; lighter shades signify higher counts. Clicking on a polygon returns all destination (or origin) polygons associated with that polygon. The map automatically zooms to the bounds of the associated polygons and highlights them in shades of pink. Figure 3 depicts commute destinations across Fredrikstad, Norway and the associated origins of one selected destination.

Figure 3. Overall patterns in commute-designated bike trip destinations across Fredrikstad, Norway (top); hover on a destination polygon to obtain a count of commute trips ending in the area (bottom left); click on a destination polygon to reveal all associated origin polygons (bottom right).

Figure 3. Overall patterns in commute-designated bike trip destinations across Fredrikstad, Norway (top); hover on a destination polygon to obtain a count of commute trips ending in the area (bottom left); click on a destination polygon to reveal all associated origin polygons (bottom right).

Lastly, users can select the Heat view, which depicts a rasterized heatmap of the movement traces that were aggregated to create the views described above (Figure 4). Counts are not available in this view. Activity density can only be inferred relatively. This view supports users in discovering missing or inaccurate geometry in the street network that was used for aggregation. The validity of the aggregated counts is contingent on the locational accuracy of the underlying street network. Thus, the heatmap not only depicts a rasterized representation of raw movement traces, but also serves as a guide for correcting or adding to an existing vector road network (i.e., if hotspots are seen that do not follow existing roads or paths, this may suggest that a segment of geometry needs to be created or updated in the base map).

Figure 4. Rasterized heatmap view of GPS bicycling traces across Fredrikstad, Norway.

Figure 4. Rasterized heatmap view of GPS bicycling traces across Fredrikstad, Norway.

In summary, Metro DataView is designed to visualize an aggregated bicycling dataset in ways applicable to an urban planning audience. In the next section, we consider a simplified approach to visualizing individual activity traces that extends the rasterized point heatmap view already incorporated into DataView.

SIMPLIFIED APPROACH TO VISUALIZING ACTIVITY TRACES

To explore directions for future visualization development and further inform the design of Metro DataView, we implemented an alternate approach to visualizing patterns in bicycling behavior. The approach consolidated and extended the raster-only heatmap view seen in Figure 4, and did not require a computationally expensive aggregation process. Instead, a hierarchical visualization and interaction design approach was taken to seamlessly transition raster to vector representations of GPS bicycling trace data as a function of map zoom level. As the user zooms in on the interactive web map, static raster tiles representing GPS points transition to vector paths, revealing more subtle intricacies of movement patterns through city centers and along recreational trails. This visual abstraction approach builds upon the foundational work of Peuquet (1981) on translating and integrating raster-vector data representations, as well as the respective works of Brewer and Buttenfield (2007), and Stolte, Tang, and Hanrahan (2003) on multiscale symbol representation for USGS DLG reference map data and multiscale visualization using data cubes. The raster and vector representations are designed to look visually consistent; hence, users should not notice or be distracted by transitions in data representation when interacting with the map. Figure 5 shows small- and large-scale depictions of bicycling traces.

Figure 5. Small to large (left to right) map scale depictions of raster to vector representations of bicycling traces.

Figure 5. Small to large (left to right) map scale depictions of raster to vector representations of bicycling traces.

There were two main objectives for exploring this alternate approach to visualizing bicycling traces. The first objective was to assess if the approach had the potential to further broaden and diversify the Metro DataView user group. This raster-vector map integration could be implemented quickly and at a low cost because it did not require data aggregation. The tradeoff, however, was that the visualization of behavior patterns remained relative; functionality to obtain counts on a single street was not provided and was computationally impractical to implement. The second objective for exploring this approach was to further understand the merits of the more comprehensive Metro DataView service, and to better outline what a minimally viable product was from the perspective of a non-technical stakeholder in urban planning (i.e., someone unfamiliar with and potentially uninterested in undertaking analysis with Metro data, but who might find value in a simple visualization of bicycling behavior).

In the following section, we discuss the user-centered design and evaluation model used to assess the extent to which features and functionality of both the heatmap and DataView support city planning professionals in making informed decisions on bicycle infrastructure implementation.

USER CENTERED DESIGN & EVALUATION

The cartographic interfaces described in this work resulted from a multi-dimensional, in-depth long-term case study patterned after Shneiderman and Plaisant (2006), and consisted of three major stages of user-centered evaluation:

  1. Scenario Based Design
  2. Insight Discovery
  3. Usability & Utility Evaluation

In stage one of the study, we employed scenario-based design techniques (Rosson and Carroll 2002) to formatively assess client (and potential client) feedback that was solicited in various ways (through structured surveys, focus groups, phone calls, in-person discussion, and email correspondence) over a two-year product development cycle.1 Insights generated through this collaborative effort were integrated into a hypothetical use case scenario and supporting claims analysis to characterize the domain problem, exemplify the design challenges, and synthesize knowledge of map use practice gleaned from the development process.

Stages two and three of the study focused on summative assessment. We recruited seven non-client domain and visualization experts to evaluate the extent to which DataView, specifically, supported insight discovery (stage two) and was deemed useable/useful (stage three). Insight discovery was informed through a semi-constrained task and usability/utility were evaluated via survey. We report on the methodology and results for each stage in the following subsections, which are organized at the highest level by assessment type.

FORMATIVE ASSESSMENT

SCENARIO-BASED DESIGN

Scenario-based design (SBD) principles were employed to present a “sketch of use” for both the standalone heatmap and DataView. SBD focuses on how people will use a system as opposed to describing a system’s features and functionality (Rosson and Carroll 2002). Scenarios of envisioned use are typically defined at early stages of system development to guide the design process (e.g., MacEachren et al. 2011), but can also be effective at informing other stages of system evolution (Rosson and Carroll 2002). In this instance, we are presenting a hypothetical use case scenario and complementing claims analysis to abstract the domain problem into essential tasks (scenario) and the necessary data representations and interactions to support those tasks (claims analysis). This scenario synthesizes insights gleaned from a two-year human-centered business practice carried out by the first author while employed by Strava, working with a range of clients, and is prototypical of one that would be common in a large Department of Transportation organization.

SCENARIO

Susan is a bike and pedestrian facilities project manager working for a hypothetical State Transportation Agency. Susan is responsible for overseeing all phases of the project lifecycle, from scoping and right-of-way to preliminary engineering and construction. As part of a smart city initiative, the State has appropriated 45 million dollars over a 5-year period to research, design, and implement bicycle infrastructure and recreational facilities in the State’s largest urban center.

The initiative is currently in research and scoping phases with some projects already identified and others yet to be defined. Various locations within the urban center have been identified as “hubs” for bicycle activity, and three Eco Counters have been purchased and installed to generate data on how many bicyclists are moving through these specific areas. The challenge, however, is that these counters are relatively far apart in a large urban center and cannot begin to inform key corridors throughout the heart of the network. Installing more counters is not fiscally practical, so Susan searches for cheaper, alternative data sources to complement the counter data. She discovers the report published by the Centers for Disease Control and Prevention (Whitfield 2016) that showed strong correlation between counter data and crowdsourced bicycling data maintained by Strava Metro. Susan reaches out to Metro to inquire more about its data and visualization services. Susan acknowledges that she is not trained in GIS and data analytics, and that she has very limited internal resources to leverage for data mapping and analysis. As a result, she decides to license DataView, and agrees to test out a beta heatmap service that Metro is actively developing.

Susan was informed that the beta heatmap is simpler to use than DataView, so she visits that link first. The webpage renders, and Susan is intrigued by the bright blue and white lines that represent GPS traces from bicycle activities. There’s so many of them that on top of the dark basemap, they almost seem to illuminate the entire road network. She zooms in on the map to the locations of each of the three Eco Counters. The heatmap seems to confirm an influx of bicycling activity at those locations. She then zooms in to other areas that have been designated as key recreation and commute corridors, and again finds dense, saturated blue lines on the heatmap. Susan pans outward in various directions. She follows the lines around intersections, into parks, and through the residential and financial districts, attempting to identify popular routes and noteworthy destinations along those routes. Susan tries clicking on the map and the lines, hoping to find activity counts or functionality to toggle between commute and recreational-designated trips, but nothing happens. After fifteen minutes of exploration, Susan realizes that while she has detected known trends, she lacks quantifiable evidence needed to more effectively inform where to construct or modify essential infrastructure. Moreover, Susan isn’t a bicyclist herself, which makes parsing the relative distinction between “less” or “more” rides on a given street very difficult.

Next, Susan inputs the URL link to DataView. She steps through the onboarding experience and is optimistic that the functionality will better serve her needs. Upon completing the short tutorial, she first switches between the various data views (rides, commutes, cyclists) in the control panel. Colors and patterns in the network map change, and the street legend updates based on the view. Susan quickly realizes that this map looks much different than the heatmap she had previously viewed; there appears to be fewer lines and less noise. She leaves the “commute” view selected and zooms in to one of the Eco Counter locations on Main Street. Rather than finding many overlaid lines, she discovers that the actual road segments have been colored and widened based on the number of commutes that crossed each one. She hovers on the segment nearest to the Eco Counter, and a dialogue box pops up and relays that 3,577 commute-designated bike trips happened along that segment over the last three months. She then switches between the other two views: “rides” and “cyclists.” The color and width of the lines update Agency, and she learns that a total of 5,674 bike trips crossed the segment, accounting for 2,884 unique bicyclists.

Because the street segments are symbolized based on counts, Susan no longer needs to pan around attempting to follow patterns. Rather, she zooms out and quickly detects vibrant shades of red illuminating key corridors throughout the network. Susan has now identified a corridor of interest intersecting the Eco Counter location on Main Street. She toggles between the “origin-destination” and “destination-origin” views. Susan finds many popular origin polygons in the northern suburbs that all relate to a small and spatially-focused number of destination polygons in the southern, financial district. The street network map reveals a handful of popular arteries that all seem to flow into the Main Street corridor. Looking more closely, Susan notices that ten blocks south of the Eco Counter location, the primary commute route abruptly diverges into a residential area for about 13 blocks before returning to Main Street. She zooms back in to the newly discovered area of interest and toggles “intersections” on. The small, brightly colored nodes quickly convey that median crossing times for bike trips through the more direct, yet less traversed part of Main Street are significantly longer than those through the residential area detour. Susan isn’t overly familiar with the area, but finds this pattern somewhat surprising because she knows that an unprotected bike lane already exists along the entire stretch of Main Street. She switches the basemap from “dark” to “satellite,” and finds that the number of car lanes on Main Street abruptly changes from two with no roadside parking to one with roadside parking. The bike lane appears to be quite close to the parking spaces, and in some cases, it almost looks like cars are parked partway in the bike lane. Given this insight, Susan initiates a new protected bike lane project in the State’s internal prioritization and selection system. She also forwards the DataView link to various local bicycle advocacy groups. Community input and buy-in are essential, and Susan is hoping that these groups can provide additional qualitative insights on the impact of the potential project to bicycle safety and efficiency along that corridor.

CLAIMS ANALYSIS

SBD claims analysis aims to provide a balanced view on challenges and opportunities surrounding system features that have important consequences for users (Carroll and Rosson 1992). Positive and negative claims are made about features to evaluate system design decisions and help identify focused opportunities for subsequent user testing (MacEachren et al. 2011). Table 1 presents a claims analysis on key features in the standalone heatmap and DataView, and was used to guide the design of a post-implementation user study, which we report on in the following subsection.

Table 1. SBD Claims Analysis for Heatmap and DataView.

Table 1. SBD Claims Analysis for Heatmap and DataView.

SUMMATIVE ASSESSMENT

METHODOLOGY & PARTICIPANTS

A post-implementation, two-part user study, consisting of task and survey components, was conducted to evaluate the design and utility of Metro DataView from the perspectives of non-client domain experts. The standalone raster-to-vector heat map did not move forward to this stage of evaluation because it did not relay activity or bicyclist counts, which was functionality that had been deemed essential by users in stage one. Study design was informed by a synthesis of best practices for evaluation of geovisual analytics systems, which emphasizes instruction, analytical work, and feedback collection as the primary components of an effective study design (Savelyev and MacEachren, 2020). The first part of the study focused on a semi-constrained insight discovery task, in which participants were provided with a web link to the interactive mapping tool for the entire US state of Utah and instructions on how to use it. Participants were asked to explore the interface at their convenience over a period of a week and write a short essay (up to 500 words) based on a self-selected role (e.g., city planner, local advocate, transportation analyst, etc.). The task prompted participants to clearly articulate (1) their selected role, (2) insights gleaned through interacting with the tool, (3) goals or approaches taken when interacting, (4) what visualizations/functionality were employed in arriving at various insights, and (5) to what extent the tool supported insight discovery. The intent of this exploration activity was to provide evidence about DataView’s ability to support a broad range of insights on the part of participants, while also providing a consistent framework for synthesizing results.

The second part of the study entailed an online follow-up survey, consisting of a mix of multiple choice questions, five-point Likert scale ratings, and open-ended response questions, designed to evaluate the usability and utility of Metro DataView. Usability metrics were based on the system usability scale (Brooke 1996). Utility metrics followed a format similar to those designed by Pezanowski et al. (2018) and Robinson et al. (2020) but were adapted to explicitly evaluate the extent to which DataView facilitates better understanding of bicycling behavior at various spatial resolutions and could help city planners or Departments of Transportation make informed decisions on bicycle infrastructure design and implementation.

Seven participants engaged in the study; six completed both parts, while one submitted an incomplete and unusable essay alongside a complete survey. Participants were recruited using email lists that targeted experts in the use of interactive maps as an input to decision-making (e.g., geography, urban planning, or place-related policy making domains) and experts in the design of interactive, web-based interfaces to explore data (e.g., data visualization, cartography, or human-computer interaction domains). Figure 6 depicts a visual summary of participants’ demographic and professional backgrounds.

Figure 6. Visual summary of participants’ demographic and professional backgrounds.

Figure 6. Visual summary of participants’ demographic and professional backgrounds.

TASK ACTIVITY RESULTS

The task component of the study resulted in six essays, ranging in length from 121–516 words, with a mean word count of 273. Table 2 summarizes essay content, and is organized based on the selected roles, goals, specified interactions, noteworthy insights, and recommendations for tool enhancement provided by the participants.

Table 2. Participants’ approaches to insight discovery.

Table 2. Participants’ approaches to insight discovery.

In summary, the six participants explored the tool from different perspectives with some overlap or similarity in assumed roles. Self-directed tasks or goals ranged from well-defined to broad exploration. Most participants interacted with and commented on insights obtained from using DataView’s various features; however, one participant chose to focus their assessment solely on the utility of the origin-destination polygons. Four of the six participants provided very specific insights about findings at particular locations, whereas two participants provided shorter, more general insights. Five participants provided recommendations for how to improve the interface. Overall, the types of insights and recommendations for system improvement provided by participants aligned with the claims made about DataView based on a synthesis of client collaboration carried out by the first author, Jonathan Nelson, while working for Strava Metro (see Table 1). For example, participants demonstrated success in using the three different network views to identify prominent bike corridors. However, they also expressed the need to distinguish recreation-only from commute corridors. Similarly, participants overlaid intersection and OD polygon data successfully to make inferences about movement volumes, wait times, and prominent origin/destination locations. However, they highlighted additional challenges interpreting symbology. In some instances, participants provided input for system improvement that extended beyond the negative claims identified in the claims analysis. We explore these recommendations in more detail in the following section on usability and utility evaluation.

USABILITY/UTILITY RESULTS

Results from the online follow-up survey included seven participants’ responses to a mix of Likert scale, multiple choice, and short answer questions designed to collect feedback on the usability and utility of Metro DataView. The first four questions focused primarily on time spent using the interface and initial impressions of its design and effectiveness. One participant reported having spent 30–45 minutes using DataView, three participants spent 15–30 minutes, and three spent less than 15 minutes. All participants reported stepping through the entire onboarding experience. The majority of respondents characterized their initial experience using DataView to be straightforward and found the overall design of the interface to be effective. Figures 7 and 8 provide visual summaries of the strengths and weaknesses of DataView in terms of usability and utility.

Figure 7. Visual summary of participants’ usability ratings.

Figure 7. Visual summary of participants’ usability ratings.



Figure 8. Visual summary of participants’ utility ratings.

Figure 8. Visual summary of participants’ utility ratings.

Overall, the majority of participants found the tool easy to use and its functionality well integrated. Moreover, most participants also agreed that DataView facilitated a better understanding of bicycling behavior at various spatial resolutions and that the tool could help city planners or transportation departments make informed decisions on infrastructure design and implementation. However, participants’ agreement was more divided on whether or not the tool could prompt new hypotheses about the decisions that bicyclists make when navigating through a city or whether it could be helpful when generating an analytical report to prompt further action.

In addition to rating their agreement with the statements shown in Figures 7 and 8, participants were also asked to provide short answer recommendations for improvements that could be made to enhance DataView’s usability and utility. Six participants provided input. Some of the feedback echoed what had been distilled in the claims analysis and in participants’ essay responses, such as the desire for temporal filtering and spatial selection. Additionally, participants recommended incorporating information on where bicycle infrastructure already exists, to more effectively communicate to decision makers where improvements need to be made, and to better inform bicyclists about more preferred or safe routes. Participants also suggested incorporating other modes of travel and predictive modeling to provide a more complete representation of movement across the network and identify hotspots for accidents. Results from this post-implementation user study will be shared with Strava to help inform ongoing development of Metro’s visualization tools and services. In the following section, we highlight the impact of DataView in the context of client adoption and propose opportunities for future research and development based on the input from non-client domain experts.

DISCUSSION

Metro DataView has been delivered to city, state, and regional Departments of Transportation, local advocacy groups, and researchers across the globe. From smaller towns (e.g., Conway, Arkansas and Grey County, Ontario) to large government agencies (e.g., Colorado State Department of Transportation and Transport for London), Metro DataView is being used to inform the city planning process. For example, Transport for London leverages the interactive mapping tool and the underlying data that support it to generate network demand models and assess the potential for growth in bicycle transport throughout the Capital. In Queensland, Australia, the Department of Transport and Main Roads uses DataView to quantify the impact of bicycling infrastructure investment. The Florida Department of Transportation prioritizes street sweeping efforts based on insights extracted from the tool. In October 2017, Texas Public Radio published a piece on how the Texas Department of Transportation and local planning organizations were using the tool to better understand how bicyclists were moving across the state’s network to prioritize where to implement new facilities and bicycle infrastructure (Flahive 2017).

This work describes a unique circumstance: a novel and impactful cartographic product was conceptualized and created in an industry setting, while also being grounded in academic methodology and scholarship. On one hand, this situation created an opportunity that enabled the widespread adoption of an effective decision-making tool as summarized above. On the other hand, conducting this research in a commercial environment resulted in not being able to report on specific methods or findings from client-centered design studies that were carried out over a two-year product development cycle. To address this shortcoming, we employed a scenario-based design strategy to distill our collaborative efforts with clients into a hypothetical use case scenario and supporting claims analysis. While some transferable knowledge to other design studies is inevitably lost in the translation, this approach forced us to synthesize both the wealth and diversity of client feedback into a cohesive and guiding design narrative. This narrative served to abstract and characterize a real-world domain problem and helped to focus our follow-up evaluation methods toward assessing system features and functionality that were deemed essential by the target audience. Combining scenario-based design techniques with task assessment, usability, and utility evaluation enabled rich, multi-dimensional insights into how geovisualization tools can be designed to support city planning professionals in making cities safer and more efficient for bicyclists. While this multistage design study required significant time and effort, the depth of insight into the domain problem and effectiveness of the proposed design solution would not have been achievable if only a single method had been employed.

The impact and “success” of this work is a direct result of the multistage user-centered design model that guided its evolution. The value of engaging with intended users and stakeholders of a system during the development process cannot be overstated, echoing recent scholarship in interactive cartography (e.g., Slocum et al. 2003; Robinson et al. 2005; Roth et al. 2010; Delikostidis, van Elzakker, and Kraak 2016). Moreover, this work aligns with at least three opportunities proposed by Roth et al. (2017) for adapting user-centered design methodology to interactive cartographic studies: namely contextualizing and emphasizing the process (not just the result), conducting purposeful rather than convenient study participant sampling, and promoting and illustrating the value in being comprehensive and thorough. While user-centered design aims to ensure system success, defining and evaluating success is challenging. The question of “to what extent does the system meet or exceed the expectations and desires of its users?” can be subjective and inappropriate to quantify. Unlike controlled experiments that are replicable and generalizable, user-centered design studies tend to inform a more specific situation, making its findings transferable and insights contextual only to similar use cases (Sedlmair, Meyer, and Munzner 2012).

For Strava Metro, there was a clear need to develop interactive, visual ways of making activity data more accessible, usable, and useful. Feedback from organizations using the data indicated users’ frustration, confusion, and limited ability to generate valuable insight into the spatiotemporal patterns of bicycling behavior. Having identified this need, we conducted a multi-dimensional, in-depth long-term case study consisting of three major stages of evaluation: (1) scenario-based design, (2) insight discovery, and (3) usability and utility assessment. Stage one translated over two years of collaborative efforts—working with Departments of Transportation, local advocacy groups, and other city planning professionals—into a transferable abstraction of a real-world problem, while also illustrating design challenges and identifying focused opportunities for further user testing. Stages two and three of the study explored these focused opportunities through an insight discovery task and usability and utility assessment completed by seven non-client domain experts. This design framework was selected because the goal was to obtain both formative and summative insights, but with a focus on qualitative rather than quantitative results. Moreover, the intent was neither to assess nor quantify how the final system compared to other tools capable of delivering similar insights, but to design and implement a specific system to meet stakeholder and client needs.

Two key findings resulted from stage one of the study. The first is that the visualization solution produced needed to relay counts of bike trips across the road network and distinguish commute from recreation trip types. Second, inferences on relative activity density provided by the raster-to-vector heatmap were deemed insufficient for making informed planning decisions. Results from stages two and three of the study reinforced stage one claims made about DataView based on the synthesis of findings from working with clients over the product development cycle. For example, seven non-client domain and visualization experts demonstrated success in using the three different network views to identify prominent bike corridors, and successfully overlaid intersection and OD polygon data to make inferences about movement volumes, wait times, and popular origin/destination locations. These results also revealed important system shortcomings and opportunities for future work. For example, DataView’s current inability to relay information on the presence and condition of bicycling infrastructure limits users’ inferences on the connectedness of the network. Information on volume alone is insufficient for understanding how many people aren’t biking but could be if infrastructure was improved. Additionally, functionality to support enhanced spatial and temporal filtering would advance users’ abilities to make sense of the more nuanced patterns of commute and recreational behavior.

More broadly, there exist opportunities to extend the technical framework to support the aggregation, tiling, and mapping of multiple data sources to relay a more complete and representative depiction of how individuals and entities move and interact across a network. As noted above, crowdsourced fitness data represent only a subset of the active population, and should serve to complement and extend more traditional approaches to active transportation surveillance and analysis (Jestico, Nelson, and Winters 2016; Ferster et al. 2017; Lee and Sener 2020). User-generated fitness data, for example, can be combined with survey and counter data to more effectively model the flow of bicyclists across a network (Whitfield 2016). Additionally, supplemental data on crash incidents, roadway characteristics, and environmental factors can be integrated with crowdsourced activity data to help prioritize safety initiatives and inform why some routes are more popular than others (Quartuccio et al. 2014; Quercia, Schifanella, and Aiello 2014; Sun and Mobasheri 2017). Multiple data sources, together with civic collaboration and input, foster a more complete and honest urban interface (Mattern 2014).

CONCLUSION

In this work we presented a design strategy to address a problem-driven research question: how to make large amounts of aggregated and anonymized individual movement data more accessible and actionable to stakeholders in the city planning process? We employed cartographic principles of representation, geovisual analytics techniques, and best practices in UI/UX design to arrive at an interactive mapping tool that can communicate the complex flow of bicycling traces across urban street networks to experts trained in transportation analytics and modeling, as well as a broader, public audience. A major contribution of this work is our approach to combining scenario-based design methods with a post-implementation user study to characterize the domain problem; map essential user tasks to data representations and interactions; articulate the design rationale; and validate the design solution.

Beyond presenting an adaptable and flexible design approach, we proposed an innovative technical framework for rendering, and enabling interaction with, large geospatial datasets in the browser. Additionally, we explored a hierarchical visualization design approach that seamlessly transitions raster to vector data representations across map scale. The intent of this approach was to leverage scale-specific advantages of each data representation type in the context of web mapping. Rendering activity traces as raster pixels at small map scales and as vector paths at large map scales is an effective strategy for achieving reasonable client performance while enabling more flexibility in map interaction. While the raster-to-vector heatmap did not meet the needs of this study’s target audience, there exist opportunities to extend this visualization paradigm through design and evaluation of interaction strategies across scale (i.e., addressing the disconnect between visual consistency and interaction inconsistency across scale). Future research is also needed to address design limitations identified by study participants. While incorporating more advanced spatial and temporal filtering functionality into the interface is a relatively clear need, inclusion of predictive modeling, bicycling infrastructure information, and data on other modes of transportation will require additional user input to capture the context of the need and the breadth of its applicability to the target audience group.

ACKNOWLEDGEMENTS

We wish to thank Strava Metro for its commitment to helping make cities safer and better for bicyclists and pedestrians. This commitment is what fostered the ideation and development of the geovisualization tool, Metro DataView, described in this paper. More specifically, we would like to acknowledge Davis Kitchel for his contributions towards solving DataView’s technical challenges; Kristin Rousseau for her efforts in managing the transition of DataView from prototype to product; and Michael Horvath for championing Strava Metro’s mission and believing in its potential. We also wish to thank user study participants for their time and feedback and the anonymous reviewers for their helpful input.

FUNDING

This work was supported by Strava and the National Science Foundation under IGERT Award #DGE-1144860, Big Data Social Science, and The Pennsylvania State University.

DISCLOSURE STATEMENT

The core of the technology development described in this work was carried out while the first author was employed by Strava Metro as the lead user experience designer for the Metro DataView application described in this work.

NOTE

1 Client feedback collected over the two-year development cycle was part of Strava business activities, thus not considered to be “research” at the time and as a result, is not directly reportable here.

REFERENCES

Andrienko, Gennady, and Natalia Andrienko. 2010. “A General Framework for Using Aggregation in Visual Exploration of Movement Data.” The Cartographic Journal 47 (1): 22–40. https://doi.org/10.1179/000870409X12525737905042.

Andrienko, Gennady, Natalia Andrienko, Jason Dykes, Sara Irina Fabrikant, and Monica Wachowicz. 2008. “Geovisualization of Dynamics, Movement and Change: Key Issues and Developing Approaches in Visualization Research.” Information Visualization 7: 173–180. https://doi.org/10.1057/IVS.2008.23.

Andrienko, Gennady, Natalia Andrienko, Wei Chen, Ross Maciejewski, and Ye Zhao. 2017. “Visual Analytics of Mobility and Transportation: State of the Art and Further Research Directions.” IEEE Transactions on Intelligent Transportation Systems 18 (8): 2232–2249. https://doi.org/10.1109/TITS.2017.2683539.

Baker, Kevin, Kristien Ooms, Steven Verstockt, Pascal Brackman, Philippe De Maeyer, and Rik Van de Walle. 2017. “Crowdsourcing a Cyclist Perspective on Suggested Recreational Paths in Real-World Networks.” Cartography and Geographic Information Science 44 (5): 422–435. https://doi.org/10.1080/15230406.2016.1192486.

Boss, Darren, Trisalyn Nelson, Meghan Winters, and Colin J. Ferster. 2018. “Using Crowdsourced Data to Monitor Change in Spatial Patterns of Bicycle Ridership.” Journal of Transport & Health 9: 226–233. https://doi.org/10.1016/j.jth.2018.02.008.

Boyandin, Ilya, Enrico Bertini, Peter Bak, and Denis Lalanne. 2011. “Flowstrates: An Approach for Visual Exploration of Temporal Origin‐Destination Data.” Computer Graphics Forum 30 (3): 971–980. https://doi.org/10.1111/j.1467-8659.2011.01946.x.

Brewer, Cynthia A., and Barbara P. Buttenfield. 2007. “Framing Guidelines for Multi-Scale Map Design Using Databases at Multiple Resolutions.” Cartography and Geographic Information Science 34 (1): 3–15. https://doi.org/10.1559/152304007780279078.

Brooke, John. 1996. “SUS: A ‘quick and dirty’ usability scale.” In Usability Evaluation in Industry, edited by P. W. Jordan, B. Thomas, B. A. Weerdmeester, and A. L. McClelland, 189–194. London: Taylor and Francis.

Brügger, Annina, Sara Irina Fabrikant, and Arzu Çöltekin. 2017. “An Empirical Evaluation of Three Elevation Change Symbolization Methods along Routes in Bicycle Maps.” Cartography and Geographic Information Science 44 (5): 436–451. https://doi.org/10.1080/15230406.2016.1193766.

Buchin, Kevin, Bettina Speckmann, and Kevin Verbeek. 2011. “Flow Map Layout via Spiral Trees.” IEEE Transactions on Visualization and Computer Graphics 17 (12): 2536–2544. https://doi.org/10.1109/TVCG.2011.202.

Carroll, John M., and Mary Beth Rosson. 1992. “Getting Around the Task-Artifact Cycle: How to Make Claims and Design by Scenario.” ACM Transactions on Information Systems (TOIS) 10 (2): 181–212. https://doi.org/10.1145/146802.146834.

Chen, Wei, Fangzhou Guo, and Fei-Yue Wang. 2015. “A Survey of Traffic Data Visualization.” IEEE Transactions on Intelligent Transportation Systems 16 (6): 2970–2984. https://doi.org/10.1109/TITS.2015.2436897.

Conrow, Lindsey, Elizabeth Wentz, Trisalyn Nelson, and Christopher Pettit. 2018. “Comparing Spatial Patterns of Crowdsourced and Conventional Bicycling Datasets.” Applied Geography 92: 21–30. https://doi.org/10.1016/j.apgeog.2018.01.009.

Debiasi, Alberto, Bruno Simões, and Raffaele De Amicis. 2014. “Supervised Force Directed Algorithm for the Generation of Flow Maps.” In WSCG 2014 – Full Papers Proceedings, edited by Vaclav Skala, 193–202. Plzen, Czech Republic: Union Agency.

Delikostidis, Ioannis, Corné PJM van Elzakker, and Menno-Jan Kraak. 2016. “Overcoming Challenges in Developing More Usable Pedestrian Navigation Systems.” Cartography and Geographic Information Science 43 (3): 189–207. https://doi.org/10.1080/15230406.2015.1031180.

DiGioia, Jonathan, Kari Edison Watkins, Yanzhi Xu, Michael Rodgers, and Randall Guensler. 2017. “Safety Impacts of Bicycle Infrastructure: A Critical Review.” Journal of Safety Research 61: 105–119. https://doi.org/10.1016/j.jsr.2017.02.015.

Eriksson, Oskar, and Emil Rydkvist. 2015. “An In-Depth Analysis of Dynamically Rendered Vector-Based Maps with WebGL Using Mapbox GL JS.” Master’s thesis, Linkopings Universitet.

Ferster, Colin Jay, Trisalyn Nelson, Meghan Winters, and Karen Laberee. 2017. “Geographic Age and Gender Representation in Volunteered Cycling Safety Data: A Case Study of BikeMaps.org.” Applied Geography 88: 144–150. https://doi.org/10.1016/j.apgeog.2017.09.007.

Few, Stephen. 2004. “Dashboard Confusion.” Perceptual Edge. Accessed November 1, 2020. https://www.perceptualedge.com/articles/ie/dashboard_confusion.pdf.

Flahive, Paul. 2017. “Big Data Sheds Light on Where Cyclists Go.” Texas Public Radio, October 17, 2017. http://www.tpr.org/post/big-data-sheds-light-where-cyclists-go.

Griffin, Greg P., and Junfeng Jiao. 2015. “Where Does Bicycling for Health Happen? Analysing Volunteered Geographic Information through Place and Plexus.” Journal of Transport & Health 2 (2): 238–247. https://doi.org/10.1016/j.jth.2014.12.001.

Guo, Diansheng. 2009. “Flow Mapping and Multivariate Visualization of Large Spatial Interaction Data.” IEEE Transactions on Visualization and Computer Graphics 15 (6): 1041–1048. https://doi.org/10.1109/TVCG.2009.143.

Harkey, David L., Donald W. Reinfurt, and Alex Sorton. 1998. The Bicycle Compatibility Index: A Level of Service Concept, Implementation Manual. No. FHWA-RD-98-095. United States Federal Highway Administration. https://safety.fhwa.dot.gov/tools/docs/bci.pdf.

Harrower, Mark, and Cynthia A. Brewer. 2003. “ColorBrewer.org: An Online Tool for Selecting Colour Schemes for Maps.” The Cartographic Journal 40 (1): 27–37. https://doi.org/10.1179/000870403235002042.

Jenks, George F. 1967. “The Data Model Concept in Statistical Mapping.” International Yearbook of Cartography 7: 186–190.

Jestico, Ben, Trisalyn Nelson, and Meghan Winters. 2016. “Mapping Ridership Using Crowdsourced Cycling Data.” Journal of Transport Geography 52: 90–97. https://doi.org/10.1016/j.jtrangeo.2016.03.006.

Kitchin, Rob, Sophia Maalsen, and Gavin McArdle. 2016. “The Praxis and Politics of Building Urban Dashboards.” Geoforum 77: 93–101. https://doi.org/10.1016/j.geoforum.2016.10.006.

Krzywinski, Martin, Jacqueline Schein, Inanc Birol, Joseph Connors, Randy Gascoyne, Doug Horsman, Steven J. Jones, and Marco A. Marra. 2009. “Circos: An Information Aesthetic for Comparative Genomics.” Genome Research 19 (9): 1639–1645. https://doi.org/10.1101/gr.092759.109.

Landis, Bruce, Venkat Vattikuti, and Michael Brannick. 1997. “Real-Time Human Perceptions: Toward a Bicycle Level of Service.” Transportation Research Record 1578 (1): 119–126. https://doi.org/10.3141/1578-15.

Laube, Patrick. 2015. “The Low Hanging Fruit Is Gone: Achievements and Challenges of Computational Movement Analysis.” SIGSPATIAL Special 7 (1): 3–10. https://doi.org/10.1145/2782759.2782762.

Lee, Kyuhyun, and Ipek Nese Sener. 2020. “Strava Metro Data for Bicycle Monitoring: A Literature Review.” Transport Reviews 1–21. https://doi.org/10.1080/01441647.2020.1798558.

MacEachren, Alan M., Anuj Jaiswal, Anthony C. Robinson, Scott Pezanowski, Alexander Savelyev, Prasenjit Mitra, Xiao Zhang, and Justine Blanford. 2011. “Senseplace2: Geotwitter Analytics Support for Situational Awareness.” In IEEE Conference on Visual Analytics Science and Technology 2011, edited by Silvia Miksch and Matt Ward, 181–190. Piscataway, NJ: IEEE.

Martin, Michael E., and Nadine Schuurman. 2020. “Social Media Big Data Acquisition and Analysis for Qualitative GIScience: Challenges and Opportunities.” Annals of the American Association of Geographers 110 (5): 1335–1352. https://doi.org/10.1080/24694452.2019.1696664.

Mattern, Shannon. 2014. “Interfacing Urban Intelligence,” Places Journal, April 2014. https://doi.org/10.22269/140428.

Mead, Rashauna. 2014. “Expert Perspectives on the Design and Use of Learning Materials for Neocartographic Interfaces” Ph.D. diss., University of Wisconsin–Madison.

Nielsen, Jakob. 1993. Usability Engineering. Cambridge, MA: AP Professional. https://doi.org/10.1016/C2009-0-21512-1.

Peuquet, Donna J. 1981. “An Examination of Techniques for Reformatting Digital Cartographic Data/Part 2: The Vector-to-Raster Process.” Cartographica 18 (3): 21–33. https://doi.org/10.3138/K632-661R-K76J-1R80.

Pezanowski, Scott, Alan M. MacEachren, Alexander Savelyev, and Anthony C. Robinson. 2018. “SensePlace3: A Geovisual Framework to Analyze Place–Time–Attribute Information in Social Media.” Cartography and Geographic Information Science 45 (5): 420–437. https://doi.org/10.1080/15230406.2017.1370391.

Phan, Doantam, Ling Xiao, Ron Yeh, Pat Hanrahan, and Terry Winograd. 2005. “Flow Map Layout.” In IEEE Symposium on Information Visualization (InfoVis 05), edited by John Stasko and Matt Ward, 219–224. Piscataway, NJ: IEEE.

Quartuccio, Jacob, Simone Franz, Christian Gonzalez, Naomi Kenner, David M. Cades, Joseph B. Sala, Steven R. Arndt, and Patrick McKnight. 2014. “Seeing is Believing: The Use of Data Visualization to Identify Trends for Cycling Safety.” In Proceedings of the Human Factors and Ergonomics Society Annual Meeting 58 (1): 1361–1365. Los Angeles: SAGE Publications. https://doi.org/10.1177/1541931214581284.

Quercia, Daniele, Rossano Schifanella, and Luca Maria Aiello. 2014. “The Shortest Path to Happiness: Recommending Beautiful, Quiet, and Happy Routes in the City.” In Proceedings of the 25th ACM Conference on Hypertext and Social Media, 116–125. New York: ACM. https://doi.org/10.1145/2631775.2631799.

Rae, Alasdair. 2011. “Flow-Data Analysis with Geographical Information Systems: A Visual Approach.” Environment and Planning B: Planning and Design 38 (5): 776–794. https://doi.org/10.1068/b36126.

Rinzivillo, Salvatore, Franco Turini, Vania Bogorny, Christine Körner, Bart Kuijpers, and Michael May. 2008. “Knowledge Discovery from Geographical Data.” In Mobility, Data Mining and Privacy, edited by Giannotti, Fosca, and Dino Pedreschi, 243–265. Berlin: Springer-Verlag. https://doi.org/10.1007/978-3-540-75177-9_10.

Robinson, Anthony C., Cary L. Anderson, and Sterling D. Quinn. 2020. “Evaluating Geovisualization for Spatial Learning Analytics.” International Journal of Cartography 6 (3): 331–349. https://doi.org/10.1080/23729333.2020.1735034.

Robinson, Anthony C., Jin Chen, Eugene J. Lengerich, Hans G. Meyer, and Alan M. MacEachren. 2005. “Combining Usability Techniques to Design Geovisualization Tools for Epidemiology.” Cartography and Geographic Information Science 32 (4): 243–255. https://doi.org/10.1559/152304005775194700.

Robinson, Arthur H. 1967. “The Thematic Maps of Charles Joseph Minard.” Imago Mundi 21 (1): 95–108. https://doi.org/10.1080/03085696708592302.

Romanillos, Gustavo, and Martin Zaltz Austwick. 2016. “Madrid Cycle Track: Visualizing the Cyclable City.” Journal of Maps 12 (5): 1218–1226. https://doi.org/10.1080/17445647.2015.1088901.

Rosson, Mary Beth, and John M. Carroll. 2002. “Scenario-based Design.” In The Human-Computer Interaction Handbook: Fundamentals, Evolving Technologies and Emerging Applications, edited by J. Jacko and A. Sears, 1032–1050. New York: Lawrence Erlbaum Associates.

Roth, Robert E. 2013. “An Empirically-Derived Taxonomy of Interaction Primitives for Interactive Cartography and Geovisualization.” IEEE Transactions on Visualization and Computer Graphics 19 (12): 2356–2365. https://doi.org/10.1109/TVCG.2013.130.

———. 2017. “User Interface and User Experience (UI/UX) Design.” In The Geographic Information Science & Technology Body of Knowledge, edited by John P. Wilson. https://doi.org/10.22224/gistbok/2017.2.5.

Roth, Robert E., Alan M. MacEachren, and Craig A. McCabe. 2009. “A Workflow Learning Model to Improve Geovisual Analytics Utility.” In Proceedings of the 24th International Cartographic Conference. https://icaci.org/files/documents/ICC_proceedings/ICC2009/html/refer/20_8.pdf.

Roth, Robert E., Arzu Çöltekin, Luciene Delazari, Homero Fonseca Filho, Amy L. Griffin, Andreas Hall, Jari Korpi, Ismini Lokka, André Mendonça, Kristien Ooms, et al. 2017. “User Studies in Cartography: Opportunities for Empirical Research on Interactive Maps and Visualizations.” International Journal of Cartography 3 (1): 61–89. https://doi.org/10.1080/23729333.2017.1288534.

Roth, Robert E., Kevin S. Ross, Benjamin G. Finch, Wei Luo, and Alan M. MacEachren. 2010. “A User-centered Approach for Designing and Developing Spatiotemporal Crime Analysis Tools.” In Extended Abstracts Volume, GIScience 2010, edited by Ross Purves and Robert Weibel. http://giscience2010.org/pdfs/paper_238.pdf.

Savelyev, Alexander, and Alan M. MacEachren. 2020. “Advancing the Theory and Practice of System Evaluation: A Case Study in Geovisual Analytics of Social Media.” International Journal of Cartography 6 (2): 202–221. https://doi.org/10.1080/23729333.2019.1637488.

Sedlmair, Michael, Miriah Meyer, and Tamara Munzner. 2012. “Design Study Methodology: Reflections from the Trenches and the Stacks.” IEEE Transactions on Visualization and Computer Graphics 18 (12): 2431–2440. https://doi.org/10.1109/TVCG.2012.213.

Selala, M. K., and W. Musakwa. 2016. “The Potential of Strava Data to Contribute in Non-Motorised Transport (Nmt) Planning in Johannesburg.” The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences 41: 587–594. https://doi.org/10.5194/isprs-archives-XLI-B2-587-2016.

Shneiderman, Ben. 1996. “The Eyes Have It: A Task by Data Type Taxonomy for Information Visualizations.” In IEEE Symposium on Visual Languages, edited by Regina Spencer Sipple, 336–343. Los Alamitos, CA: IEEE Computer Society Press. https://doi.org/10.1109/VL.1996.545307.

Shneiderman, Ben, and Catherine Plaisant. 2006. “Strategies for Evaluating Information Visualization Tools: Multi-Dimensional In-Depth Long-Term Case Studies.” In Proceedings of BELIV’06, 1–7. Venice: ACM. https://doi.org/10.1145/1168149.1168158.

Sila-Nowicka, Katarzyna, and Piyushimita Thakuriah. 2016. “The Trade-Off Between Privacy and Geographic Data Resolution. A Case of GPS Trajectories Combined with the Social Survey Results.” ISPRS-International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XLI-B2: 535–542. https://doi.org/10.5194/isprs-archives-XLI-B2-535-2016.

Slocum, Terry A., Daniel C. Cliburn, Johannes J. Feddema, and James R. Miller. 2003. “Evaluating the Usability of a Tool for Visualizing the Uncertainty of the Future Global Water Balance.” Cartography and Geographic Information Science 30 (4): 299–317. https://doi.org/10.1559/152304003322606210.

Smyth, Michael, Ingi Helgason, Martin Brynskov, Ivica Mitrovic, and Gianluca Zaffiro. 2013. “UrbanIxD: Designing Human Interactions in the Networked City.” In CHI’13 Extended Abstracts on Human Factors in Computing Systems, edited by Patrick Baudisch, Michel Beaudouin-Lafon, and Wendy E. Mackay, 2533–2536. New York: ACM. https://doi.org/10.1145/2468356.2468823.

Stolte, Chris, Diane Tang, and Pat Hanrahan. 2003. “Multiscale Visualization Using Data Cubes.” IEEE Transactions on Visualization and Computer Graphics 9 (2): 176–187. https://doi.org/10.1109/TVCG.2003.1196005.

Strava. 2019. “Strava Releases 2019 Year In Sport Data Report.” Strava Press. Accessed November 22, 2020. https://blog.strava.com/press/strava-releases-2019-year-in-sport-data-report.

Su, Jason G., Meghan Winters, Melissa Nunes, and Michael Brauer. 2010. “Designing a Route Planner to Facilitate and Promote Cycling in Metro Vancouver, Canada.” Transportation Research Part A: Policy and Practice 44 (7): 495–505. https://doi.org/10.1016/j.tra.2010.03.015.

Sun, Yeran, and Amin Mobasheri. 2017. “Utilizing Crowdsourced Data for Studies of Cycling and Air Pollution Exposure: A Case Study Using Strava Data.” International Journal of Environmental Research and Public Health 14 (3): 274. https://doi.org/10.3390/ijerph14030274.

Swan, Melanie. 2012. “Sensor Mania! The Internet of Things, Wearable Computing, Objective Metrics, and the Quantified Self 2.0.” Journal of Sensor and Actuator Networks 1 (3): 217–253. https://doi.org/10.3390/jsan1030217.

———. 2013. “The Quantified Self: Fundamental Disruption in Big Data Science and Biological Discovery.” Big Data 1 (2): 85–99. https://doi.org/10.1089/big.2012.0002.

Tobler, Waldo R. 1987. “Experiments in Migration Mapping by Computer.” The American Cartographer 14 (2): 155–163. https://doi.org/10.1559/152304087783875273.

Wessel, Nate, and Michael Widener. 2015. “Rethinking the Urban Bike Map for the 21st Century.” Cartographic Perspectives 81: 6–22. https://doi.org/10.14714/CP81.1243.

White, Christopher E., David Bernstein, and Alain L. Kornhauser. 2000. “Some Map Matching Algorithms for Personal Navigation Assistants.” Transportation Research Part C: Emerging Technologies 8 (1–6): 91–108. https://doi.org/10.1016/S0968-090X(00)00026-7.

Whitfield, Geoffrey P. 2016. “Association Between User-Generated Commuting Data and Population-Representative Active Commuting Surveillance Data—Four Cities, 2014–2015.” MMWR. Morbidity and Mortality Weekly Report 65: 959–962. https://doi.org/10.15585/mmwr.mm6536a4.

Wood, Jo, Aidan Slingsby, and Jason Dykes. 2011. “Visualizing the Dynamics of London’s Bicycle-Hire Scheme.” Cartographica 46 (4): 239–251. https://doi.org/10.3138/carto.46.4.239.

Wood, Jo, Jason Dykes, and Aidan Slingsby. 2010. “Visualisation of Origins, Destinations and Flows with OD Maps.” The Cartographic Journal 47 (2): 117–129. https://doi.org/10.1179/000870410X12658023467367.

Zanella, Andrea, Nicola Bui, Angelo Castellani, Lorenzo Vangelista, and Michele Zorzi. 2014. “Internet of Things for Smart Cities.” IEEE Internet of Things Journal 1 (1): 22–32. https://doi.org/10.1109/JIOT.2014.2306328.

Zeile, Peter, Bernd Resch, Jan-Philipp Exner, and Günther Sagl. 2015. “Urban Emotions: Benefits and Risks in Using Human Sensory Assessment for the Extraction of Contextual Emotion Information in Urban Planning.” In Planning Support Systems and Smart Cities, edited by Stan Geertman, Joseph Ferreira Jr., Robert Goodspeed, and John Stillwell, 209–225. Berlin: Springer-Verlag. https://doi.org/10.1007/978-3-319-18368-8_11.

Zeng, Wei, Chi‐Wing Fu, Stefan Müller Arisona, and Huamin Qu. 2013. “Visualizing Interchange Patterns in Massive Movement Data.” Computer Graphics Forum 32 (3.3): 271–280. https://doi.org/10.1111/cgf.12114.

Zhou, Zhiguang, Linhao Meng, Cheng Tang, Ying Zhao, Zhiyong Guo, Miaoxin Hu, and Wei Chen. 2019. “Visual Abstraction of Large Scale Geospatial Origin-Destination Movement Data.” IEEE Transactions on Visualization and Computer Graphics 25 (1): 43–53. https://doi.org/10.1109/TVCG.2018.2864503.

Zhu, Xi, and Diansheng Guo. 2014. “Mapping Large Spatial Flow Data with Hierarchical Clustering.” Transactions in GIS 18 (3): 421–435. https://doi.org/10.1111/tgis.12100.

Zhu, Xi, Diansheng Guo, Caglar Koylu, and Chongcheng Chen. 2019. “Density-Based Multi-Scale Flow Mapping and Generalization.” Computers, Environment and Urban Systems 77: 101359. https://doi.org/10.1016/j.compenvurbsys.2019.101359.