"It’s not about big data, it’s about big understanding… Understanding strengthens the link between data tools and policy action." These were the reflections of Jack Dangermond, Founder and President of the Esri, at the 14th convening of the Project on Municipal Innovation Advisory Group (PMI-AG). Dangermond was joined by leaders from America’s largest cities to discuss how municipal Chief Data Officers can impact policy, emphasizing ways that data – and in particular, data visualization – might be used in the fight against poverty and inequality.
Big data analytics and data visualization have made great strides in recent years. Building and food inspections are better targeted; congestion on roads and public transport is monitored in real time; natural disasters are preemptively simulated; and city "story maps" are used to engage citizens in municipal service delivery, small business decisions, and land use planning. However, the palpable sense of anticipation among PMI participants suggested that using big data and data visualization to tackle deeper systemic issues proves more difficult.
With regards to poverty and inequality, progress in data visualization has been largely limited to what several participants described as "sad maps." Maps for crime rates, school retention, and infant mortality measures all tell similar geographical stories: the darkest shades of the map, indicating the most severe disparities, represent the same few neighborhoods. Participants agreed that all too often these maps prompt concern, but offer few suggestions on how to shape policy.
Over the course of the panel, however, it became clear that several emerging data-orientated approaches have the potential to shape policies that address poverty and inequality:
- Overlay "sad maps" with the existing geographical distribution of service delivery to inform future budget allocations. Several participants are in the process of using maps to determine whether there is underlying discrimination in infrastructure investments.
- Use spatial correlation analysis to identify and understand positive deviance. That is, given what we know about a neighborhood’s context, which areas are performing better than they ought to be? For instance, what can we learn from high poverty neighborhoods where obesity is low?
- Use regression over space and time to identify root causes. In doing so, the most important lead indicators can be used to identify those who may benefit most from early intervention. In Chicago, intensive academic tutoring is targeted at those who fail Algebra 1; the best predictor of school dropout. A similar approach is being employed to analyze patterns of eviction in New York that are most likely to lead to homelessness.
Similarly, getting the best out of big data requires engagement with the public. Several Chief Data Officers explained that they spend a lot of time talking with the public to understand what is happening "on the ground." Data is only one piece of the puzzle. In a bid to share costs and broaden capabilities, a growing number of cities are partnering with leading research universities through the White House backed MetroLab network. It is hoped that these partnerships will underpin the transition of big data and data visualization across mayoral administrations.
Like e-mail, big data and data visualization will likely become ubiquitous. For systemic policy issues though, moving from data to information products and ultimately to policy action is no easy task. The ideas raised during the PMI-AG discussion suggests big data and data visualization have the potential to improve policies targeting poverty and inequality. With that said, there is much work and discovery to come before we can make firm conclusions about their efficacy. In the words of Thomas Edison – "the value of an idea lies in the using of it."