Data scientists are solving a hard problem: How can organizations convert the rapidly growing deluge of data into insights that will make them more successful? But recruiting data scientists is a challenge for most firms — McKinsey estimates that by next year, the United States will face a shortage of 140,000 to 190,000 data scientists. Moreover, this problem gets even worse further up the org chart: The U.S. needs nearly 1.5 million data-literate managers who can make use of the insights produced by the data scientists.
Unlike some professions where the distribution of jobs is spread fairly evenly across the United States — for example, the concentration of elementary school teachers does not change much from California to New York — the distribution of data scientists varies considerably. While no source gives a definitive answer, a variety of information gives us clues about current trends.
First, we can look at the number of people working in closely related professions, such as computer scientists and statisticians, as a share of total workers. According to the Center for Data Innovation, Maryland, Virginia and Delaware top the list for employing workers in statistics and database management, and Washington, Massachusetts and Virginia lead in software service jobs, such as computer programming and software development. North Dakota, Wyoming and South Dakota, along with Mississippi, Idaho and Wyoming, rank last in these two areas, respectively.
Second, we can see which states have the most job listings for data scientists as a share of total job listings. On this metric, Washington is the clear front-runner, with Maryland, Massachusetts and Virginia as its closest peers. At the back of the pack, Louisiana, Montana and Mississippi have the lowest share of data science job listings.
Third, we can determine which states have the most active data science community by measuring participation in data science events. The top three states for this metric are New York, California and Massachusetts — all states with large metropolitan tech hubs. They have thriving data science communities where knowledge-sharing, network-building and collaboration are common. But data scientists in Mississippi, South Dakota or Wyoming are likely pretty lonely, as none of these states has an active data science community.
The biggest question for most state policymakers is how to change the status quo. One important factor is the pipeline for data scientists. At the high school level, the best metric for whether schools are preparing students for careers in data science is the percent of students taking computer science and statistics advanced placement (AP) tests and these students’ test scores. Massachusetts leads the nation with the highest ratio of students taking the computer science or statistics AP tests compared to other tests. However, while its students perform well, it is not the highest ranked. This distinction goes to Utah and Illinois, which tie for the top position, though both states have a lower ratio of students taking these tests. This suggests that these states need to find a way to scale their programs to more students. Finally, some states, such as Mississippi, Louisiana and New Mexico, have few students taking these tests, and those who do perform relatively poorly, on average.
As states vie to be the front-runner in data science, regardless of position, every state should recognize the importance of data science jobs and grow their data science talent if they want to be competitive in the data economy.
Note: Rankings in this article are from the report The Best States for Data Innovation published by the Center for Data Innovation.