Chapter 11
In This Chapter
Learning about life in academia
Looking at public sector big data responsibilities
Identifying big data roles that impact healthcare
Not all the jobs in big data are concerned with driving profits or revenue. Many roles in big data out there are in the public sector or university settings working toward public safety, medical discovery, and the advancement of science. Instead of looking for the best pricing combination or executing a online shopping basket analysis, you may find yourself wanting to uncover future signs of cancer, predict where the next disease may strike, or run simulations on clinical trials to find a cure for a disease.
This chapter covers the role big data plays in different areas of the government, how higher education leverages big data, and how you can explore the job possibilities.
Higher education plays a very important role in the big data ecosystem. From training new talent to pushing the envelope of innovation, there are many exciting areas available for people interested in big data careers.
The foundation of academia is teaching. The role of the university is critical for teaching the next crop of mathematicians, scientists, and business analysts. If you’re considering a role as a professor, you’ll find big data to be a very rewarding and challenging field.
Here’s the typical career path of a college professor after earning a PhD:
Tenure is a status that is granted to select professors guaranteeing them employment after they complete a probationary period. Professors are tenured to ensure that they won’t lose their jobs because of political infighting or other nonteaching issues. Tenure provides them the academic freedom to teach in their own way without fear of repercussions.
What about your buddy at work who also teaches a class at the local university? You can be that person, too. There is a class of educators at the university level who are not on the formal professor track. They might be called lecturers or adjunct professors. These titles are reserved for part-time faculty members who are at the university for teaching purposes and usually have other jobs. They don’t usually have the research or service requirements that other faculty members have — they’re focused on classroom teaching. All schools, from the largest universities to the smallest community colleges, employ part-time faculty to help meet classroom demand or bring in professionals with specialized knowledge to add value to their programs.
Universities are centers for research. When you think of research, you may imagine lab coats, bubbling beakers, and Bunsen burners. But much of the research that is conducted today is focused on data analysis. There are a variety of research jobs in universities — from research assistants all the way to post-doctorate researchers. These roles are often funded through grants from the National Science Foundation, the Department of Energy, the Department of Defense, the National Institutes of Health, and many other agencies in conjunction with university research. Some of the positions may last only as long as the funding lasts.
Nonprofit organizations often serve an entire industry as a whole. They’re funded by member companies to drive the following:
Companies and organizations often come together to build information and data systems that all member firms may benefit from, even though they’re competitors in some cases. For example, The Data Warehouse Institute (http://tdwi.org) is made up of members from competing firms. However, they collaborate to promote the industry as a whole.
Another example is the National Association of Homebuilders (NAHB), an organization that has been around for more than 70 years and has 140,000 member firms. Homebuilding represents a huge part of the U.S. gross domestic product (GDP) when you factor in all the associated trades and industries. Data, insights, and research from the NAHB impacts financial markets, public policy, building and safety standards, and a number of other key issues. Figure 11-1 shows a heat map representing the leading building markets in the United States to illustrate the new construction housing recovery. Heat maps show concentrations, or heat, of groups of data by a concentration of a particular color.
Courtesy of the National Association of Home Builders
Figure 11-1: An NAHB housing heat map.
Public sector jobs are typically segregated into civilian and defense. Their customers are the agencies or organizations they’re tasked to serve. These groups can be local, state, or federal. Public sector jobs are focused on supporting infrastructure, services, public safety, and regulatory needs. For example, in public safety, you may analyze crime data to predict where to apply funding for a police force, or you may analyze disease data to help provide input for policymakers and lawmakers. The insights from big data can have a huge impact on the government.
McKinsey & Company, a management consulting firm, has identified three major areas for the uses of big data in public sector administration:
Big data can also be used to predict potential crime networks, terror connections, and other crime risks. Big data analysis can be used to spot and predict health and disease information that can be used by policymakers who are appropriating funding for medical research.
Civilian organizations are local, state, and federal government groups that are not associated with defense and intelligence. These include law enforcement groups like the local police and the Federal Bureau of Investigation (FBI), which is a part of the Department of Justice.
Life in these groups differs vastly from one agency to the next, but they do share a lot of the same characteristics:
Somebody once said that steady plodding brings prosperity. So if you’re inclined to leverage your skills for the benefit of society, the public sector could be a great fit!
Some government agencies have been leveraging big data to run analytics to identify potential threats to the United States. These threats include such things as spying from foreign countries, attacks to our financial institutions, and cyber-terrorism. The task of sifting through unstructured data like voice files (which could include recorded phones calls or voicemails) and emails in search of patterns to identify future threats is a classic case of using big data.
Consider an example of government intelligence agencies that need to isolate events related to suspected conversations related to terrorist activities. These events can occur via emails, text, voice, and other media. The first step would be to isolate noise from the relevant content, creating a group of interest. With big data, agencies can
Given the technical and data requirements, you can see that government agencies have a high demand for big data analysts. People who enter a three-letter agency like the FBI, Central Intelligence Agency (CIA), National Security Agency (NSA), and Defense Intelligence Agency (DIA), enter at a rank structure somewhat similar to the military. There are job classifications that determine the level of responsibility and associated pay grades.
Figure 11-2 shows a posting taken from www.cia.gov for a data scientist within the agency. As you can see from this posting, the salary range is quite wide and depends totally on your experience level. Much of the work is similar to a corporate IT job within the commercial sector. The main difference is that the objective of the analysis involves national defense instead of assisting business units in increasing profitability.
Figure 11-2: A posting for a data scientist job at the CIA.
McKinsey & Company is a prestigious consulting firm that works with leaders in business and government around the world. It made a huge impact in the world of big data with some very big claims in its 2011 report, Big Data: The Next Frontier for Innovation, Competition, and Productivity. McKinsey projected the impact of big data on healthcare in the United States could top $300 billion per year, which is more than double the total annual health spending of Spain.
Using information to improve healthcare is in the infancy stages today, which should bode well for the future of big data job growth. There are three factors to support this:
Healthcare companies need to derive insights from the data they collect, but this is complicated by a lack of system integration. This lack of integration is only an opportunity for growth in jobs, responsibility, and innovation. This means that big data people entering the workforce today will be part of a huge movement to bring massive advancement across this complex network. There is great room for innovation when gaps like this exist.
You may wonder what this means for big data? This leads us to the final factor that promises huge job potential for big data workers in the near future and long term. With millions of people using digital devices within their healthcare network, there will be an enormous glut of data from which we can pull untold insights. I say “untold” here, because we just don’t fully understand what will be discovered. That is the big promise of big data!