Table of Contents
Series page
Title page
Copyright page
Series Foreword
Preface
Acknowledgments
1 What Is Data Science?
2 What Are Data, and What Is a Data Set?
3 A Data Science Ecosystem
4 Machine Learning 101
5 Standard Data Science Tasks
6 Privacy and Ethics
7 Future Trends and Principles of Success
Glossary
Further Readings
References
Index
About Author
List of Tables
Table 1 A Data Set of Classic Books
Table 2 Diabetes Study Data Set
Table 3 A Data Set of Emails: Spam or Not Spam?
List of Illustrations
Figure 1 A skills-set desideratum for a data scientist.
Figure 2 The DIKW pyramid (adapted from Kitchin 2014a).
Figure 3 Data science pyramid (adapted from Han, Kamber, and Pei 2011).
Figure 4 The CRISP-DM life cycle (based on figure 2 in Chapman, Clinton, Kerber, et al. 1999).
Figure 5 The CRISP-DM stages and tasks (based on figure 3 in Chapman, Clinton, Kerber, et al. 1999).
Figure 6 A typical small-data and big-data architecture for data science (inspired by a figure from the Hortonworks newsletter, April 23, 2013, https://hortonworks.com/blog/hadoop-and-the-data-warehouse-when-to-use-which).
Figure 7 The traditional process for building predictive models and scoring data.
Figure 8 Databases, data warehousing, and Hadoop working together (inspired by a figure in the Gluent data platform white paper, 2017, https://gluent.com/wp-content/uploads/2017/09/Gluent-Overview.pdf).
Figure 9 Scatterplots of shoe size and height, weight and exercise, and shoe size and exercise.
Figure 10 Scatterplots of the likelihood of diabetes with respect to height, weight, and BMI.
Figure 11 (
a
) The best-fit regression line for the model “Diabetes = −7.38431 + 0.55593 BMI.” (
b
) The dashed vertical lines illustrate the residual for each instance.
Figure 12 Mapping the logistic and tanh functions as applied to the input
x
.
Figure 13 A simple neural network.
Figure 14 A neural network that predicts a person’s fitness level.
Figure 15 A deep neural network.
Figure 16 A decision tree for determining whether an email is spam or not.
Figure 17 Creating the root node in the tree.
Figure 18 Adding the second node to the tree.
Guide
Cover
Table of Contents