Table of Contents
Cover
Title Page
About the Authors
List of Figures
List of Tables
Preface
1 The Opportunity
1.1 Introduction
1.2 The Rise of Data
1.3 Realising Data as an Opportunity
1.4 Our Definition of Monetising Data
1.5 Guidance on the Rest of the Book
2 About Data and Data Science
2.1 Introduction
2.2 Internal and External Sources of Data
2.3 Scales of Measurement and Types of Data
2.4 Data Dimensions
2.5 Quality of Data
2.6 Importance of Information
2.7 Experiments Yielding Data
2.8 A Data‐readiness Scale for Companies
2.9 Data Science
2.10 Data Improvement Cycle
3 Big Data Handling, Storage and Solutions
3.1 Introduction
3.2 Big Data, Smart Data…
3.3 Big Data Solutions
3.4 Operational Systems supporting Business Processes
3.5 Analysis‐based Information Systems
3.6 Structured Data – Data Warehouses
3.7 Poly‐structured (Unstructured) Data – NoSQL Technologies
3.8 Data Structures and Latency
3.9 Data Marts
4 Data Mining as a Key Technique for Monetisation
4.1 Introduction
4.2 Population and Sample
4.3 Supervised and Unsupervised Methods
4.4 Knowledge‐discovery Techniques
4.5 Theory of Modelling
4.6 The Data Mining Process
5 Background and Supporting Statistical Techniques
5.1 Introduction
5.2 Variables
5.3 Key Performance Indicators
5.4 Taming the Data
5.5 Data Visualisation and Exploration of Data
5.6 Basic Statistics
5.7 Feature Selection and Reduction of Variables
5.8 Sampling
5.9 Statistical Methods for Proving Model Quality and Generalisability and Tuning Models
6 Data Analytics Methods for Monetisation
6.1 Introduction
6.2 Predictive Modelling Techniques
6.3 Pattern Detection Methods
6.4 Methods in practice
7 Monetisation of Data and Business Issues: Overview
7.1 Introduction
7.2 General Strategic Opportunities
7.3 Data as a Donation
7.4 Data as a Resource
7.5 Data Leading to New Business Opportunities
7.6 Information Brokering using Data
7.7 Connectivity as a Strategic Opportunity
7.8 Problem‐solving Methodology
8 How to Create Profit Out of Data
8.1 Introduction
8.2 Business Models for Monetising Data
8.3 Data Product Design
8.4 Value of Data
8.5 Charging Mechanisms
8.6 Connectivity as an Opportunity for Streamlining a Business
9 Some Practicalities of Monetising Data
9.1 Introduction
9.2 Practicalities
9.3 Special focus on SMEs
9.4 Special Focus on B2B Lead Generation
9.5 Legal and Ethical Issues
9.6 Payments
9.7 Innovation
10 Case Studies
10.1 Job Scheduling in Utilities
10.2 Shipping
10.3 Online Sales or Mail Order
10.4 Intelligent Profiling with Loyalty Card Schemes
10.5 Social Media: a Mechanism to Collect and Use Contributor Data
10.6 Making a Business out of Boring Statistics
10.7 Social Media and Web Intelligence Services
10.8 Service Provider
10.9 Data Source
10.10 Industry 4.0: Metamodelling using Simulated Data
10.11 Industry 4.0: Modelling Pricing Data in Manufacturing
10.12 Monetising Data in an SME
10.13 Making Sense of Public Finance and Other Data
10.14 Benchmarking who is the Best in the Market
10.15 Change of Shopping Habits Part I
10.16 Change of Shopping Habits Part II
10.17 Change of Shopping Habits Part III
10.18 Service Providers, Households and Facility Management
10.19 Insurance, Healthcare and Risk Management
10.20 Mobility and Connected Cars
10.21 Production and Automation in Industry 4.0
Bibliography
Glossary
Index
End User License Agreement
List of Tables
Chapter 02
Table 2.1 Typical internal and external data in information systems.
Table 2.2 Extract of sales data.
Table 2.3 Company sales data analytics.
Table 2.4 Internal sales data enriched with external data.
Table 2.5 Scales of measurement examples.
Table 2.6 Checklist for data readiness.
Chapter 04
Table 4.1 Confusion matrix for comparing models.
Chapter 05
Table 5.1 Partially tamed data.
Table 5.2 Outcomes of a hypothesis test.
Table 5.3 Typical significance borders.
Table 5.4 Examples of statistical tests.
Table 5.5 Example of a contingency table.
Table 5.6 Target proportions.
Table 5.7 Confusion matrix.
Table 5.8 Gains chart.
Table 5.9 Non‐cumulative lift and gains table.
Chapter 06
Table 6.1 Example of a contingency table.
Table 6.2 Analysis table for goodness of fit.
Chapter 08
Table 8.1 Business models for types of exchange.
Table 8.2 Business models for B2C selling.
Table 8.3 Business models for service providers.
Chapter 09
Table 9.1 Business model canvas of the comparisons between data brokers and insight innovators.
Chapter 10
Table 10.1 Summary of case studies.
Table 10.2 Risk scores in a simple case.
Table 10.3 Distribution of risk scores in different seasons.
Table 10.4 Allowable stress for soft impact.
Table 10.5 Parameters used to describe a four‐sided glass panel.
Table 10.6 Data dimensions and stakeholders.
List of Illustrations
Chapter 01
Figure 1.1 Where does big data come from?.
Figure 1.2 Big data empowers business.
Figure 1.3 Roadmap to success.
Figure 1.4 Wish list for generating money out of data.
Figure 1.5 Monetising data.
Chapter 02
Figure 2.1 Deming’s ‘Plan, Do, Check, Act’ quality improvement cycle.
Figure 2.2 Six Sigma quality improvement cycle.
Figure 2.3 Example of data maturity model.
Figure 2.4 Data improvement cycle.
Chapter 03
Figure 3.1 Big data definition.
Figure 3.2 Internet of things timeline.
Figure 3.3 Example data structure.
Figure 3.4 NoSQL management systems.
Figure 3.5 Big data structure and latency.
Chapter 04
Figure 4.1 Supervised learning.
Figure 4.2 Unsupervised learning.
Figure 4.3 The CRISP‐DM process.
Figure 4.4 The SEMMA process.
Figure 4.5 General representation of the data mining process.
Figure 4.6 Time periods for data mining process.
Figure 4.7 Stratified sampling.
Figure 4.8 Lift chart for model comparison.
Figure 4.9 Lift chart at small scale.
Figure 4.10 An example of model control.
Chapter 05
Figure 5.1 Raw data from a customer transaction.
Figure 5.2 Bar chart of relative frequencies.
Figure 5.3 Example of cumulative view.
Figure 5.4 Example of a Pareto chart.
Figure 5.5 Example of a pie chart.
Figure 5.6 Scatterplot of company age and auditing behaviour with LOWESS line.
Figure 5.7 Scatterplot of design options.
Figure 5.8 Ternary diagram showing proportions.
Figure 5.9 Radar plot of fitness panel data.
Figure 5.10 Example of a word cloud.
Figure 5.11 Example of a mind map.
Figure 5.12 Location heat map.
Figure 5.13 Density map for minivans.
Figure 5.14 SPC chart of shipping journeys.
Figure 5.15 Decision tree analysis for older workers.
Figure 5.16 Gains chart.
Figure 5.17 Lift chart.
Figure 5.18 ROC curve development during predictive modelling.
Chapter 06
Figure 6.1 Example of logistic regression.
Figure 6.2 Corrected logistic regression.
Figure 6.3 Decision tree.
Figure 6.4 Artificial neural network.
Figure 6.5 Bayesian network analysis of survey data.
Figure 6.6 Bayesian network used to explore what‐if scenarios.
Figure 6.7 Plot of non‐linear separation on a hyperplane.
Figure 6.8 Dendrogram from hierarchical cluster analysis.
Figure 6.9 Parallel plot from K‐means cluster analysis.
Figure 6.10 Kohonen network with two‐dimensional arrangement of the output neurons.
Figure 6.11 SOM output.
Figure 6.12 T‐SNE output.
Figure 6.13 Correspondence analysis output: scatterplot of RPC2 vs RPC1, the two principal dimensions showing how the row profiles in a contingency table differ from each other.
Figure 6.14 Association rules.
Figure 6.15 Association analysis of products.
Figure 6.16 Comparison of customer base and population.
Figure 6.17 Relationship between energy usage and deprivation: scatterplot of mean AQ vs percentage of households deprived.
Figure 6.18 Map showing prices.
Chapter 07
Figure 7.1 Strategic opportunities.
Figure 7.2 How data can boost top‐ and bottom‐line results.
Figure 7.3 Typical data request.
Figure 7.4 Observed data and usage.
Figure 7.5 Maslow’s hierarchy of needs.
Figure 7.6 Data sources to empower consumer business.
Figure 7.7 Ready information on market opportunities.
Figure 7.8 Word cloud from keyword occurrences.
Figure 7.9 Using different data sources for analytics.
Figure 7.10 Daily sleep patterns.
Figure 7.11 Predictive analytics in insurance.
Chapter 08
Figure 8.1 Pathways to monetising data.
Figure 8.2 Segmentation features of walk‐in customers.
Figure 8.3 Business opportunities.
Chapter 09
Figure 9.1 Paths to monetisation.
Figure 9.2 Pareto diagram of customer compliments.
Figure 9.3 Graphical dashboard.
Figure 9.4 Decrypting the DNA of the best existing customers.
Figure 9.5 Aspects of digital maturity.
Figure 9.6 Closed loop of B2B customer profiling – continuous learning.
Figure 9.7 Automated B2B lead generation system.
Figure 9.8 New methods, new insights, smart business.
Figure 9.9 Misleading scatterplots.
Figure 9.10 Scatterplot with multiple features.
Figure 9.11 Histogram of suspicious‐quality recordings.
Chapter 10
Figure 10.1 The evolution of data analytics
Figure 10.2 Cumulative distribution of risk scores.
Figure 10.3 Data sources in the shipping industry.
Figure 10.4 Optimum speed recommendation.
Figure 10.5 Pruned decision tree.
Figure 10.6 Detail from decision tree
Figure 10.7 Customised communication.
Figure 10.8 Individualised communication.
Figure 10.9 Complexity of data mining steps.
Figure 10.10 Data in the customer journey.
Figure 10.11 Intelligent profiles and segments in B2C.
Figure 10.12 Personalised journey.
Figure 10.13 The reach of social media.
Figure 10.14 The power of social media.
Figure 10.15 Using peer group behaviour.
Figure 10.16 National statistics oil prices.
Figure 10.17 Example of reports portal
Figure 10.18 Making a business out of boring statistics.
Figure 10.19 Right place, right time.
Figure 10.20 Social media information summarised.
Figure 10.21 Visualisation of user engagement.
Figure 10.22 Concept of newsletter tracking.
Figure 10.23 Example report on testing different versions.
Figure 10.24 Customer profile details.
Figure 10.25 Company profile details.
Figure 10.26 Example of glass facades in buildings.
Figure 10.27 Half normal plot of a screening experiment.
Figure 10.28 Predicted vs calculated resistance factor with validation.
Figure 10.29 Residual plot of prices.
Figure 10.30 Visualisation of groups of products.
Figure 10.31 Open data available to enrich company data.
Figure 10.32 Diffusion map showing clusters of shares.
Figure 10.33 Sampling approach for benchmarking in China.
Figure 10.34 Three‐step approach to survey analytics.
Figure 10.35 Skateboard offer.
Figure 10.36 Customer journey.
Figure 10.37 Example of customer segments.
Figure 10.38 Virtual changing room.
Figure 10.39 Virtual supermarket at bus stop.
Figure 10.40 Input from miscellaneous IoT sensors.
Figure 10.41 Appealing sleep sensor display.
Figure 10.42 Sensors connected by mobile phone.
Figure 10.43 The connected car.
Figure 10.44 The new connected eco‐system.
Figure 10.45 Industry 4.0.
Figure 10.46 Industry 4.0 in action.
Guide
Cover
Table of Contents
Begin Reading
Pages
iii
iv
v
xi
xiii
xiv
xv
xvi
xvii
xix
xx
1
2
3
4
5
6
7
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
357
358
359
360
361
362
363