CHAPTER 67
Automated Machine Learning and Federated Learning

By Andreas Deppeler¹

¹Adjunct Associate Professor, National University of Singapore

Introduction

Artificial Intelligence (AI) is everywhere in financial services. Insurance marketing and underwriting models are based on behavioural micro-segmentation; call centre agents rely on natural language processing and voice transcription; banking compliance departments are using machine learning to reduce the number of “false positives” in anti-money laundering transaction monitoring. Nevertheless, financial services firms still seem to be slow in adopting AI.

There are three main reasons for this.

Shortage of Data

It is true that “the machine learning race is really a data race”.1 Even though financial services firms are traditionally data-rich, many struggle with outdated legacy architecture, organizational silos and poor data quality. Executives are now realizing that data is a strategic asset that needs to flow freely throughout the organization. Building a data-centric organization therefore means going beyond compliance with regulatory obligations like BCBS 239 or creating a chief data officer role: it means generating new business value by unlocking data.

Lack of Trust in AI

As a discipline, Machine Learning is still in its infancy. We speak about “Data Science”, but heuristic and iterative methods of algorithm selection and tuning resemble alchemy more than science.2 In the past 12–24 months, as companies’ unrestrained data collection practices have become better known, the public mood and intellectual discourse turned more sceptical and cautious.

Some augur a dystopian future in which corporations use AI prediction engines to steer human consumers towards “guaranteed commercial outcomes”.³ Others bring to light the failure of data protection law to protect data subjects from potentially discriminatory “inferential analytics”⁴ drawn from AI and big data. In recent months, financial regulators have started to publish guidelines on governance, accountability and risk management of AI for decision-making.⁵ Many expect high-risk AI model development to be subject to stringent regulatory and professional licensing standards in the future.

Shortage of Qualified Personnel

Although every major university has created data science and business analytics curriculums in recent years, industry demand for graduates still outstrips supply. Furthermore, the top AI talent seems to be drawn towards technology firms rather than traditional banks or insurance companies.

What can financial services firms do to accelerate their AI journeys? How can they build explainable models without relying on an army of data scientists, consultants or external vendors? How can they train models across larger data sets while keeping all data private and secure? Answers may be found in two recent AI innovations (automated machine learning and federated learning) that are presented in this chapter. This non-technical introduction will hopefully encourage readers to dive deeper into these technologies and adapt them to their own corporate environments.

Automated Machine Learning

The machine learning model development process can be broken down into four steps:

Data preparation: Collecting, cleaning and consolidating raw data from different sources into one file that can be analysed.

Feature engineering: Using technical and domain knowledge to construct additional variables that will improve model performance and accuracy.

Algorithm selection: Dozens of regression, classification, clustering and dimensionality reduction algorithms are available to choose from.6
Hyperparameter tuning: Optimizing the parameters for a given algorithm.

Depending on data infrastructure and quality, data preparation can take anywhere from a few days to several weeks. Feature engineering typically requires close collaboration and exploration between data scientists and domain experts. Algorithm selection and tuning is a time-consuming, subjective, repetitive and manual process – an art rather than science.

Automated machine learning promises to do all of this in a fraction of the time. The user drags and drops the training data files, selects the target variable (“What do I want to predict?”), defines the model category (e.g. regression or classification) and specifies runtime parameters. The platform autonomously performs data preparation and feature engineering, then iterates through a library of common models, tunes and trains each of them and ranks them by accuracy and speed. The user deploys a highly ranked model to production with the click of a button. The entire process can be completed in a few hours. Commercially available automated machine learning platforms offer modules for explainability and create standard documentation for model validation and model risk management.⁷

Automated machine learning increases the productivity of data scientists by reducing the time spent on mundane tasks of model development. Some firms are even deploying automated machine learning outside of data analytics functions, encouraging business users to experiment with AI. Over the next few years, off-the-shelf automated machine learning will likely establish itself as an efficient tool for developing low-risk models (e.g. marketing or prospecting), but data professionals will still be needed to develop and maintain high-risk models (e.g. underwriting or trading).

Federated Learning

Standard machine learning requires centralizing the training data on one machine or in a data centre. In federated learning, a shared global model is trained across many participating clients that keep their training data in local environments. Google coined the term “federated learning” in a paper 8 in February 2016, then wrote about it in their AI blog⁹ in April 2017 and made available an open-source software called TensorFlow Federated¹⁰ in March 2019. Another popular open-source library for encrypted, privacy-preserving deep learning is PySyft,¹¹ which is maintained by the OpenMined community.¹²

In federated learning, locally computed model training results (but no training data) are encrypted with a private key and sent to a central coordinating server. The server combines encrypted results from thousands of local models and only decrypts the average update, which is then used to improve a shared model. Once the shared model is trained and tested, all local models are updated.

Federated learning has been used for text and image prediction on phones and tablets, where training data are privacy sensitive. Soon, most smartphones¹³ and many IoT devices will be equipped with AI chipsets and connected through 5G. This will allow AI to move from the cloud and data centres to the “edge”. Federated learning allows those decentralized compute resources to train machine learning models in a privacy-preserving way. Potential applications include self-driving cars, industrial devices,¹⁴ smart buildings¹⁵ and medical diagnostics.¹⁶

In financial services, federated learning has been proposed for small business lending, anti-money laundering transaction monitoring and fraud detection.¹⁷ In a federated system, a shared model would be trained with aggregated model updates from several participating banks. Since each participant will have a slightly different data model, a trusted third party would be needed to standardize the model inputs and formally own and maintain the shared model.

Due to the high cost of compliance and the limited amount of good training data, the potential for federated learning in financial services is vast. The regulators will play a key role in encouraging the formation of industry consortia and in creating “sandbox” environments for experimentation and adaptation.

Notes

1 https://sloanreview.mit.edu/article/the-machine-learning-race-is-really-a-data-race/.
²www.sciencemag.org/news/2018/05/ai-researchers-allege-machine-learning-alchemy.
³Shoshana Zuboff (2019) The Age of Surveillance Capitalism, Public Affairs.
⁴Sandra Wachter and Brent Mittelstadt (2019) “A right to reasonable inferences: Re-thinking data protection law in the age of big data and AI”, Columbia Business Law Review, 1: https://ssrn.com/abstract=3248829.
⁵The “FEAT Principles” issued by the Monetary Authority of Singapore are a good example: https://www.mas.gov.sg/~/media/MAS/News%20and%20Publications/Monographs%20and%20Information%20Papers/FEAT%20Principles%20Final.pdf.
⁶A useful decision tree for finding the right algorithm can be found at https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html.
⁷Well-known providers of automated machine learning platforms are Google, Microsoft and DataRobot. An overview of current academic research can be accessed at www.ml4aad.org/automl/.
⁸https://arxiv.org/abs/1602.05629.
⁹https://ai.googleblog.com/2017/04/federated-learning-collaborative.html.
¹⁰https://medium.com/tensorflow/introducing-tensorflow-federated-a4147aa20041.
¹¹ https://github.com/OpenMined/PySyft.
¹²www.openmined.org/.
¹³Gartner predicts that by 2022, 80% of smartphones shipped will have on-device AI capabilities, up from 10% in 2017: www.gartner.com/en/newsroom/press-releases/2018-03-20-gartner-highlights-10-uses-for-ai-powered-smartphones.
¹⁴https://aws.amazon.com/greengrass/.
¹⁵https://conferences.oreilly.com/artificial-intelligence/ai-eu/public/schedule/detail/78152.
¹⁶www.technologyreview.com/s/613098/a-little-known-ai-method-can-train-on-your-health-data-without-threatening-your-privacy/.
¹⁷www.fedai.org/.