10 Free Willy

On the days when Alex worked alone at the Middleton Theater, he sold customers their 99¢ tickets outside and their Mountain Dews inside, and then scrambled up to the projection booth to start the movie. Up in the booth, the film ran from one enormous rotating platter, through rollers and across the projection lens, to an identical receiving platter that wound it inside out for the next run. One day, after Alex had started Free Willy and was back in the lobby dusting the Good & Plenty boxes, a customer showed up to say the screen had been blank for ten minutes. Rushing upstairs, Alex found the platter pouring film onto a hopeless tangle on the floor. In a panic, he cut the entire tangle out of the film and respliced it, and for the rest of its run Free Willy began ten minutes into the movie. Although the customers didn’t seem to care, they surely noticed the abrupt jump from the middle of a local advertisement for a “Murcree” car dealership to a scene of Willy the orca swimming around.

The change was sudden, and it illustrates two points about the future of cultural evolution. First, a gap in an expected narrative upsets our working memory, which is an essential aspect of cultural capacity, and this becomes an issue as algorithms and artificial intelligence take on more prominent roles in cultural transmission. Chimpanzees have enough working memory to make complex tools and maintain rudimentary behavioral traditions, such as remembering how to use a tool they haven’t seen in years, but humans can use memory for much, much more, such as remembering subprocedures embedded within larger sequences that are themselves embedded in cultural memory. People can carry out complex activities on timescales ranging from a few seconds, such as solving simple algebraic problems, to many years, such as raising children.

Second, feeling disjointed, as with moviegoers watching an ad for a Mercury car dealer in one moment and an orca swimming around in the next, has become our way of life. As Thomas Friedman put it in Thank You for Being Late, technology may already be changing faster than human behaviors, laws, institutions, and customs can adapt. This isn’t generational change, as Alvin Toffler described in Future Shock, but rather intragenerational change. It’s happening in all three elements of cultural evolution—variation, transmission, and sorting—through media that are both diverse and rapidly changing. A 2014 survey of US teenagers, for example, still ranked Facebook as the national favorite social medium, but it was already being challenged by Instagram, Snapchat, Vine, Tumblr, and newer platforms. And that doesn’t count messaging apps such as WhatsApp and Viber, which collectively have more users than those big social media networks. Rising and falling in popularity, each new social media or messaging platform imposes its own biases on variation, transmission, and sorting, each leaning toward the company’s goals as much as the user’s.

If we take a step back, this unsettling flux represents a transition between modes of cultural evolution. We’d like to use an analogy that is helpful as long as it is not taken too literally. Let’s think of the memory component of transmission as the depth of water in an ocean—shallow in places and deep in others. Our ocean has three different kinds of animals inhabiting it, each representing a mode of cultural evolution: past, present, and future. Bluefin tuna are in the deep end, massive schools of herring are in the shallow end, and swimming throughout the ocean we have orcas. Let’s take a closer look at our ocean dwellers.

Bluefins and Herring

Bluefins represent local traditions stretching back deep in time. They can dive down to a kilometer or so, they travel in relatively small schools, and they can collectively remember things such as distant migratory locations. Also, like traditional cultures, bluefins are in danger of disappearing. Herring, on the other hand, move in massive schools—as large as millions of fish—and spend time in shallower, coastal waters. Similarly, within a shallow time depth, algorithms guide human followers like schools of herring, using popularity as a beacon. Tank experiments show how a robotic fish moving unwaveringly in one direction can lead a school of real fish with it, just as we saw in chapter 1 with Ian Couzin’s birds. Primates can be led the same way: a few baboons can lead the whole troop to a new food patch if those few travel in the same direction. Animal scientists call it directional agreement. Marketers want the same result and employ data-mining companies to amass thousands of data points per person, aiming for feedback among targeted advertising, human response, and more finely targeted advertising to lead consumers in a certain direction.

Clearly, the more that algorithms facilitate human communication and decisions, the more profoundly they change both the tempo and mode of cultural evolution. For online choices, a popularity bias is driven by search algorithms, which effectively rank options by popularity or network centrality. In social media, positive ratings are a prime currency. Popularity increasingly outranks quality. Whether it’s a hotel room, an investment, or even a scientific theory, people are more likely to endorse something once others have endorsed it. Sure, quality is part of a rating, but if people are copying each other’s errors—which can be off by an order of magnitude, as when we think a quantity is in the hundreds when it is really in the thousands—the errors do not cancel out. Rather, they feed back multiplicatively into the crowdsourcing algorithms. In 2013, for instance, Google Flu Trends overestimated influenza prevalence—its model assumes people Google flu-related terms from their direct experience—because many people were Googling what other people were Googling, which in turn led Google to suggest those search terms, and so on.
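To see why copied errors compound rather than cancel, consider a minimal sketch in Python (the quantity, error sizes, and copying rate are invented for illustration). When every member of a crowd guesses independently, the crowd’s median lands near the true value; when most members simply copy an earlier answer, as popularity-ranked suggestions encourage, the crowd echoes whichever early guesses happened to spread, right or wrong.

```python
import random
import statistics

random.seed(42)

TRUE_VALUE = 1000       # the quantity everyone is trying to estimate
N_AGENTS = 10_000
COPY_PROB = 0.9         # chance an agent copies an earlier answer instead of guessing

def noisy_guess():
    # Independent guesses are unbiased on a log scale but can be off
    # by a large multiplicative factor (an order-of-magnitude error).
    return TRUE_VALUE * random.lognormvariate(0, 1.0)

def crowd_estimate(copy_prob):
    answers = [noisy_guess()]          # the first agent has no one to copy
    for _ in range(N_AGENTS - 1):
        if random.random() < copy_prob:
            # Copying an earlier answer at random means already-common answers
            # are the most likely to be copied again: a popularity feedback.
            answers.append(random.choice(answers))
        else:
            answers.append(noisy_guess())
    return statistics.median(answers)

print("independent crowd's estimate:", round(crowd_estimate(0.0)))
print("copying crowd's estimate:    ", round(crowd_estimate(COPY_PROB)))
```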

In cultural evolution, a shallow time depth means freedom from the deeper past, which often allows more turnover and drift. In 1960, David was the most popular name for boys in virtually all states west of the Mississippi River, with Michael, James, Robert, and John rounding out the top five in almost every state. Now, baby names in the United States are freely chosen—no longer traditional or inherited—and the invention rate has tripled in the last several decades. Turnover in the top hundred is rapid. Sublists guide parents to just the right name for their social group or region—like Addison and Beulah among the best southern names. This has balkanized the landscape of naming. By 2015, neither of the top two boys’ names in Wisconsin—Oliver and Owen—was among the top thirty in California. This is typical of ecological drift. Geographic dialects, for example, evolve among the songs of birds, which copy each other with changes arising through recombination, invention, or errors.
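The same random-copying logic can be sketched directly. Here is a minimal neutral drift model of naming (the population size, invention rate, and number of generations are invented for illustration): each newborn copies an existing name or, rarely, receives a brand-new one, and the top of the chart turns over even though no name is better than any other.

```python
import random
from collections import Counter
from itertools import count

random.seed(1)

POP_SIZE = 1000     # babies named each generation
MU = 0.02           # chance a parent invents a brand-new name instead of copying
TOP_N = 10          # size of the "top names" chart we track
GENERATIONS = 100

new_labels = count()

def next_generation(current):
    babies = []
    for _ in range(POP_SIZE):
        if random.random() < MU:
            babies.append(f"new-{next(new_labels)}")   # rare invention
        else:
            babies.append(random.choice(current))      # random copying (drift)
    return babies

def top_names(pop):
    return {name for name, _ in Counter(pop).most_common(TOP_N)}

population = [f"name-{i}" for i in range(100)] * (POP_SIZE // 100)
original_top = top_names(population)

for gen in range(1, GENERATIONS + 1):
    population = next_generation(population)
    if gen % 20 == 0:
        replaced = len(original_top - top_names(population))
        print(f"after {gen:3d} generations: {replaced}/{TOP_N} of the original top names replaced")
```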

Likewise, drift in social media content creates polarized groups. The silos of fake news on social media suggest linked, drifting ideas that bundle into identities. A “vast satellite system” of fake news sites now surrounds mainstream media sites, as Jonathan Albright of Elon University described it. Algorithms help this happen: because the network of fake news sites is densely interlinked, each site is already well connected, which boosts it in Google’s PageRank algorithm, an algorithm that prioritizes sites by network connectivity and popularity rather than by validity.
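The ranking logic can be seen in a toy version of PageRank. The sketch below runs a simplified power iteration on a made-up network of six sites; the three densely interlinked sites end up with the highest scores regardless of whether anything they publish is true.

```python
import numpy as np

# A made-up web of six sites: sites 0-2 link heavily to one another,
# while sites 3-5 link out but receive no links in return.
links = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1],
    3: [0],
    4: [1],
    5: [2],
}

N = len(links)
DAMPING = 0.85

# Transition matrix: entry [j, i] is the chance of following a link from i to j.
M = np.zeros((N, N))
for i, outlinks in links.items():
    for j in outlinks:
        M[j, i] = 1.0 / len(outlinks)

# Power iteration: repeatedly redistribute rank along the links.
rank = np.full(N, 1.0 / N)
for _ in range(100):
    rank = (1 - DAMPING) / N + DAMPING * M @ rank

for site, score in sorted(enumerate(rank), key=lambda pair: -pair[1]):
    print(f"site {site}: rank {score:.3f}")
```

Nothing in the computation looks at content; connectivity alone decides the order.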

Similarly, social media and crowdsourcing may be making schools of scientific thought more herring-like. Swimming in shallower waters, in terms of ever more recent scientific bibliographies, scientists are increasingly crowdsourcing their attention through social media feeds. Algorithms used by Google Scholar, Mendeley, ResearchGate, and Scizzle feed articles to scientists through a personalized balance between the scientist’s topical interests and their social and citation networks. Other scientists code their own Twitterbots to automatically scan for articles with specialized keywords—in the process attracting hundreds of scientists as followers.

To counterbalance this, the journal Nature advised scientists to “go to seminars and meetings,” and it quoted a young scientist who opined, “Weekly events can help bring people out of their offices, [and] create a sense of community.” If it seems remarkable that anyone would need to be reminded of this, that’s because science has already moved decisively toward virtual collaboration among scientists, their algorithms, and the entire scientific record.

Orcas

Orcas are perfect models for future knowledge grabbers: intelligent, selective hunters that, working individually or collaboratively, choose their prey from anywhere and at any depth down to their absolute limit. Let’s see how these qualities might apply to scientists. Ideally, open, online collaboration would move scientists toward building on the latest and most relevant science and away from schooling, like herrings, around citation statistics and network links. An encouraging model is GitHub, the Wikipedia of software design, where thousands of developers openly collaborate on projects. Fueled mainly by validating comments from their peers (“Good job!”), the GitHub community dives deeply and even does free projects for giants such as Microsoft and Hewlett-Packard. On GitHub, “ultimately you have an expert—the person who wrote the original program,” noted Thomas Friedman, “who gets to decide what to accept and what to reject.” With the best ideas vetted by experts, GitHub exemplifies the Tasmania model we looked at in chapter 9, in which progress is accelerated by a large population size. The massive global population of orca-like GitHub developers can complete projects in a fraction of the time that it might take a team of paid herring-like employees.

As our allegorical orcas dive deep for prey, they encounter millions of scientific articles lying in their cold underwater tomb. As we saw in chapter 9, many of those articles, especially the older ones, have never even been cited. Virtually any article, no matter how obscure or old, can turn up in an orca’s search. “Nothing in the past is lost. … [E]verything exists on one plane,” wrote poet and journalist Dan Chiasson, so if you are a researcher with brilliant but uncited articles, take heart, because it’s a good bet that they will one day be brought to the surface. Knowledge evolution, drawing on digital information both globally and historically, will be aided by artificial intelligence that selects for well-specified qualities. If open science adopts this kind of expert selection, science will shift up a gear. Following the release of the gene-editing technology CRISPR in 2014, Kevin Esvelt of MIT saw open science as morally imperative, especially as humans begin engineering the evolution of animals, insects, plants, microorganisms, and possibly even themselves.

A fully open science can also study itself in order to optimize its own evolution. Forecasting the future citations of new medical papers, for example, can help predict whether a drug approved by the Food and Drug Administration might emerge a decade later. Deeper insights will come through meta-analyses of text-mined scientific publications. Biomedical researchers at the University of Manchester, for instance, are looking for disciplinary trends and mapping the flow of information between disciplines. Identifying such networks can highlight key new areas of research—potentially those that an algorithm could use to generate a new hypothesis. “Algorithms will build on algorithms,” promised a trade magazine at Hewlett-Packard, “with every prediction smarter than the last.” Predictive algorithms learn through a process called supervised learning, in which thousands of successive estimates are checked against the correct answers, with the model parameters adjusted after each trial to improve the estimate incrementally. In spirit, this echoes Bayesian updating: each new piece of evidence refines the previous estimate.
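A minimal sketch of that trial-by-trial adjustment looks like the following (the training data are invented, generated here from a known rule so the result can be checked): each labeled example is compared against the model’s estimate, and the parameters are nudged slightly toward the correct answer.

```python
import random

random.seed(0)

# Hypothetical training data: the "correct answers" come from a known rule
# (y = 3x + 2) plus a little noise, so we can check what the model learns.
xs = [random.uniform(-1, 1) for _ in range(1000)]
data = [(x, 3 * x + 2 + random.gauss(0, 0.1)) for x in xs]

w, b = 0.0, 0.0          # model parameters, starting from an ignorant guess
LEARNING_RATE = 0.1

for x, y_true in data:           # one trial per labeled example
    y_est = w * x + b            # the model's current estimate
    error = y_est - y_true       # check the estimate against the correct answer
    # Adjust each parameter slightly in the direction that shrinks the error.
    w -= LEARNING_RATE * error * x
    b -= LEARNING_RATE * error

print(f"learned w = {w:.2f}, b = {b:.2f}  (the generating rule used w = 3, b = 2)")
```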

The algorithmic approach needn’t be restricted to published literature; it can also study real people. It could use a platform such as Amazon’s Mechanical Turk, which now hosts over twenty thousand online participants per month on experiments ranging from rating facial attractiveness to studies of generosity and religiosity. Online social research already has global reach, as over half the world’s population has mobile phones. Machine learning can infer much from basic phone data, even the personal wealth of someone in a developing country. Researchers recently compared billions of interactions on Rwanda’s largest mobile phone network to phone surveys that provided direct estimates of personal wealth. Machine learning then estimated that wealth from each person’s phone contacts, the volume and timing of calls and texts, and geolocations. It could even predict whether a person owned a motorcycle or had electricity in the house.

Anonymous phone data could also be used to predict conflicts. No conversation content is needed. Instead, all you need is the timing of events, as they tend to accelerate in a predictable pattern, like a ball dropped on the floor: bop, … bop, … bop, bop, bopitybopitybopbopbop. As a conflict escalates, the shrinking time intervals between successive events—whether years, days, or seconds—fall in proportion to the event’s numerical order raised to a negative exponent, called the escalation parameter. The method has been developed on data sets from the ground and online, covering escalations of warfare as well as online discussions preceding an attack or civil unrest. It even applies to a fight at the family dinner table (you can verify this with a stopwatch and Will Ferrell’s classic “I Drive a Dodge Stratus!” skit on YouTube).
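In symbols, if the first interval is tau_1, then the interval before event n + 1 is roughly tau_1 times n raised to the power of -b, where b is the escalation parameter. The sketch below, with invented numbers, generates a sequence of shrinking intervals from that rule and then recovers b with a straight-line fit on log-log scales.

```python
import math

TAU_1 = 30.0   # days between the first and second event (invented for illustration)
B = 0.8        # escalation parameter: larger b means faster acceleration

# Interval before event n + 1 shrinks as tau_n = tau_1 * n ** (-b).
intervals = [TAU_1 * n ** (-B) for n in range(1, 11)]
print("intervals (days):", [round(t, 1) for t in intervals])

# Recover the escalation parameter from the intervals: on log-log scales,
# log(interval) falls along a straight line in log(event order) with slope -b.
xs = [math.log(n) for n in range(1, 11)]
ys = [math.log(t) for t in intervals]
mean_x, mean_y = sum(xs) / len(xs), sum(ys) / len(ys)
slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
        sum((x - mean_x) ** 2 for x in xs)
print("estimated escalation parameter b:", round(-slope, 2))
```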

After wealth and conflict, the next step is to predict health. The future promises many marvels. Hewlett-Packard says that by 2030, your embedded microchips will alert you when it’s time to 3-D-print yourself a new kidney. This is not so farfetched. Even now, a Google user’s search activity—for certain diagnostic symptoms of, say, the onset of diabetes or a chronic condition—can reveal a developing health problem before it is even known to the user. Governments are interested too, of course, at the public scale. In the United Kingdom in 2015, the National Health Service agreed to share millions of personal health records with DeepMind, the Google-owned company that developed a neural network called a “differentiable neural computer” that learns to understand narratives, analyze networks, and solve complex logistical problems.

Speaking of which, did we mention that the orca has a huge brain? Neural networks aim to solve problems the same way a human brain would, with layered networks that discover patterns of patterns. An image of a face might enter the input layer and be passed through layers of intermediate representations, such as the edges that make a shape and then the shapes that make a face, before reaching the response layer. A dynamic neural network learns by rewiring itself to reinforce the millions of neurons that activated—“voted”—for the correct answer. And given that the number of connections in a neural network grows with the square of the number of nodes, its pattern recognition can become much more granular very quickly.
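A minimal forward pass through such a layered network, with random placeholder weights standing in for anything actually learned, looks like this (the layer sizes are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# A tiny feed-forward network: 64 "pixel" inputs -> 16 edge-like features
# -> 8 shape-like features -> 2 outputs (say, "face" versus "not a face").
# The weights are random placeholders; training would adjust them.
layer_sizes = [64, 16, 8, 2]
weights = [rng.normal(0, 0.5, size=(n_in, n_out))
           for n_in, n_out in zip(layer_sizes, layer_sizes[1:])]

def forward(x):
    """Pass an input through each layer, building patterns of patterns."""
    activation = x
    for w in weights:
        activation = np.tanh(activation @ w)   # weighted sum, then a nonlinearity
    return activation

image = rng.random(64)            # a stand-in for a flattened 8-by-8 image
print("output layer:", forward(image))

# Fully connecting two layers of n nodes each takes n * n weights, which is
# why connection counts grow roughly with the square of the layer size.
print("total connections:", sum(w.size for w in weights))
```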

Nevertheless, we have not yet built machines that truly reason like people. Researchers at MIT’s Center for Brains, Minds, and Machines say artificial intelligence needs to move from mere pattern recognition—no matter how sophisticated or fast—to causal explanation, which we covered in chapter 8. Many of the advances in artificial intelligence have come about through playing games, which requires a great deal of supervised trial-and-error learning with feedback about the correct answer. To beat a player at Go, for example—even one who’s new to the game—a deep neural network must first observe millions of moves by expert players and play millions of practice games. Facebook’s deep convolutional network needs thousands of examples to judge how a tower of just a few toy blocks will fall, something a child would know intuitively.

In game-playing terms, artificial intelligence is still chess-like, trained to optimize the long-term reward of a particular action in a particular situation. It struggles to interpret novel input, such as reading a new style of handwriting. It cannot easily generalize or combine simple elements into complex concepts with infinite possibility—a feature of human thought and language known as compositionality. To make conversation, a neural network predicts the next sentence based on the previous one. In this sense it recalls MIT’s “Eliza” program of the 1960s, which faked its way through a conversation in “phrases tacked together like the sections of a prefabricated henhouse,” as George Orwell once described the language of politicians. A half century later, the neural network shows more originality. “What is immoral?” Google researchers asked their neural conversational machine. “The fact that you have a child,” it replied, somewhat ominously. To be fair, it and other intelligent personal assistants, like Amazon’s Alexa, are designed to answer customer service questions or sell products, not to make original conversation compositionally or develop causal explanations about the world.

Compared to present artificial intelligence—distant future versions may be reading this and “laughing”—humans learn a lot more from much less. Learning both individually and socially, children can isolate variables and test causal hypotheses. Children who are taught how to learn, such as at Montessori schools, acquire measurable advantages in language, math, creativity, social interaction, and understanding. Humans can generalize explanatory concepts from just a few examples. To get closer to human creativity and flexibility of reasoning, artificial intelligence must become compositional and thus be able to generalize rather than simply look up each answer from an encyclopedic reference set.

This is precisely the goal of researchers who are experimenting with stochastic programs that can parse objects and goals into their essential components and then recombine them into new concepts and larger goals. This brings us back to the importance of memory. A breakthrough for DeepMind’s neural computer came about through the integration of external read-write memory with the powerful neural network. This allows the computer to represent and manipulate complex data structures and, like a neural network, to learn from the data. Just as humans have better working memory than chimpanzees, memory may be the key to humanlike artificial intelligence. Science fiction knows this. At the end of 2001: A Space Odyssey, HAL becomes less human as his memory is unplugged. In HBO’s Westworld, robots acquire human reasoning through their growing retention of personal memories.
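A toy sketch of the external-memory idea, far simpler than DeepMind’s differentiable neural computer and not its actual design, stores key and value pairs in a small memory matrix and retrieves them by content: a query key is compared against every stored key, and a softmax over the similarities blends the stored values (the keys and values here are hand-picked for illustration).

```python
import numpy as np

SLOTS, KEY_DIM, VAL_DIM = 8, 4, 3
keys = np.zeros((SLOTS, KEY_DIM))      # key half of each memory slot
values = np.zeros((SLOTS, VAL_DIM))    # value half of each memory slot
next_slot = 0

def write(key, value):
    """Store a (key, value) pair in the next free slot."""
    global next_slot
    keys[next_slot], values[next_slot] = key, value
    next_slot += 1

def read(query):
    """Content-based read: a softmax over key similarity blends the stored values."""
    sims = keys @ query                          # how well each slot matches the query
    weights = np.exp(sims) / np.exp(sims).sum()  # soft attention over the slots
    return weights @ values

# Store two items under distinct, hand-picked keys, then recall each by key alone.
key_a = np.array([5.0, 0.0, 0.0, 0.0])
key_b = np.array([0.0, 5.0, 0.0, 0.0])
write(key_a, np.array([1.0, 0.0, 0.0]))
write(key_b, np.array([0.0, 1.0, 0.0]))

print("recall for key_a:", np.round(read(key_a), 2))   # close to [1, 0, 0]
print("recall for key_b:", np.round(read(key_b), 2))   # close to [0, 1, 0]
```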

As memory and artificial intelligence are integrated, however, and searchable digital records bring everything to one plane, we need to remember how to forget, as Elvis sang in 1955. Our metaphorical orca is not a fish but rather a mammal, and it occasionally comes up for air and to clear its mind. At the population scale, forgetting re-sorts existing variation, cleans the slate, and starts a new phylogenetic branch. We may owe our cultural modernity to this. About seventy-five thousand years ago, the volcanic eruption of Toba, in Sumatra, blanketed southern Asia in ash and might have left fewer than ten thousand people on the planet. Some paleoanthropologists believe the Upper Paleolithic era emerged out of Toba’s ashes, with art, modern behaviors, and new technologies, all of which form the cultural foundation of humanity.

How to forget becomes a pertinent question as memory and artificial intelligence become more integrated—literally, as machines begin to make decisions. Neuroadaptive technology already exists that can learn to interpret simple human intentions directly from brain activity. A person moves a cursor on the screen of a computer that is simultaneously analyzing the person’s brain activity in real time, at five hundred hertz, through dozens of electrodes arrayed around the scalp; through trial and error, the neuroadaptive system learns to translate that brain activity directly into the intended movement of the cursor. In short, the computer literally reads the person’s mind. As with artificial intelligence in general, however, the question is, How big is the gap—in this case, between simple intentions, such as moving a cursor, and real thought and causal explanation?

Along Come Mice

In closing, we’re reminded that evolution is simply too unpredictable for us to forecast its future in any detail. As Walt Disney said in 1954 on What Is Disneyland, “I only hope we never lose sight of one thing: It was all started by a mouse.” Steve Jobs might have said the same thing, except that Xerox had tried it before him. Remember the Xerox 8010? We didn’t think so. The point is, no one could have guessed the trajectories that entertainment and the computer industry would take after those mice came along. They were products of countless cartoon characters and technological devices that came before them, and which one changed everything is only clear in hindsight.

One thing can be predicted with certainty, though: it’s impossible to know exactly what cultural game changer is on the horizon. The best you can do is try to survey the pool of variation, but that’s a tall order—an impossible one, actually. Our suggestion, not surprisingly, is to start with technology. For an entry point, you might browse the annual Consumer Electronics Show (now CES), which bills itself as “the launch pad for new innovation and technology that has changed the world.” Just a few years ago, CES was awash with gadgets that connect to the Internet (and each other)—the “Internet of Things”—but in 2017, the show was, as Forbes noted, all “about making more things that create and use intelligence.”

This leads us to a final question: Will technological and cultural evolution continue accelerating indefinitely, or is there a terminal velocity—a point of resistance that causes a “deceleration”? Already in 2017, for example, Facebook and European governments were planning measures to curtail the spread of fake news, and surely artificial intelligence will be used—a case where selection is being bumped up in its balance with variation and transmission. As we wonder whether people will one day marry their loving, artificially intelligent assistant, as in the movie Her, we can also think about general evolutionary processes. To anticipate the future of cultural evolution, think about populations, not individuals, and certainly not yourself. How will variation, transmission, and selection be affected? What feedbacks will arise or be eliminated? Rather than latch onto a single prediction about what the future of culture holds, think like a Bayesian: How does this change the landscape of probabilities? What wave should we be surfing now, and how will we find the next wave after that? Willy and his orca friends call this fun, and you should, too.