3 From Maxwell to the Internet

A number of important technologies developed rapidly in the nineteenth and twentieth centuries, including electricity, electronics, and computers, but also biotechnology, nanotechnologies, and technologies derived from them. In fact, it is the exponential rate of development of these technologies, more than anything else, that has resulted in the scientific and economic developments of recent decades. In those technologies and in others, that rate is likely to increase.

Four Equations That Changed the World

Phenomena related to electricity—for example, magnetism, lightning, and electrical discharges in fish—remained scientific curiosities until the eighteenth century. Although other scientists had studied electrical phenomena before him, the first extensive and systematic research on electricity was done by Benjamin Franklin in the eighteenth century. In one experiment, reputedly conducted in 1752, Franklin used a kite to capture the electrical energy from a storm.

After the invention of the battery (by Alessandro Volta, in 1800), it was practicable to conduct a variety of experiments with electrical currents. Hans Christian Ørsted and André-Marie Ampère discovered various aspects of the relationship between electric currents and magnetic fields, but it was Michael Faraday’s results that made it possible to understand and harness electricity as a useful technology. Faraday performed extensive research on the magnetic fields that appear around an electrical current and established the basis for the concept of the electromagnetic field. He became familiar with basic elements of electrical circuits, such as resistors, capacitors, and inductors, and investigated how electrical circuits work. He also invented the first electrical motors and generators, and it was largely through his efforts that electrical technology came into practical use in the nineteenth century. Faraday envisioned some sort of field (he called it an electrotonic state) surrounding electrical and magnetic devices and imagined that electromagnetic phenomena are caused by changes in it.

However, Faraday had little formal mathematical training. It fell to James Clerk Maxwell to formulate the equations that control the behavior of electromagnetic fields—equations that now bear his name. In 1864 and 1865, Maxwell (who was aware of the results Faraday had obtained) published his results suggesting that electric and magnetic fields were the bases of electricity and magnetism and that they could move through space in waves. Furthermore, he suggested that light itself is an electromagnetic wave, since it propagates at the same speed as electromagnetic fields. The mathematical formulation of his results resulted in a set of equations that are among the most famous and influential in the history of science (Maxwell 1865, 1873).

In its original form, presented in A Treatise on Electricity and Magnetism, Maxwell’s formulation involved twenty different equations. (Maxwell lacked the tools of modern mathematics, which enable us to describe complex mathematical operations in simple form.) At the time, only a few physicists and engineers understood the full meaning of Maxwell’s equations. They were met with great skepticism by many members of the scientific community, among them William Thomson (later known as Lord Kelvin). A number of physicists became heavily involved in understanding and developing Maxwell’s work. The historian Bruce Hunt dubbed them the Maxwellians in a book of the same name (Hunt 1991).

One person who soon understood the potential importance of Maxwell’s work was Oliver Heaviside. Heaviside dedicated a significant part of his life to the task of reformulating Maxwell’s equations and eventually arrived at the four equations now familiar to physicists and engineers. You can see them on T shirts and on the bumpers of cars, even though most people don’t recognize them or have forgotten what they mean. On Telegraph Avenue in Berkeley, at the Massachusetts Institute of Technology, and in many other places one can buy a T shirt bearing the image shown in figure 3.1. Such shirts somehow echo Ludwig Boltzmann’s question “War es ein Gott der diese Zeichen schrieb?” (“Was it a god who wrote these signs?”), referring to Maxwell’s equations while quoting Goethe’s Faust.

10998_003_fig_001.jpg

Figure 3.1 The origin of the world according to Oliver Heaviside’s reformulation of Maxwell’s equations and a popular T shirt.

Hidden in the elegant, and somewhat impenetrable, mathematical formalism of Maxwell’s equations are the laws that control the behavior of the electromagnetic fields that are present in almost every electric or electronic device we use today. Maxwell’s equations describe how the electric field (E) and the magnetic field (B) interact, and how they are related to other physical entities, including charge density (ρ) and current density (J). The parameters ε₀ and μ₀ are physical constants called the permittivity and permeability of free space, respectively, and they are related by the equation

ε₀μ₀ = 1/c²,

where c is the speed of light.

A field is a mathematical construction that has a specific value for each point in space. This value may be a number, in the case of a scalar field, or a vector, with a direction and intensity, in the case of a vector field. The electric and magnetic fields and the gravitational field are vector fields. At each point in space, they are defined by their direction and their amplitude. One reason Maxwell’s work was difficult for his contemporaries to understand was that it was hard for them to visualize or understand fields, and even harder to understand how field waves could propagate in empty space.

The symbol ∇ represents a mathematical operator that has two different meanings. When applied to a field with the intervening operator ⋅ , it represents the divergence of the field. The divergence of a field represents the volume density of the outward flux of a vector field from an arbitrarily small volume centered on a certain point. When there is positive divergence at a point, an outward flow of field is created there. The divergence of the gravitational field is mass; the divergence of the electrical field is charge. Charges can be positive or negative, but there is no negative mass (at least, none has ever been found). Positive charges at one point create an outgoing field; negative charges create an incoming field.

When ∇ is applied to a field with the operator ×, it represents the curl of the field, which indicates the way the field rotates at a specific point. If you imagine a very small ball inside a field that represents the movement of a fluid, such as water, the curl of the field can be visualized as the vector that characterizes the rotating movement of the small ball as the fluid passes by at slightly different speeds on its different sides. The curl is represented by a vector aligned with the axis of rotation of the small ball and has a length proportional to the rotation speed.
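The divergence and the curl may be easier to grasp numerically. The following Python sketch (purely an illustration of the definitions, not anything from the text; the choice of fields and all names are my own) samples two simple two-dimensional vector fields on a grid and estimates both quantities with finite differences:

```python
import numpy as np

# Sample two 2-D vector fields on a grid and estimate divergence and
# curl with finite differences (np.gradient).
xs = np.linspace(-1.0, 1.0, 101)
X, Y = np.meshgrid(xs, xs, indexing="ij")

# F = (x, y): a field flowing radially outward -> divergence 2, curl 0.
Fx, Fy = X, Y
# G = (-y, x): a field rotating around the origin -> divergence 0, curl 2.
Gx, Gy = -Y, X

dx = xs[1] - xs[0]
div_F = np.gradient(Fx, dx, axis=0) + np.gradient(Fy, dx, axis=1)
curl_G = np.gradient(Gy, dx, axis=0) - np.gradient(Gx, dx, axis=1)

print(div_F[50, 50])   # ≈ 2.0 everywhere: a uniform "source"
print(curl_G[50, 50])  # ≈ 2.0 everywhere: a uniform rotation
```

The outward-flowing field has constant positive divergence and no curl; the rotating field has no divergence and constant curl, matching the small-ball picture above.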

The first of Maxwell’s equations is equivalent to the statement that the total amount of electric field leaving a volume equals the total sum of charge inside the volume. This is the mathematical formulation of the fact that electric fields are created by charged particles.

The second equation states that the total amount of magnetic field leaving a volume must equal the magnetic field entering the volume. In this respect, the electric field and the magnetic field differ because, whereas there are electrically charged particles that create electric fields, there are no magnetic monopoles, which would create magnetic fields. Magnetic fields are, therefore, always closed loops, because the amount of field entering a volume must equal the amount of field leaving it.

The third equation states that the difference in electrical potential accumulated around a closed loop, which translates into a voltage difference, is equal to the change in time of the magnetic flux through the area enclosed by the loop.

The fourth equation states that electric currents and changes in the electric field flowing through an area are proportional to the magnetic field that circulates around the area.

The understanding of the relationship between electric fields and magnetic fields led, in time, to the development of electricity as the most useful technology of the twentieth century. In 1888, at what is now the Karlsruhe Institute of Technology, Heinrich Hertz was the first to demonstrate experimentally that the electromagnetic waves Maxwell had predicted exist and that they can be used to transmit information over a distance.

We now depend on electrical motors and generators to power appliances, to produce the consumer goods we buy, and to process and preserve most of the foods we eat. Televisions, telephones, and computers depend on electric and magnetic fields to send, receive, and store information, and many medical imaging technologies are based on aspects of Maxwell’s equations.

Electrical engineers and physicists use Maxwell’s equations every day, although sometimes in different or simplified forms. It is interesting to understand, in a simple case, how Maxwell’s equations define the behavior of electrical circuits. Let us consider the third equation, which states that the total sum of voltages measured around a closed circuit is zero when there is no change in the magnetic field crossing the area surrounded by the circuit. In the electrical circuit illustrated in figure 3.2, which includes one battery as a voltage source, one capacitor, and one resistor, the application of the third equation leads directly to

VR + VK − VC = 0.

10998_003_fig_002.jpg

Figure 3.2 An electrical circuit consisting of a capacitor, a resistor, and a voltage source.

This expression results from computing the sum of the voltage drops around the circuit, in a clockwise circulation. Voltage drop VC is added with a negative sign because it is defined against the direction of the circulation; voltages VK (in the battery) and VR (in the resistor) are defined in the direction of the circulation. The current through the capacitor, IC, has the same magnitude as the current through the resistor, IR, but flows in the opposite direction, because in this circuit all current flows through the wires.

Maxwell’s fourth equation implies that the current through the capacitor is proportional to the variation in time of the electric field between the capacitor plates, leading to the expression IC = C·V̇C (where C is the capacitance), since the electric field between the plates is proportional to the voltage difference across them, VC. In this equation, the dot above VC represents the variation in time of this quantity. Mathematically, it is called the derivative with respect to time, also represented as dVC/dt.

The linear relation between the current IR and the voltage VR across a resistor was described in 1827 by another German physicist, Georg Ohm. Ohm’s Law states that there is a linear relation VR = RIR between the two variables, given by the value of the resistance R. Putting together these expressions, we obtain

RC·V̇C + VC = VK.

This expression, which is a direct result of the application of Maxwell’s equations and Ohm’s Law, is a differential equation with a single unknown, VC (the voltage across the capacitor, which varies with time). All other parameters in this expression are known, fixed physical quantities. Differential equations of this type relate variables and their variations with respect to time. This particular differential equation can be solved, either by analytical or numerical methods, to yield the value of the voltage across the capacitor as it changes with time. Similar analyses can be used to derive the behaviors of much more complex circuits with thousands or even millions of electrical elements.
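This particular equation can be integrated numerically in a few lines. The following Python sketch (the component values are arbitrary choices for illustration, not values from the text) applies the simplest numerical method, forward Euler integration, and compares the result with the well-known analytical solution for a capacitor that starts discharged:

```python
import math

# Numerically integrate the RC circuit equation  R*C*dVc/dt + Vc = Vk.
R = 1_000.0   # resistance, ohms (arbitrary illustrative value)
C = 1e-6      # capacitance, farads
Vk = 5.0      # battery voltage, volts
dt = 1e-7     # time step, seconds
Vc = 0.0      # capacitor starts discharged

t = 0.0
while t < 5 * R * C:           # simulate five time constants
    dVc = (Vk - Vc) / (R * C)  # from R*C*dVc/dt = Vk - Vc
    Vc += dVc * dt
    t += dt

# Analytical solution for the same equation with Vc(0) = 0.
exact = Vk * (1.0 - math.exp(-t / (R * C)))
print(round(Vc, 3), round(exact, 3))  # both close to Vk*(1 - e^-5) ≈ 4.97
```

The numerical and analytical answers agree closely, which is exactly the approach (on a vastly larger scale) that circuit simulators take for circuits too complex to solve analytically.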

We know now that neurons in the brain use the very same electromagnetic fields described by Maxwell’s equations to perform their magic. (This will be discussed in chapter 8.) Nowadays, very fast computers are used to perform brain simulation and electrical-circuit simulation by solving Maxwell’s equations.

The Century of Physics

The twentieth century has been called the century of physics. Knowledge of electromagnetism developed rapidly in the nineteenth century, and significant insights about gravity were in place before 1900, but it was in the twentieth century that physics became the driving force of the major advances in technology. A comprehensive view of the relationship between matter and energy was first developed in that century, when the strong and weak nuclear forces joined the electromagnetic and gravitational forces to constitute what we see now as the complete set of four interactions that govern the universe.

Albert Einstein’s “annus mirabilis papers” of 1905 started the century on a positive note. Einstein’s seminal contributions on special relativity and the equivalence of matter and energy (1905c), on the photoelectric effect (1905b), and on Brownian motion (1905a) changed physics. Those publications, together with our ever-growing understanding of electromagnetism, started a series of developments that led to today’s computer and communications technologies.

The 1920s brought quantum mechanics, a counter-intuitive theory of light and matter that resulted from the work of many physicists. One of the first contributions came from Louis de Broglie, who asserted that particles can behave as waves and that electromagnetic waves sometimes behave like particles. Other essential contributions were made by Erwin Schrödinger (who established, for the first time, the probabilistic base for quantum mechanics) and by Werner Heisenberg (who established the impossibility of precisely and simultaneously measuring the position and the momentum of a particle). The philosophical questions raised by these revolutionary theories remain open today, even though quantum mechanics has proved to be one of the most solid and one of the most precisely tested physical theories of all time.

After World War II, Julian Schwinger, Richard Feynman, and Sin-Itiro Tomonaga independently proposed techniques that solved numerical difficulties with existing quantum theories, opening the way for the establishment of a robust theory of quantum electrodynamics. Feynman was also a talented storyteller and one of the most influential popular science writers of all time. In making complicated things easy to understand, few books match his QED: The Strange Theory of Light and Matter (1985).

High-energy physics led to a range of new discoveries, and a whole zoo of sub-atomic particles enriched the universe of particle physics. The idea that fundamental forces are mediated by particles (photons for the electromagnetic force, mesons for the nuclear forces) was verified experimentally. A number of other exotic particles, including positrons (the antimatter version of the electron), anti-protons, anti-neutrons, pions, kaons, muons, taus, and neutrinos, joined the well-known electrons, neutrons, protons, and photons as constituents of matter and energy. At first these particles were only postulated, or were observed primarily in the ionized trails left by cosmic rays; with particle accelerators such as those at CERN and Fermilab, they were increasingly produced on demand. Even more exotic particles, among them the W and Z0 particles, the elusive Higgs boson, and the still-hypothetical weakly interacting massive particle (WIMP), joined the party and will keep theoretical physicists busy for many years to come in their quest for a unifying theory of physics.

A number of more or less exotic theories have been proposed to try to unify gravity with the other three forces (the strong interaction, the electromagnetic interaction, and the weak interaction), using as few free parameters as possible, in a grand unifying theory. These theories attempt to go beyond the standard model, which unifies the electromagnetic, strong, and weak interactions and which comprises quantum electrodynamics and quantum chromodynamics. Among the most popular are string theories, according to which elementary particles are viewed as oscillating strings in some higher-dimensional space. For example, the popular M-theory (Witten 1995) requires space-time to have eleven dimensions—hardly a parsimonious solution. So far, no such theory has been very successful at predicting observed phenomena without careful tuning of parameters after the fact.

At the time of this writing, the Large Hadron Collider at CERN, in Switzerland, represents the latest effort to understand the world of high-energy physics. In 2012, scientists working at CERN were able to detect the elusive Higgs boson, the only particle predicted by the standard model that had never been observed until then. That it escaped observation for so long is all the more curious because the associated mechanism is thought to be responsible for the difference between the photon (which mediates the electromagnetic force, and is massless) and the massive W and Z bosons (which mediate the weak force).

The existing physical theories are highly non-intuitive, even for experts. One of the basic tenets of quantum physics is that the actual outcome of an observation cannot be predicted, although its probability can be computed. This simple fact leads to a number of highly counter-intuitive results, such as the well-known paradox of Schrödinger’s cat (Schrödinger 1935) and the Einstein-Podolsky-Rosen paradox (Einstein, Podolsky, and Rosen 1935).

Schrödinger proposed a thought experiment in which the life of a cat that has been placed in a sealed box depends on the state of a radioactive atom that controls, through some mechanism, the release of a poisonous substance. Schrödinger proposed that, until an observation by an external observer took place, the cat would be simultaneously alive and dead, in a quantum superposition of macroscopic states linked to a random subatomic event with a probability of occurrence that depends on the laws of quantum mechanics.

Schrödinger’s thought experiment, conceived to illustrate the paradoxes that result from the standard interpretation of quantum theory, was, in part, a response to the Einstein-Podolsky-Rosen (EPR) paradox, in which the authors imagine two particles that are entangled in their quantum states, creating a situation such that measuring a characteristic of one particle instantaneously makes a related characteristic take a specific value for the other particle. The conceptual problem arises because pairs of entangled particles are created in many physical experiments, and the particles can travel far from each other after being created. Measuring the state of one of the particles forces the state of the other to take a specific value, even if the other particle is light-years away. For example, if a particle with zero spin decays into an electron and a positron, the spins of the two products must be opposite, on account of the conservation of angular momentum. (Correlated pairs of this kind also arise in PET imaging, which we will discuss in chapter 9, where an electron and a positron annihilate, producing two entangled photons.) But only when one of the particles is measured can its spin be known. At that exact moment, the spin of the other particle will take the opposite value, no matter how far apart they are in space. Such “spooky” instant action at a distance was deemed impossible, according to the theory of relativity, because it seemed it could be used to transmit information at a speed exceeding that of light. Later analyses have shown that the entanglement mechanism cannot in fact be used to transmit information, but the “spooky action-at-a-distance” mechanism (Einstein’s expression) remains as obscure as ever.

The discussions and excitement that accompanied the aforementioned paradoxes and other paradoxes created by quantum mechanics would probably have gone mostly unnoticed by the general public had it not been for their effects on the real world. Many things we use in our daily lives would not be very different if physics had not developed the way it did. We are still raising cattle and cultivating crops much as our forebears did, and we go around in cars, trains, and planes that, to a first approximation, have resulted from the first industrial revolutions and could have been developed without the advantages of modern physics.

There are, however, two notable exceptions. The first was noticed by the whole world on August 6, 1945, when an atomic bomb with a power equivalent to that of 12–15 kilotons of TNT was detonated over Hiroshima. The secret Manhattan Project, headed by the physicist J. Robert Oppenheimer, had been commissioned, some years before, explicitly to explore the possibility of using Einstein’s equation E = mc² to produce a bomb far more powerful than any that existed before. The further developments of fission-based and fusion-based bombs are well known.

The second exception, less noticeable at first, was the development of an apparently simple device called the transistor. Forty years later, it would launch humanity into the most profound revolution it has ever witnessed.

Transistors, Chips, and Microprocessors

The first patent for a transistor-like device (Lilienfeld 1930) dates from 1925. Experimental work alone led to the discovery of the effect that makes transistors possible. However, the understanding of quantum mechanics and the resulting field of solid-state physics were instrumental in the realization that the electrical properties of semiconductors could be used to obtain behaviors that could not be obtained using simpler electrical components, such as resistors and capacitors.

In 1947, researchers at Bell Telephone Laboratories observed that when electrical contacts were applied to a germanium crystal (a semiconductor) the output signal had more power than the input signal. For this discovery, arguably among the most important inventions ever, William Shockley, John Bardeen, and Walter Brattain received the Nobel Prize in Physics in 1956. Shockley, foreseeing that such devices could be used for many important applications, set up the Shockley Semiconductor Laboratory in Mountain View, California, and thus set in motion the dramatic transformations that would eventually lead to the emergence of modern computer technology.

A transistor is a simple device with three terminals, one of them a controlling input. By varying the voltage at this input, a large change in electrical current through the two other terminals of the device can be obtained. This can be used to amplify a sound captured by a microphone or to create a powerful radio wave. A transistor can also be used as a controlled switch.

The first transistors were bipolar junction transistors. In such transistors, a small current also flows through the controlling input, called the base. Other types of transistors eventually came to dominate the technology. The metal-oxide-semiconductor field-effect transistor (MOSFET) is based on different physical principles, but the basic result is the same: a change in voltage at the controlling input (in this case called the gate) creates a significant change in current through the two other terminals of the device (in this case called the source and the drain).

From the simple description just given, it may be a little difficult to understand why such a device would lead to the enormous changes that have occurred in society in the last thirty years, and to the even larger changes that will take place in coming decades. Vacuum tubes, first built in the early twentieth century, exhibit a behavior similar to that of transistors, and can indeed be used for many of the same purposes; they are still used in some audio amplifiers and in other niche applications. Transistors are much more reliable and break down much less frequently than vacuum tubes, but the critical difference, which took several decades to exploit to its full potential, is that transistors can be made very small, in very large numbers, and at a very small cost per unit. Although the British engineer Geoffrey Dummer was the first to propose the idea that many transistors could be packed into an integrated circuit, Jack Kilby and Robert Noyce deserve the credit for realizing the first such circuits, which required no connecting wires between the transistors.

Transistors, tightly packed into integrated circuits, have many uses. They can be used to amplify, manipulate, and generate analog signals, and indeed many devices, such as radio and television receivers, sound amplifiers, cellular phones, and GPS receivers, use them for that purpose. Transistors have enabled circuit designers to pack into very small volumes amplifiers and other signal-processing elements that could not have been built with vacuum tubes or that, if built with them, would have occupied a lot of space and weighed several tons. The development of personal mobile communications was made possible, in large measure, by this particular application of transistors.

However, the huge effect transistor technology has had on our lives is due even more to the fact that integrated circuits with large numbers of transistors can be easily mass produced and to the fact that transistors can be used to process digital information by behaving as controlled on-off switches. Digital computers manipulate data in binary form. Numbers, text, images, sounds, and all other types of information are stored in the form of very long strings of bits (binary digits—zeroes and ones). These binary digits are manipulated by digital circuits and stored in digital memories. Digital circuits and memories are built out of logic gates, all of them made of transistors. For example, a nand gate has two inputs and one output. Its output is 0 if and only if both inputs are 1. This gate, one of the simplest gates possible, is built using four transistors, as figure 3.3a shows: two MOSFETs of type N at the bottom and two of type P at the top. A type N MOSFET behaves like a closed switch if the gate is held at a high voltage, and like an open switch if the gate is held at a low voltage. A type P MOSFET, which can be recognized because it has a small circle on the gate, behaves in the opposite way: it acts like a closed switch when the gate is held at a low voltage, and like an open switch when the gate is held at a high voltage.

10998_003_fig_003.jpg

Figure 3.3 (a) A nand gate made of MOSFET transistors. (b) The logic symbol for a nand gate.

If both X and Y (the controlling inputs of the transistors) are at logical value 1 (typically the supply voltage, VDD), the two transistors shown at the bottom of figure 3.3a work as closed switches and connect the output Z to the ground, which corresponds to the logical value 0. If either X or Y is at the logical value 0 (typically ground, or 0 volts), or if both are, then at least one of the two transistors at the bottom of figure 3.3a works as an open switch and one (or both) of the top transistors works as a closed switch, pulling the value of Z up to VDD, which corresponds to logical value 1. This corresponds in effect to computing Z = ¬(X ∧ Y), where ¬ denotes negation and ∧ denotes the logic operation and. This is the function with the so-called truth table shown here as table 3.1.

Table 3.1 A truth table for logic function nand.

X   Y   Z
0   0   1
0   1   1
1   0   1
1   1   0
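The behavior summarized in table 3.1 can be mimicked in a few lines of code. In this sketch (purely illustrative; the function name is my own), the nand gate is modeled the way the circuit of figure 3.3a works: the output is pulled down to 0 only when both N transistors conduct, and pulled up to 1 otherwise:

```python
# Model the CMOS nand gate of figure 3.3a as a pair of switch networks.
def nand(x: int, y: int) -> int:
    pull_down = (x == 1) and (y == 1)  # both N transistors closed
    return 0 if pull_down else 1       # otherwise a P transistor pulls up

# Print the truth table of table 3.1.
for x in (0, 1):
    for y in (0, 1):
        print(x, y, nand(x, y))
```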

Besides nand gates there are many other types of logic gates, used to compute different logic functions. An inverter outputs the opposite logic value of its input and is, in practice, a simplified nand gate with only two transistors, one of type P and one of type N. An and gate, which computes the conjunction of the logical values on the inputs and outputs 1 only when both inputs are 1, can be obtained by using a nand gate followed by an inverter. Other types of logic gates, including or gates, exclusive-or gates, and nor gates, can be built using different arrangements of transistors and basic gates. More complex digital circuits are built from these simple logic gates.

Nand gates are somewhat special in that any logic function, no matter how complex, can be built out of nand gates alone (Sheffer 1913); the nand gate is therefore said to be complete. In fact, nand gates can be combined to compute any logic function or any arithmetic function over binary numbers. Figure 3.4 illustrates how the two-bit exclusive-or function (which evaluates to 1 when exactly one input is 1) and the three-bit majority function (which evaluates to 1 when at least two inputs are 1) can be implemented using nand gates.

10998_003_fig_004.jpg

Figure 3.4 Exclusive-or and majority gates made of nand gates.

In fact, circuits built entirely of nand gates can compute additions, subtractions, multiplications, and divisions of numbers written in binary, as well as any other functions that can be computed by logic circuits. In practice, almost all complex digital circuits are built using nand gates, nor gates, and inverters, because these gates not only compute the logic function but also regenerate the level of the electrical signal, so it can be used again as input to other logic gates. Conceptually, a complete computer can be built of nand gates alone; in fact, a number of them have been built in that way.

Internally, computers manipulate only numbers written in binary form. Although we are accustomed to the decimal numbering system, which uses the digits 0 through 9, there is nothing special about base 10. That base is probably used because humans have ten fingers, which made it seem natural. A number written in base 10 is actually a compact way to describe a weighted sum of powers of 10. For instance, the number 121 represents

1 × 10² + 2 × 10¹ + 1 × 10⁰,

because every position in a number corresponds to a specific power of 10.

When writing numbers in other bases, one replaces the number 10 with the value of the base used. If the base is smaller than 10, fewer than ten symbols are required to represent each digit. The same number, 121 in base 10, when written in base 4, becomes

1 × 4³ + 3 × 4² + 2 × 4¹ + 1 × 4⁰

(which also can be written as 1321₄, the subscript denoting the base). In base 2, powers of 2 are used and there are only two digits, 0 and 1, which can be conveniently represented by two electrical voltage levels—for example, 0 and 5 V. The same number, 121₁₀, becomes, in base 2, 1111001₂, which stands for

1 × 2⁶ + 1 × 2⁵ + 1 × 2⁴ + 1 × 2³ + 0 × 2² + 0 × 2¹ + 1 × 2⁰.
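Conversion between bases follows directly from this weighted-sum interpretation: repeatedly dividing the number by the base yields its digits, from least significant to most significant. A small Python function, given here purely as an illustration (the name is my own), makes the procedure concrete:

```python
def to_base(n: int, base: int) -> str:
    """Convert a non-negative integer to a digit string in the given base
    by repeated division, collecting the remainders as digits."""
    digits = []
    while n > 0:
        digits.append(str(n % base))  # remainder = next digit
        n //= base                    # quotient carries on to the next step
    return "".join(reversed(digits)) or "0"

print(to_base(121, 2))  # 1111001
print(to_base(121, 4))  # 1321
```

Both outputs match the expansions above: 1111001₂ and 1321₄ are the same number, 121₁₀.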

Arithmetic operations between numbers written in base 2 are performed using logic circuits that compute the desired functions. For instance, if one is using four-bit numbers and wishes to add 7₁₀ and 2₁₀, then one must add the equivalent representations in base 2, which are 0111₂ and 0010₂.

The addition algorithm, shown in figure 3.5, is the one we all learned in elementary school. It consists of adding the digits, column by column, starting from the right, and writing the carry bit from the previous column above the next column to the left. The only difference is that for each column there are only four possible combinations of the two digits, since each digit can take only the value 0 or 1. However, since the carry-in bit can also take two possible values (either there is a carry or there is not), there are a total of eight possible combinations for each column. Those eight combinations are listed in table 3.2, together with the desired values for the output, C, and the carry bit, Cout, which must be added to the bits in the next column to the left. The carry is simultaneously an output of a column (Cout) and an input of the next column to the left (Cin).

10998_003_fig_005.jpg

Figure 3.5 Adding together the numbers 0111₂ and 0010₂.

Table 3.2 Logic functions for the addition of two binary digits. C gives the value of the result bit; Cout gives the result of the carry bit.

10998_003_T3.2

It is easy to verify by inspection that the function C is given by the exclusive-or of the three input bits, a function that takes the value 1 when an odd number of bits are 1. The function Cout is given by the majority function of the same three input bits.
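These two functions can be checked by exhaustive enumeration, exactly as table 3.2 does. A minimal Python sketch (the name full_adder is mine, not from the text):

```python
def full_adder(a, b, cin):
    # Sum bit C: exclusive-or of the three inputs
    # (takes the value 1 when an odd number of inputs are 1).
    c = a ^ b ^ cin
    # Carry-out bit Cout: majority of the three inputs
    # (takes the value 1 when at least two inputs are 1).
    cout = (a & b) | (a & cin) | (b & cin)
    return c, cout

# Enumerate all eight input combinations, reproducing table 3.2:
# C and Cout must together equal the arithmetic sum a + b + cin.
for a in (0, 1):
    for b in (0, 1):
        for cin in (0, 1):
            c, cout = full_adder(a, b, cin)
            assert c == (a + b + cin) % 2
            assert cout == (a + b + cin) // 2
```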

Therefore, the circuit on the left of figure 3.6 computes the output (C) and the carry out (Cout) of its inputs, A, B, and Cin. More interestingly, by wiring together four of these circuits one obtains a four-bit adder, like the one shown on the right of figure 3.6. In this four-bit adder, the topmost single-bit adder adds together the least significant bits of numbers A and B and the carry bit propagates through the chain of adders. This circuit performs the addition of two four-bit numbers, using the algorithm (and the values in the example) from figure 3.5.
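The ripple-carry scheme of figure 3.6 can be sketched directly in Python (the names full_adder and four_bit_add are illustrative, not from the text); the carry computed by each single-bit adder feeds the next one in the chain:

```python
def full_adder(a, b, cin):
    # Single-bit adder: sum is the XOR, carry-out the majority, of the inputs.
    return a ^ b ^ cin, (a & b) | (a & cin) | (b & cin)

def four_bit_add(a_bits, b_bits):
    """Add two four-bit numbers given as lists of bits, least significant
    bit first. The carry ripples from one single-bit adder to the next."""
    carry = 0
    result = []
    for a, b in zip(a_bits, b_bits):
        s, carry = full_adder(a, b, carry)
        result.append(s)
    return result, carry  # the final carry signals overflow

# The example from figure 3.5: 0111 (7) + 0010 (2) = 1001 (9),
# with bits listed least significant first.
sum_bits, overflow = four_bit_add([1, 1, 1, 0], [0, 1, 0, 0])
assert sum_bits == [1, 0, 0, 1] and overflow == 0
```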

10998_003_fig_006.jpg

Figure 3.6 (a) A single-bit adder. (b) A four-bit adder, shown adding the numbers 0111 and 0010 in binary.

More complex circuits (for example, multipliers, which compute the product of two binary numbers) can be built out of these basic blocks. Multiplications can be performed by specialized circuits or by adding the same number multiple times, using the same algorithm we learned in elementary school. Building blocks such as adders and multipliers can then be interconnected to form digital circuits that are general in the sense that they can execute any sequence of basic operations between binary numbers. The humble transistor thus became the workhorse of the computer industry, making it possible to build cheaply and effectively the adders and multipliers Thomas Hobbes imagined, in 1651, as the basis of all human reasoning and memory.
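The elementary-school algorithm in base 2 reduces to shifts and additions: for each 1 bit of the multiplier, a suitably shifted copy of the multiplicand is added to the running total. A simplified sketch (real multiplier circuits are more elaborate, but the principle is the same):

```python
def multiply(a, b):
    """Multiply two non-negative integers using only shifts and adds,
    mirroring long multiplication in base 2: for each 1 bit of the
    multiplier b, add a copy of a shifted to that bit's position."""
    product = 0
    shift = 0
    while b:
        if b & 1:                     # current bit of the multiplier is 1
            product += a << shift     # add a shifted copy of the multiplicand
        b >>= 1
        shift += 1
    return product

assert multiply(7, 2) == 14
assert multiply(121, 121) == 121 * 121
```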

Transistors and logic gates, when arranged in circuits that store binary values over long periods of time, can also be used to build computer memories. Such memories, which can store billions or trillions of bits, are part of every computer in use today. Transistors can, therefore, be used to build general-purpose circuits that compute all possible logic operations quickly, cheaply, and effectively. A sufficiently complex digital circuit can be instructed to add the contents of one memory position to the contents of another memory position, and to store the result in a third memory position. Digital circuits flexible enough to perform these and other similar operations are called Central Processing Units (CPUs). A CPU is the brain of every computer and almost every advanced electronic device we use today. CPUs execute programs, which are simply long sequences of very simple operations. (In the next chapter, I will explain how CPUs became the brains of modern computers as the result of pioneering work by Alan Turing, John von Neumann, and many, many others.)

The first digital computers were built by interconnecting logic gates made from vacuum tubes. They were bulky, slow, and unreliable. The ENIAC—the first fully electronic digital computer, announced in 1946—contained more than 17,000 vacuum tubes, weighed more than 27 tons, and occupied more than 600 square feet.

When computers based on discrete transistors became the norm, large savings in area occupied and in power consumed were achieved. But the real breakthrough came when designers working for the Intel Corporation recognized that they could use a single chip to implement a CPU. Such chips came to be called microprocessors. The first single-chip CPU—the 4004 processor, released in 1971—manipulated four-bit binary numbers, had 2,300 transistors, and weighed less than a gram. A present-day high-end microprocessor has more than 3 billion transistors packed in an area about the size of a postage stamp (Riedlinger et al. 2012).

Nowadays, transistors are mass produced at the rate of roughly 150 trillion (1.5 × 10¹⁴) per second. More than 3 × 10²¹ of them have been produced to date. This number compares well with some estimates of the total number of grains of sand on Earth. In only a few years, we will have produced more transistors than there are synapses in the brains of all human beings currently alive.

The number of transistors in microprocessors has grown rapidly since 1971, following an approximately exponential curve which is known as Moore’s Law. (In 1965, Intel’s co-founder, Gordon Moore, first noticed that the number of transistors that could be placed inexpensively on an integrated circuit increased exponentially over time, doubling approximately every two years.) Figure 3.7 depicts the increase in the number of transistors in Intel’s microprocessors since the advent of the 4004. Note that, for convenience, the number of transistors is shown on a logarithmic scale. Although the graph is relative to only a small number of microprocessors from one supplier, it illustrates a typical case of Moore’s Law. In this case, the number of transistors in microprocessors has increased by a factor of a little more than 2²⁰ in 41 years. This corresponds roughly to a factor of 2 every two years.
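A quick back-of-the-envelope check of this arithmetic: a factor of 2²⁰ means twenty doublings, and twenty doublings spread over 41 years gives the roughly two-year doubling period that Moore observed.

```python
import math

growth_factor = 2 ** 20               # a little more than a millionfold, 1971-2012
years = 41
doublings = math.log2(growth_factor)  # exactly 20 doublings
print(years / doublings)              # about 2.05 years per doubling
```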

10998_003_fig_007.jpg

Figure 3.7 Evolution of the number of transistors of Intel microprocessors.

Many measures of the evolution of digital electronic devices have obeyed a law similar to Moore’s. Processing speed, memory capacity, and sensor sensitivity have all been improving at an exponential rate that approaches the rate predicted by Moore’s Law. This exponential increase is at the origin of the impact digital electronics had in nearly every aspect of our lives. In fact, Moore’s Law and the related exponential evolution of digital technologies are at the origin of many of the events that have changed society profoundly in recent decades.

Other digital technologies have also been improving at an exponential rate, though in ways that are somewhat independent of Moore’s Law. Kryder’s Law states that the number of bits that can be stored in a given area in a magnetic disk approximately doubles every 13 months. Larry Roberts has kept detailed data on the improvements of communication equipment and has observed that the cost per fixed communication capacity has decreased exponentially over a period of more than ten years.

For the case of Moore’s Law, the progress is even more dramatic than that shown in figure 3.7, since the speed of processors has also been increasing. In a very simplified view of processor performance, the computational power increases with both the number of transistors and the speed of the processor. Therefore, processors have increased in computational power by a factor of about 30 billion over the period 1971–2012, which corresponds to a doubling of computational power every 14 months.
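The 14-month figure follows from the same kind of calculation: a 30-billion-fold increase corresponds to about 35 doublings, spread over the 41 years from 1971 to 2012.

```python
import math

factor = 30e9                  # ~30-billion-fold increase in computational power
months = 41 * 12               # 1971-2012, expressed in months
doublings = math.log2(factor)  # about 34.8 doublings
print(months / doublings)      # about 14.1 months per doubling
```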

The technological advances in digital technologies that led to this exponential growth are unparalleled in other fields of science, with a single exception (which I will address in chapter 7): DNA sequencing. The transportation, energy, and building industries have also seen significant advances in recent decades. None of those industries, however, was subject to the type of exponential growth that characterized semiconductor technology. To put things in perspective, consider the fuel efficiency of automobiles. In approximately the same period as was discussed above, the fuel efficiency of passenger cars went from approximately 20 miles per gallon to approximately 35. If cars had experienced the same improvement in efficiency over the last 40 years as computers, the average passenger car would be able to go around the Earth more than a million times on one gallon of fuel.
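The car comparison can be checked with the same 30-billion-fold factor used above (the Earth-circumference figure below is the standard equatorial value, not from the text):

```python
# If cars had improved by the same ~30-billion-fold factor as processors,
# starting from 20 miles per gallon:
improvement = 30e9
mpg = 20 * improvement            # hypothetical miles per gallon
earth_circumference = 24_901      # miles, at the equator
print(mpg / earth_circumference)  # roughly 24 million trips around the Earth
```

The result, tens of millions of circumnavigations per gallon, comfortably supports the text's "more than a million times."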

The exponential pace of progress in integrated circuits has fueled the development of information and communication technologies. Computers, interconnected by high-speed networks made possible by digital circuit technologies, became, in time, the World Wide Web—a gigantic network that interconnects a significant fraction of all the computers in existence.

There is significant evidence that, after 25 years, Moore’s Law is running out of steam—that the number of transistors that can be packed onto a chip is not increasing as rapidly as in the past. But it is likely that other technologies will come into play, resulting in a continuous (albeit slower) increase in the power of computers.

The Rise of the Internet

In the early days of digital computers, they were used mostly to replace human computers in scientific and military applications. Before digital computers, scientific and military tables were computed by large teams of people called human computers. In the early 1960s, mainframes (large computers that occupied entire rooms) began to be used in business applications. However, only in recent decades has it become clear that computers are bound to become the most pervasive appliance ever created.

History is replete with greatly understated evaluations of the future developments of computers. A probably apocryphal story has it that in 1943 Thomas Watson, the president of IBM, suggested that there would be a worldwide market for perhaps five computers. As recently as 1977, Ken Olsen, chairman and founder of the Digital Equipment Corporation, was quoted as saying “There is no reason anyone would want a computer in their home.” In 1981, Bill Gates, chairman of Microsoft, supposedly stated that 640 kilobytes of memory ought to be enough for anyone. All these predictions vastly underestimated the development of computer technology and the magnitude of its pervasiveness in the modern world.

However, it was not until the advent of the World Wide Web (which began almost unnoticeably in 1989 as a proposal to interlink documents in different computers so that readers of one document could easily access other related documents) that computers entered most people’s daily lives. The full power of the idea of the World Wide Web was unleashed by the Internet, a vast network of computers, interconnected by high-speed communications equipment, that spans the world. The Internet was born in the early 1970s when a group of researchers proposed a set of communication protocols known as TCP/IP and created the first experimental networks interconnecting different institutions.

The TCP/IP protocol soon enabled thousands of computers, and later millions, to become interconnected and to exchange files and documents. The World Wide Web was made easily accessible, even to non-expert users, by the development of Web browsers—programs that display documents and can be used to easily follow hypertext links. The first widely used Web browser—Mosaic, developed by a team at the University of Illinois at Urbana-Champaign, led by Marc Andreessen—was released in 1993, and represented a turning point for the World Wide Web. The growth of the Internet and the popularity of the World Wide Web took the world by surprise. Arguably, no phenomenon has changed so rapidly and so completely the culture and daily life around the world as the Internet.

Figure 3.8 plots the number of users since the beginning of the Internet. The number of users grew from just a few in 1993 to a significant fraction of the world population in twenty years. Probably no other technology (except some that were developed on top of the Web) has changed the world as rapidly as the World Wide Web has.

10998_003_fig_008.jpg

Figure 3.8 Evolution of the number of users of the Internet.

Initially, the World Wide Web gave users access to documents stored in servers (large computers, maintained by professionals, that serve many users at once). Development of the content stored in these servers was done mostly by professionals or by advanced users. However, with the development of more user-friendly applications and interfaces it became possible for almost any user to create content, be it text, a blog, a picture, or a movie. That led to what is known as Web 2.0 and to what is known in network theory as quadratic growth of the Web’s utility. For example, if the number of users doubles, and all of them contribute to enriching the Web, the amount of knowledge that can be used by the whole community grows by a factor of 4, since twice as many people have access to twice as much knowledge. This quadratic growth of the network utility has fueled development of new applications and uses of the World Wide Web, many of them unexpected only a few years ago.
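The quadratic-growth argument can be made concrete with a toy utility measure (the function below is a hypothetical illustration, not a model from network theory texts): with n contributing users, each of n readers can draw on content from n producers.

```python
def network_utility(n_users):
    """Toy utility measure: n users each with access to content
    contributed by all n users, so utility grows as n * n."""
    return n_users * n_users

# Doubling the number of contributing users quadruples the utility:
# twice as many people have access to twice as much content.
assert network_utility(200) == 4 * network_utility(100)
```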

The Digital Economy

Easy access to the enormous amounts of information available on the World Wide Web, by itself, would have been enough to change the world. Many of us can still remember the effort that was required to find information on a topic specialized enough not to have an entry in a standard encyclopedia. Today, a simple Internet search will return hundreds if not thousands of pages about even the most obscure topic. With the advent of Web 2.0, the amount of information stored in organized form exploded. At the time of this writing, the English version of the online encyclopedia Wikipedia includes more than 4.6 million articles containing more than a billion words. That is more than 30 times the number of words in the largest English-language encyclopedia ever published, the Encyclopaedia Britannica. The growth of the number of articles in Wikipedia (plotted in figure 3.9) has followed an accelerating curve, although it shows a tendency to decelerate as Wikipedia begins to cover a significant fraction of the world’s knowledge relevant to a large set of persons.

10998_003_fig_009.jpg

Figure 3.9 Evolution of the number of articles in Wikipedia.

Wikipedia is just one of the many examples of services in which a multitude of users adds value to an ever-growing community, thus leading to a quadratic growth of utility. Other well-known examples are YouTube (which makes available videos uploaded by users), Flickr and Instagram (photos), and an array of social networking sites, of which the most pervasive is Facebook. A large array of specialized sites cater to almost any taste or persuasion, from professional networking sites such as LinkedIn to Twitter (which lets users post very small messages any time, from anywhere, using a computer or a cell phone).

Electronic commerce—that is, use of the Web to market, sell, and ship physical goods—is changing the world’s economy in ways that were unpredictable just a decade ago. It is now common to order books, music, food, and many other goods online and have them delivered to one’s residence. In 2015 Amazon became the world’s most valuable retailer, outpacing the biggest brick-and-mortar stores. Still, Amazon maintains physical facilities to store and ship the goods it sells. In a more radical change, Uber and Airbnb began to offer services (respectively transportation and lodging) using an entirely virtual infrastructure; they now pose serious threats to the companies that used to provide those services in the form of physical facilities (cabs and hotels).

Online games offer another example of the profound effects computers and the Internet can have on the way people live their daily lives. Some massively multiplayer online role-playing games (MMORPGs) have amassed very large numbers of subscribers, who interact in a virtual game world. The players, interconnected through the Internet, develop their game-playing activities over long periods of time, building long-term relationships and interacting in accordance with the rules of the virtual world. At the time of this writing, the most popular games have millions of subscribers, the size of the population of a medium-size country. Goods existing only in the virtual world of online games are commonly traded, sometimes at high prices, in the real world.

Another form of interaction that may be a harbinger of things to come involves virtual worlds in which people can virtually live, work, buy and sell properties, and pursue other activities in a way that mimics the real world as closely as technology permits. The best-known virtual-world simulator of this type may be Second Life, launched in 2003. Second Life has a parallel economy, with a virtual currency that can be exchanged in the same ways as conventional currency. Second Life citizens can develop a number of activities that parallel those in the real world. The terms of service ensure that users retain copyright for content they create, and the system provides simple facilities for managing digital rights. At present the user interface is still somewhat limited in its realism, since keyboard-based interfaces and relatively low-resolution computer-generated images are used to interact with the virtual world. Despite its relatively slow growth, Second Life now boasts about a million regular users.

This is a very brief and necessarily extremely incomplete overview of the impact of Internet technology on daily life. Much more information about these subjects is available on the World Wide Web, for instance, in Wikipedia. However, even this cursory description is enough to make it clear that there are millions of users of online services that, only a few years ago, simply didn’t exist.

One defining aspect of present-day society is its extreme dependency on information and communication technologies. About sixty years ago, IBM was shipping its first electronic computer, the 701. At that time, telephone networks were important for the economy, but they were based on analog technologies; only a vanishingly small fraction of economic output depended on digital computers.

Today, digital technologies are such an integral part of the economy that it is very difficult, if not impossible, to compute their contribution to economic output. True, it is possible to compute the total value created by makers of computer equipment, by creators of software, and, to a lesser extent, by producers of digital goods. However, digital technologies are so integrated in each and every activity of such a large fraction of the population that it isn’t possible to compute the indirect contribution of these technologies to the overall economy. A number of studies have addressed this question but have failed to assign concrete values to the contributions of digital technologies to economic output.

It is clear, however, that digital technologies represent an ever-increasing fraction of the economy. This fraction rose steadily from zero about sixty years ago to a significant fraction of the economic output today. In the United States, the direct contribution of digital technologies to the gross domestic product (GDP) is more than a trillion dollars (more than 7 percent of GDP), and this fraction has increased at a 4 percent rate in the past two decades (Schreyer 2000)—a growth unmatched by any other industry in history. This, however, doesn’t consider all the effects of digital technologies on everyday life that, if computed, would lead to a much higher fraction of GDP.

There is no reason to believe that the growth in the importance of digital technologies in the economy will come to a stop, or even that the rate of growth will reduce to a more reasonable value. On the contrary, there is ample evidence that these technologies will account for an even greater percentage of economic output in coming decades. It may seem that, at some point, the fraction of GDP due to digital technologies will stop growing. After all, some significant needs (e.g., those for food, housing, transportation, clothing, and energy) cannot be satisfied by digital technologies, and these needs will certainly account for some fixed minimum fraction of overall economic activity. For instance, one may assume, conservatively, that some fixed percentage (say, 50 percent) of overall economic output must be dedicated to satisfying actual physical needs, since, after all, there is only so much we can do with computers, cell phones, and other digital devices.

That, however, is an illusion based on the idea that overall economic output will, at some point, stagnate—something that has never happened and that isn’t likely to happen any time soon. Although basic needs will have to be satisfied (at least for quite a long time), the potential of new services and products based on digital technology is, essentially, unbounded. Since the contribution of digital technologies to economic growth is larger than the contribution of other technologies and products, one may expect that, at some point in the future, purely digital goods will represent the larger part of economic output. In reality, there is no obvious upper limit on the overall contribution of the digital economy. Unlike physical goods, digital goods are not limited by the availability of physical resources, such as raw materials, land, or water. The rapid development of computer technology made it possible to deploy new products and services without requiring additional resources, other than the computing platforms that already exist. Even additional energy requirements are likely to be marginal or even non-existent as computers become more and more energy efficient.

This is nothing new in historical terms. Only a few hundred years ago, almost all of a family’s income was used to satisfy basic needs, such as those for food and housing. With the technological revolutions, a fraction of this income was channeled to less basic but still quite essential things, such as transportation and clothing. The continued change toward goods and services that we deem less essential is simply the continuation of a trend that was begun long ago with the invention of agriculture.

One may think that, at some point, the fraction of income channeled into digital goods and services will cease to increase simply because people will have no more time or more resources to dedicate to the use of these technologies. After all, how many hours a day can one dedicate to watching digital TV, to browsing the Web, or to phone messaging? Certainly no more than 24, and in most cases much less. Some fraction of the day must be, after all, dedicated to eating and sleeping. However, this ignores the fact that the digital economy may create value without direct intervention of human beings. As we will see in chapter 11, digital intelligent agents may, on their own behalf or on behalf of corporations, create digital goods and services that will be consumed by the rest of the world, including other digital entities. At some point, the fraction of the overall economic output actually attributable to direct human activity will be significantly less than 100 percent. To a large extent, this is already the case today. Digital services that already support a large part of our economy are, in fact, performed by computers without any significant human assistance. However, today these services are performed on behalf of some company that is, ultimately, controlled by human owners or shareholders. Standard computation of economic contributions ultimately attributes the value added by a company to the company’s owners.

In this sense, all the economic output generated today is attributable to human activities. It is true that in many cases ownership is difficult to trace, because companies are owned by other companies. However, in the end, some person or group of persons will be the owner of a company and, therefore, the generator of the economic output that is, in reality, created by very autonomous and, in some cases, very intelligent systems. This situation will remain unchanged until the day when some computational agent is given personhood rights, comparable to those of humans or corporations, and can be considered the ultimate producer of the goods or services. At that time, and only at that time, we will have to change the way we view the world economy as a product of human activity.

However, before we get to that point, we have to understand better why computers have the potential to be so disruptive, and so totally different from any other technology developed in the past. The history of computers predates that of the transistor and parallels the history of the discovery of electricity. Whereas the construction of computers that actually worked had to await the existence of electronic devices, the theory of computation has its own parallel and independent history.