Hour 17

Summarizing Data Results from a Query

What You’ll Learn in This Hour:

▶ Defining functions
▶ Using aggregate functions
▶ Summarizing data with aggregate functions
▶ Getting results from functions
▶ Understanding why you want to group data
▶ Grouping results with the GROUP BY clause
▶ Using group value functions
▶ Understanding group functions
▶ Grouping by columns
▶ Deciding between GROUP BY and ORDER BY
▶ Reducing groups with the HAVING clause

In this hour, you learn about the aggregate functions of SQL. You can perform a variety of useful functions with aggregate functions, such as getting the highest total of a sale or counting the number of orders processed on a given day. The real power of aggregate functions is discussed in the next hour when you tackle the GROUP BY clause. You have already learned how to query the database and return data in an organized way. You have also learned how to sort data from a query. During this hour, you learn how to return data from a query and break it into groups for improved readability.

Using Aggregate Functions

Functions are keywords in SQL that you use to manipulate values within columns for output purposes. A function is a command normally used with a column name or expression that processes the incoming data to produce a result. SQL contains several types of functions. This hour covers aggregate functions. An aggregate function provides summarization information for a SQL statement, such as counts, totals, and averages.

This basic set of aggregate functions is discussed in this hour:

▶ COUNT
▶ SUM
▶ MAX
▶ MIN
▶ AVG

In previous hours, you updated some of the data in the BIRDS database. For example, several records in the table were updated to place a NULL value in the WINGSPAN column. During this hour, you rerun the scripts to rebuild the tables and the data, to return the data to its original form. Remember that at any point, you can execute the script tables.sql to drop and rebuild your tables, and the script data.sql to insert the original data back into your tables. The following tables are used for examples during this hour:

▶ BIRDS
▶ MIGRATION, BIRDS_MIGRATION
▶ FOOD, BIRDS_FOOD
▶ PHOTOGRAPHERS (created during the previous hour)

Be sure to check your data and reset it, if necessary, so that your results match the results in the examples during this hour. For example, if there are null values in the WINGSPAN column of BIRDS, any aggregate functions will return different results than if numeric values existed in all rows of data for WINGSPAN . Remember that you can reset your example BIRDS database at any time by rerunning the provided scripts tables.sql and data.sql to rebuild your tables and insert fresh data back into your tables.

The `COUNT` Function

You use the COUNT function to count rows or values of a column that do not contain a NULL value. When used within a query, the COUNT function returns a numeric value. You can also use the COUNT function with the DISTINCT command to count only the distinct rows of a dataset. ALL (opposite of DISTINCT) is the default; including ALL in the syntax is not necessary. Duplicate rows are counted if DISTINCT is not specified. One other option with the COUNT function is to use it with an asterisk. COUNT(*) counts all the rows of a table, including duplicates, regardless of whether a column contains a NULL value.

Note

Use DISTINCT Only in Certain Circumstances

You cannot use the DISTINCT command with COUNT(*), only with COUNT (column_name). This is because DISTINCT is a function that looks for a unique value in a column, whereas (*) represents all columns in a table or a complete row of data.

The syntax for the COUNT function follows:

Hour 17

Summarizing Data Results from a Query

Using Aggregate Functions

The COUNT Function

The SUM Function

The AVG Function

The MAX Function

The MIN Function

Grouping Data

Using the GROUP BY Clause

Group Functions

Grouping Selected Data

Creating Groups and Using Aggregate Functions

Understanding the Difference Between GROUP BY and ORDER BY

Using CUBE and ROLLUP Expressions

Using the HAVING Clause

Summary

Q&A

Workshop

Quiz

Exercises

The `COUNT` Function

The `SUM` Function

The `AVG` Function

The `MAX` Function

The `MIN` Function

Understanding the Difference Between `GROUP BY` and `ORDER BY`

Using `CUBE` and `ROLLUP` Expressions

Using the `HAVING` Clause