Other demographic information

The patient's residence is included in the data. Because it is already a categorical variable, we can leave it unchanged.

Let's check what we have so far by printing the first five rows with the head() function:

X_train.head(n=5)

Scroll horizontally through the output to confirm that all of our transformations and variable drops have been applied correctly.
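A visual check can be supplemented with a small programmatic one. The following is a minimal sketch, not part of the original workflow: it uses a toy DataFrame in place of X_train, and the column names (AGE, RESIDENCE, PATIENT_ID) and the helper check_columns() are hypothetical names chosen for illustration only.

```python
import pandas as pd

# Toy stand-in for X_train; these column names are assumptions for illustration.
X_train_demo = pd.DataFrame({
    "AGE": [34, 71, 5],
    "RESIDENCE": ["Private residence", "Nursing home", "Private residence"],
})


def check_columns(df, expected_present, expected_dropped):
    """Return (missing, leftover): expected columns that are absent,
    and columns that should have been dropped but are still present."""
    missing = [c for c in expected_present if c not in df.columns]
    leftover = [c for c in expected_dropped if c in df.columns]
    return missing, leftover


missing, leftover = check_columns(
    X_train_demo,
    expected_present=["AGE", "RESIDENCE"],
    expected_dropped=["PATIENT_ID"],
)
print("Missing columns:", missing)
print("Columns that should have been dropped:", leftover)

# Listing the dtypes is a quick way to see which columns are still
# stored as strings (object) and which are numeric.
print(X_train_demo.dtypes)
```

With the real data, you would pass X_train and your own lists of column names. Two empty lists indicate that the expected columns are present and the dropped ones are gone.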