Creating sample data

Next, we want to create a Hive external table on top of S3 logs and use EMR to compute the results. We can do this using the following three different methods: