Think Summer: Project 3 — 2023

Submission

Students need to submit the following file by 10:00PM EST through Gradescope inside Brightspace.

  1. A Jupyter notebook (a .ipynb file).

We’ve provided you with a template notebook for you to use. Please carefully read this section to get started.

When you are finished with the project, please make sure to run every cell in the notebook prior to submitting. To do this click Run  Run All Cells. Next, to export your notebook (your .ipynb file), click on File  Download, and download your .ipynb file.

Questions

Question 1

How many flights departed from O’Hare (ORD) and landed in Denver (DEN)?

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 2

What is the Distance (in miles) of any individual flight from O’Hare to Denver?

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 3

Consider only the flights that arrive to Honolulu (airport code HNL), i.e., for which Honolulu is the destination. What are the 10 most popular origin airports?

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 4

Each airplane has a unique TailNum. Which airplane flew the most flights in 2005? Hint: If you strictly look at the TailNum values, you will need to ignore the top two results, because they are missing data.

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 5

Which airplane flew the largest number of times from O’Hare to Denver? Hint: Again, if you strictly look at the TailNum values, you will need to ignore the top result, because it has missing data.

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 6

What were the 10 most popular days of the year to fly in 2005 (where "popularity" is evaluated by the number of flights on that day)? Hint: You might paste together the year, month, and day of the flights.

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 7

In which month are the average departure delays the worst? Hint: You might use a tapply function.

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 8

Make a dotchart that illustrates the data from the previous question (about the average flight delays in each month).

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 9

The UniqueCarrier is the airline carrier for the flight. If you add all of the miles flown (in the Distance) for each airline carrier, which carrier flew the most miles altogether in 2005? Hint: You might use a tapply function.

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)

Question 10

Create your own interesting question about the 2005 flight data. What insights can you find?

Items to submit
  • R used to solve this problem. (2 pts)

  • Output from running R. (1 pt)