Company
Date Published
Author
MongoDB
Word count
1973
Language
English
Hacker News points
None

Summary

The authors of this blog post used MongoDB's aggregation framework to analyze a domestic flights dataset, which contains information on commercial flights from 1987 onwards. They answered various questions about flight delays, aircraft ages, and the effects of Hurricane Sandy on air transportation in New York. The authors started by importing the data into MongoDB, removing unnecessary fields, and creating indexes on relevant fields to improve query performance. They used simple aggregations, queries, and visualizations to answer their questions, and also explored more complex topics such as cascading delays and the effects of Hurricane Sandy. The results showed that morning flights have the least average arrival delays, summer and Christmas seasons have the most delays, and certain states experience the worst delays. The authors also discussed challenges and lessons learned from the project, including the importance of indexing and handling issues with date fields.