Exploring and predicting the probability of flights being delayed at airports across the United States given prior flight records. Apache Spark is used in processing the dataset.