Go Back  FlyerTalk Forums > Travel&Dining > TravelBuzz
Reload this Page >

What can I predict using DOT data?

Community
Wiki Posts
Search

What can I predict using DOT data?

Thread Tools
 
Search this Thread
 
Old Mar 27, 2012 | 12:32 am
  #1  
Original Poster
15 Years on Site
 
Join Date: Feb 2008
Location: SFO/SJC
Programs: UA 1K, NW forever
Posts: 1,391
What can I predict using DOT data?

I need to propose a project for a data mining class. Given my favorite "hobby," I'd like to work with the DOT airline data sets on (domestic) ticket sales and on-time performance, and use them to predict some feature of air travel. My first two ideas were:

1. Predict number of unsold F seats -- probably can't be done with the available data.
2. Predict on-time performance for flights/city pairs -- already done by FlightCaster.

Any other ideas? The data are the following:

- Ticket data: for 10% of all domestic itineraries, includes origin/destination/connecting cities, ticketing/operating carriers, fare, class, distance, number of pax.

- On-time data: date, carrier, tail number, flight number, origin/destination, scheduled/actual arrival/departure times, actual wheels up/down and taxi times, delay reason, diversion/cancellation info.

The ticket data has *no* information about purchase date or flight date (other than which quarter the flight was in), and no info about domestic flights connecting to/from international flights.
dfreeman02 is offline  
Old Mar 27, 2012 | 8:33 am
  #2  
FlyerTalk Evangelist
10 Countries Visited
20 Countries Visited
30 Countries Visited
15 Years on Site
 
Join Date: Feb 2010
Posts: 13,595
Originally Posted by dfreeman02
I need to propose a project for a data mining class. Given my favorite "hobby," I'd like to work with the DOT airline data sets on (domestic) ticket sales and on-time performance, and use them to predict some feature of air travel. My first two ideas were:

1. Predict number of unsold F seats -- probably can't be done with the available data.
2. Predict on-time performance for flights/city pairs -- already done by FlightCaster.

Any other ideas? The data are the following:

- Ticket data: for 10% of all domestic itineraries, includes origin/destination/connecting cities, ticketing/operating carriers, fare, class, distance, number of pax.

- On-time data: date, carrier, tail number, flight number, origin/destination, scheduled/actual arrival/departure times, actual wheels up/down and taxi times, delay reason, diversion/cancellation info.

The ticket data has *no* information about purchase date or flight date (other than which quarter the flight was in), and no info about domestic flights connecting to/from international flights.
How about weather delays in certain cities / at certain times of year? Or on time performance, but just for major holidays (ie the use would be 'how likely is your flight the day before Thanksgiving to be on time')
emma69 is offline  


Contact Us - Archive - Advertising - Cookie Policy - Privacy Statement - Terms of Service -

This site is owned, operated, and maintained by MH Sub I, LLC dba Internet Brands. Copyright © 2026 MH Sub I, LLC dba Internet Brands. All rights reserved. Designated trademarks are the property of their respective owners.