FlyerTalk Forums

FlyerTalk Forums (https://www.flyertalk.com/forum/index.php)
-   TravelBuzz (https://www.flyertalk.com/forum/travelbuzz-176/)
-   -   What can I predict using DOT data? (https://www.flyertalk.com/forum/travelbuzz/1329402-what-can-i-predict-using-dot-data.html)

dfreeman02 Mar 27, 2012 12:32 am

What can I predict using DOT data?
 
I need to propose a project for a data mining class. Given my favorite "hobby," I'd like to work with the DOT airline data sets on (domestic) ticket sales and on-time performance, and use them to predict some feature of air travel. My first two ideas were:

1. Predict number of unsold F seats -- probably can't be done with the available data.
2. Predict on-time performance for flights/city pairs -- already done by FlightCaster.

Any other ideas? The data are the following:

- Ticket data: for 10% of all domestic itineraries, includes origin/destination/connecting cities, ticketing/operating carriers, fare, class, distance, number of pax.

- On-time data: date, carrier, tail number, flight number, origin/destination, scheduled/actual arrival/departure times, actual wheels up/down and taxi times, delay reason, diversion/cancellation info.

The ticket data has *no* information about purchase date or flight date (other than which quarter the flight was in), and no info about domestic flights connecting to/from international flights.

emma69 Mar 27, 2012 8:33 am


Originally Posted by dfreeman02 (Post 18280364)
I need to propose a project for a data mining class. Given my favorite "hobby," I'd like to work with the DOT airline data sets on (domestic) ticket sales and on-time performance, and use them to predict some feature of air travel. My first two ideas were:

1. Predict number of unsold F seats -- probably can't be done with the available data.
2. Predict on-time performance for flights/city pairs -- already done by FlightCaster.

Any other ideas? The data are the following:

- Ticket data: for 10% of all domestic itineraries, includes origin/destination/connecting cities, ticketing/operating carriers, fare, class, distance, number of pax.

- On-time data: date, carrier, tail number, flight number, origin/destination, scheduled/actual arrival/departure times, actual wheels up/down and taxi times, delay reason, diversion/cancellation info.

The ticket data has *no* information about purchase date or flight date (other than which quarter the flight was in), and no info about domestic flights connecting to/from international flights.

How about weather delays in certain cities / at certain times of year? Or on time performance, but just for major holidays (ie the use would be 'how likely is your flight the day before Thanksgiving to be on time')


All times are GMT -6. The time now is 10:27 pm.


This site is owned, operated, and maintained by MH Sub I, LLC dba Internet Brands. Copyright © 2026 MH Sub I, LLC dba Internet Brands. All rights reserved. Designated trademarks are the property of their respective owners.