Skip to content

p-s-vishnu/Severity-of-airplane-accident

Repository files navigation

Severity of airplane accident

Flying has been the go-to mode of travel for years now; it is time-saving, affordable, and extremely convenient. According to the FAA, 2,781,971 passengers fly every day in the US, as in June 2019. Passengers reckon that flying is very safe, considering strict inspections are conducted and security measures are taken to avoid and/or mitigate any mishappenings. However, there remain a few chances of unfortunate incidents.

Imagine you have been hired by a leading airline. You are required to build Machine Learning models to anticipate and classify the severity of any airplane accident based on past incidents. With this, all airlines, even the entire aviation industry, can predict the severity of airplane accidents caused due to various factors and, correspondingly, have a plan of action to minimize the risk associated with them.

Problem statement

Predict how severe can an airplane accident be in order to take actions to minimize the risk associated with it.

Evaluation Criteria

score = 100 * f1_score(average="weighted")

Data

The dataset comprises 3 files:

  • **Train.csv: **[10000 x 12 excluding the headers] contains Training data
  • **Test.csv: **[2500 x 11 excluding the headers] contains Test data
  • **sample_submission.csv: **contains a sample of the format in which the Results.csv needs to be
Columns Description
Accident_ID unique id assigned to each row
Accident_Type_Code the type of accident (factor, not numeric)
Cabin_Temperature the last recorded temperature before the incident, measured in degrees fahrenheit
Turbulence_In_gforces the recorded/estimated turbulence experienced during the accident
Control_Metric an estimation of how much control the pilot had during the incident given the factors at play
Total_Safety_Complaints number of complaints from mechanics prior to the accident
Days_Since_Inspection how long the plane went without inspection before the incident
Safety_Score a measure of how safe the plane was deemed to be
Violations number of violations that the aircraft received during inspections
Severity
(Highly_Fatal_And_Damaging,
Significant_Damage_And_Serious_Injuries,
Minor_Damage_And_Injuries,
Significant_Damage_And_Fatalities)
a description (4 level factor) on the severity of the crash [Target]

Target: Severity

Points to note

  1. 50% public data and 50% private data => Have good cross validation strategy to avoid overfitting
  2. Only 10 Submissions per day

Todo

  • Your Baseline
  • Literacy survey
  • Outlier checks
  • Scaling check
  • Choosing the right k for Cross val
  • Reveiw discussions on Hackerearth
  • Try out different models
  • Refer competition.md
  • Refer kaggle guides
  • Maximum time: Feature engineering - Synergy variables
  • Important: Ensembling
  • Research papers
  • Blend