Birds Strike on Flights

Data Analysis
Analyze CSV file using R studio, Sqlite and MySQL

Analyze the bird Strike accident in a dimensional way for future prevention.

R studio, MySQL, SQLite, CSV

About this project

Steps to deal with the data:

1. Inspecting the data file
Overview of the csv file:
Part of rows

2.Then create the appropriate database schema using lucidchart
Schema I designed for the data:

3.Read csv file in R studio, and populate data into database.
Tables Created in MySql:
Incidents table
4.Analyze data using R
Find the most dangerous flightphase:
landing and takeoff are the most dangerous
Top 10 Airlines of high incidents:
Top 10 airlines with most incidents

Month with most incidents:

Month with most incidents

Incidents trends:

What I learned in this project:
  • install/procure MySQL or MariaDB
  • connect to MySQL/MariaDB from R in an R Notebook
  • build a relational schema in at least 3NF (but ideally in BCNF) for an existing data set
  • load data from CSV files into a relational database through R
  • execute SQL queries against a MySQL/MariaDB database through R
  • perform simple analytics in R
  • identify and resolve programming errors
  • look up details for R, SQL, and MySQL/MariaDB
  • time-box work