ProjectsGeek

Download Mini projects with Source Code, Java projects with Source Codes

Hadoop Projects

Airline On-Time Performance Hadoop Project

January 24, 2018 by ProjectsGeek

Objective

The objective is to analyze airline data with R and provide on-time performance statistics to the end user.

Project Overview

Airline on-time performance refers to the rate at which an airline's services run on schedule. Delay is one of the most important issues in the airline industry because it causes financial losses for airline owners. This project analyzes airline data to provide the necessary statistics on on-time performance. One research study shows that nearly 20% of airline flights are delayed or cancelled every year. These delays and cancellations are a major problem for the industry's service and business, and they affect both travellers and airlines.

The project focuses on extracting on-time performance statistics from historical airline data using R. Factors such as weather, scheduling issues and late passenger arrival cause airline delays. On-time performance is measured by the following formula.

On-Time Performance = (On-Time Service/Total Number of Services)*100%
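As a worked example of the formula above, here is a minimal sketch. The project itself uses R; Python is used here only for illustration, and the function name `on_time_performance` and the sample counts are hypothetical.

```python
def on_time_performance(on_time_services: int, total_services: int) -> float:
    """Return on-time performance as a percentage of all services."""
    if total_services <= 0:
        raise ValueError("total_services must be positive")
    if not 0 <= on_time_services <= total_services:
        raise ValueError("on_time_services must be between 0 and total_services")
    # (On-Time Service / Total Number of Services) * 100
    return 100 * on_time_services / total_services


# Hypothetical example: 800 of 1,000 services ran on time.
print(on_time_performance(800, 1000))  # 80.0
```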

Proposed System

The proposed system concentrates on analyzing airline data history to provide the important and interesting statistics related to airline on time performance. The proposed system architecture is shown in the figure.

Figure: Proposed System Architecture

Module 1: Data Collection

The required data set, the US Department of Transportation airline on-time performance data, is collected from the web. The attributes of the data set are origin, destination, date, early time and late time.

Module 2: Data Preparation

The collected raw data set is loaded into a MySQL database through R integration. The raw data is susceptible to missing and noisy values, so preprocessing techniques such as data cleaning are applied to replace missing values and smooth noisy data.
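The cleaning step could look something like the sketch below. It is only an illustration in Python (the project does this in R against MySQL) and it assumes two specific techniques the post does not name: missing values are replaced by the median of the known values, and extreme values are clipped to a chosen range as one simple way to damp noise. The function names, the sample delays and the bounds are all hypothetical.

```python
from statistics import median


def fill_missing(values, fill=None):
    """Replace None entries with `fill` (default: median of the known values)."""
    known = [v for v in values if v is not None]
    if fill is None:
        if not known:
            raise ValueError("no known values to compute a median from")
        fill = median(known)
    return [fill if v is None else v for v in values]


def clip(values, low, high):
    """Limit each value to the range [low, high] to damp extreme readings."""
    return [min(max(v, low), high) for v in values]


# Hypothetical arrival delays in minutes; None marks a missing record.
raw_delays = [5, None, 12, 900, -3, None, 20]
cleaned = clip(fill_missing(raw_delays), low=-60, high=180)
print(cleaned)  # [5, 12, 12, 180, -3, 12, 20]
```

The median is used rather than the mean so that a single extreme value such as 900 does not distort the fill value.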

Module 3: Statistics

The preprocessed data set is processed in R to identify the important statistics. The R packages dplyr and ggplot2 are used to generate them.

The statistics answer the following:

  • Number of airlines from the same origin
  • Number of airlines to the same destination
  • Arrival delay reasons
    • Late Aircraft
    • Weather
    • Security
    • Carrier
    • National Aviation System
  • Cancellations
    • Weather
    • Carrier
    • National Aviation System
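The questions above are grouped counts. The sketch below shows the idea in Python for illustration only; the project uses dplyr in R. The record layout (`carrier`, `origin`, `dest`, `delay_cause`) and the sample flights are hypothetical and are not the actual column names of the data set.

```python
from collections import Counter

# Hypothetical flight records; delay_cause is None when the flight was on time.
flights = [
    {"carrier": "AA", "origin": "JFK", "dest": "LAX", "delay_cause": "Weather"},
    {"carrier": "DL", "origin": "JFK", "dest": "ATL", "delay_cause": None},
    {"carrier": "AA", "origin": "ORD", "dest": "LAX", "delay_cause": "Carrier"},
    {"carrier": "UA", "origin": "ORD", "dest": "LAX", "delay_cause": "Weather"},
]


def carriers_per(records, field):
    """Count the distinct carriers for each value of `field` (origin or dest)."""
    groups = {}
    for record in records:
        groups.setdefault(record[field], set()).add(record["carrier"])
    return {key: len(carriers) for key, carriers in groups.items()}


def count_by(records, field):
    """Count records per distinct non-empty value of `field`."""
    return Counter(r[field] for r in records if r.get(field))


print(carriers_per(flights, "origin"))   # airlines from the same origin
print(carriers_per(flights, "dest"))     # airlines to the same destination
print(count_by(flights, "delay_cause"))  # arrival delay reasons
```

Cancellation reasons could be counted the same way with `count_by` on a separate field.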

Module 4: Data Visualization

The extracted statistics and information are visualized using R packages dplyr and ggplot2.

Benefits

  • This project identifies the factors that influence airline on-time performance. Business owners benefit from the statistics by making better decisions in the future and understanding the business thoroughly.
  • Travelers can find a suitable airline based on the on-time performance statistics.

Software Requirements

  • Windows
  • MySQL
  • R

Hardware Requirements

  • Hard Disk – 500 GB or Above
  • RAM required – 4 GB or Above
  • Processor – Core i3 or Above

Technology Used

  • Statistics
  • Business Intelligence


Aadhar Based Analysis using Hadoop Projects

January 22, 2018 by ProjectsGeek

Objective

  • To analyze the Aadhar data using Hadoop to extract meaningful knowledge for the purpose of better decision-making by the central and state government.

Project Overview

India, the world's largest democracy, is the second largest nation by population, with 1.3 billion people. Of these, 99% of the adult population has enrolled for Aadhar, the unique identity provided by the Government of India for diverse purposes. The government maintains Aadhar-related data in digital format. The website https://data.uidai.gov.in/uiddatacatalog/dataCatalogHome.do provides access to Aadhar-related data sets. The public can access some of these data sources and analyze them to extract useful information and generate reports.

The data set covers more than 99% of the nation's adult population, so the volume of Aadhar data is very large. Not all of the data collected for this identity is structured; it also includes unstructured and semi-structured data. Enrollment is still in progress, so new data is generated at a high rate. These characteristics of volume, variety and velocity place Aadhar data under the big data concept.

Hadoop is designed for storing and processing large amounts of data, so this project uses Hadoop to process the Aadhar data. The input data is processed using MapReduce, and the result is then loaded into the Hadoop Distributed File System (HDFS). Final reports are generated using Tableau (business intelligence software).
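To make the MapReduce step concrete, the sketch below simulates the map, shuffle and reduce phases in a single Python process. It does not use the Hadoop API and is not the project's actual job; the input layout (`state,gender,approved`) and the sample rows are hypothetical.

```python
from collections import defaultdict


def map_record(line):
    """Map step: emit (state, approved_count) for one CSV line."""
    state, _gender, approved = line.split(",")
    yield state.strip(), int(approved)


def shuffle(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(key, values):
    """Reduce step: total the values collected for one key."""
    return key, sum(values)


# Hypothetical input rows: state, gender, number of cards approved.
lines = [
    "Kerala,F,120",
    "Kerala,M,100",
    "Goa,F,30",
]
pairs = [pair for line in lines for pair in map_record(line)]
totals = dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())
print(totals)  # {'Kerala': 220, 'Goa': 30}
```

In a real Hadoop job the framework performs the shuffle across machines and writes the reducer output to HDFS; only the map and reduce functions are supplied by the developer.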

Proposed System

The proposed system concentrates on analyzing Aadhar related data using Hadoop for the purpose of better decision making by the Government of India. The proposed system architecture is shown in the figure.

Figure: Aadhar Based Analysis using Hadoop Project

Step 1: Data Preparation

Data Selection: The required data set is collected from the government web portal.

Data Loading: The collected data set is loaded into the Hadoop Distributed File System (HDFS).

Data Preprocessing: The collected data set might contain missing values and noisy data. Analysis performed on such data may lead to wrong results, so the data set is preprocessed first.

Step 2: Data Analysis

Data Analysis: The prepared data set is now ready for analysis. Descriptive statistics such as mean, median, mode and percentiles are applied.
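The descriptive statistics named above can be computed as in the following sketch, which uses Python's standard `statistics` module for illustration. The `percentile` helper uses the nearest-rank definition, which is an assumption (the post does not say which percentile method is used), and the enrolment figures are hypothetical.

```python
import math
from statistics import mean, median, mode


def percentile(values, p):
    """Nearest-rank percentile: smallest value with at least p% of the data at or below it."""
    if not values:
        raise ValueError("values must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(values)
    rank = math.ceil(p * len(ordered) / 100)
    return ordered[rank - 1]


# Hypothetical number of enrolments in ten districts.
enrolments = [120, 80, 100, 80, 150, 90, 80, 110, 130, 60]

print("mean:", mean(enrolments))
print("median:", median(enrolments))
print("mode:", mode(enrolments))
print("90th percentile:", percentile(enrolments, 90))
```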

Step 3: Results

Report Generation: After the data analysis, the results need to be visualized. Tableau can be used for this purpose. Bar charts, line charts and pie charts are generated along with tables.

Statistics Questions

  • Identify the total number of cards approved, by gender
  • Identify the total number of cards approved, by state
  • Identify the total number of cards approved, by age
  • Identify the total number of cards approved in rural areas
  • Identify the total number of cards approved in city areas
  • Identify the number of cards rejected by the government, by state
  • Identify the number of cards rejected by the government, by gender
  • Identify the number of cards rejected by the government, by age

Advantages

  • The government can immediately take corrective measures for issues found in the Aadhar data analysis.
  • Central and state governments can take the necessary precautionary measures to avoid such issues in the future.

Software Requirements

  • Linux OS
  • MySQL
  • Hadoop & MapReduce
  • Tableau

Hardware Requirements

  • Hard Disk – 1 TB or Above
  • RAM required – 8 GB or Above
  • Processor – Core i3 or Above

Technology Used

  • Big Data – Hadoop
  • Statistics
