Ambeone Student’s Projects Gallery

Based on the Big Data Analytics & Machine Learning Techniques taught in Ambeone’s Programs

This is a Gallery of some glimpses Data Science projects done by recent Ambeone students as part of their program.In case you are interested to know more about a particular project/projects, you may contact us for details .


Journey Transition Time Analysis & Delay Prediction

Submitted by: Manjusha


Ambeone Data Science Course Project

Transport Vehicle Transit Time & Delay Prediction Model Using Data Science


The Transit Time and Delay Prediction Model was developed using data Science Regression Modelling for a leading Transport and Logistics company based in UAE and servicing whole of GCC with their fleet of Trucks. The transport trucks reach their destination ,crossing many checkpoints ( stages)  .Time stamp is inserted to the Database  at the entry and exit of each journey check point. A standard expected  time is assigned to each of these check points and the variance is calculated from the actual and expected time taken by the vehicle to reach the check  point .

This data was used to create a Prediction Model using the Time Variances  at different check points as the independent variables and the Time variance in reaching  final destination as  the dependent variable.


  • Predict the transition delay of each journey
  • Predict the ratio of delayed trips per month

Data Preparation & Processing

  1. Handled NAs and Data Entry errors
  2. Created new variables like day-of-week of the journey
  3. Calculated the transition time for all stages from  entry, exit time
  4. Divided Dataset into Training & Testing
  5. Shapiro test was done
  6. Correlation matrix was plotted
  7. Created models using different data science methods ,evaluated and improved them

Data Science Techniques and Models used

  • Linear Regression
  • Time Series
  • Neural Network

Key Results

  • TT to start journey, TT inside Batha,TT  to destination from Batha are well correlated to journey completion time.
  • Time taken to Batha from Silla is inversely correlated to the time spent inside Silla.
  • Time Taken for Loading Arrival is inversely correlated to time taken for loading.
Linear regression:

Created a model to predict the transition delay for a journey in hours using Linear Regression and Improved the model up to 93.39 R-Square using training and testing data.

Time Series:

ARIMA,ETS,ARIMA-STL,Simple Exponential Smoothening,Holt’s method were applied to predict the ratio of delayed journeys (monthly) and found that ARIMA-STL and ARIMA yielded the best results.

Neural Network:

Further improved the model using NN and RMSE was reduced from 86 to 41.

A Robust Transit Time and Delay model was developed which helped the organisation in accurately predicting the Transit times for various journeys and help manage the predicted delays.


Predicting Diabetes using Classification Models
error: Content is protected !!