End to end application for monitoring Real-Time Uber Data Using Apache APIs - Jacksonville Java User Group - July 12, 2017 - FL, United States

Document created by maprcommunity Employee on Jun 22, 2017Last modified by maprcommunity Employee on Jun 22, 2017
Version 2Show Document
  • View in full screen mode


Date:July 12, 2017
Location:10752 Deerwood Park Blvd S, Ste 110, Jacksonville FL 32256, Jacksonville, FL
Registration Link:End to end application for monitoring Real-Time Uber Data Using Apache APIs - Jacksonville JAVA User Group (JaxJUG) (Ja…  



Building of an end-to-end data pipeline to ingest, process and store high speed, Uber trip data. 

For this session we will explore the power of streaming real time events in the context of the IoT and connected cars.

We will look at a solution combines real-time data streams with iterative machine learning to predict and visualize popular Uber trip locations in New York City. Ingestion of the real time data (location, date,time) , analyzing it to provide location clusters, as well as providing real time dashboards will all be covered.  You will see the end-to-end process required to build this application using  Apache APIs for Kafka, Spark, HBase and other technologies.

According to Gartner, by 2020, smart cities will be using about 1.39 billion connected cars, IoT sensors and devices. The analysis of behavior patterns within cities will allow optimization of traffic, better planning decisions, and smarter advertising.  You may be excited about the possibilities of exploiting data streams to gain actionable insights from continuously produced data in real-time but you may find it difficult to conceptualize how to implement such a solution and how this can fit into your business. In this presentation, we will walk you through an architecture that combines data streaming with machine learning to enhance a Uber service with an ability to analyze, predict and visualize the most popular taxi pick-up/drop-off locations by date and time so that drivers' locations can be optimized.

• Part 1 Spark machine learning

• Part 2 Kafka and Spark Streaming

• Part 3 Real time dashboard using Vert.x

• Part 4 Spark Streaming, Dataframes and HBase


Carol McDonald

Carol has extensive experience as a developer and architect building complex, mission-critical applications in the Banking, Health Insurance and Telecom industries. As a Java Technology Evangelist at Sun Microsystems, Carol traveled all over the world speaking at Sun Tech Days, JUGs, companies, and conferences. She is a recognized speaker in Java communities.




End To End Application For Monitoring Real-Time Uber Data Using Apache Apis: Kafka, Spark, Hbase – Part 3: Real-Time Dashboard Using Vert.X

Real-Time Streaming Data Pipelines With Apache Apis: Kafka, Spark Streaming, And Hbase

Getting Started with MapR-ES Event Data Streams

Learn how to code Fast Scalable Applications with Spark Streaming, the Kafka API and the HBase API  

Spark Streaming And Twitter Sentiment Analysis 

A Better Way to Build a Fraud Detector: Streaming Data and Microservices Architecture

Streaming Stock Market Data with Apache Spark and Kafka 

Apache Spark