Spark SQL & Machine Learning - A Practical Demonstration - Oct 11, 2016 - GA - Remote Attendees Welcome!

Document created by cwarman Employee on Oct 10, 2016Last modified by aalvarez on Nov 9, 2016
Version 3Show Document
  • View in full screen mode


Date:Tuesday, October 11, 2016
Time:6:30pm-8:30pm ET

Emory Continuing Education - Room 116

6 Executive Park Dr., Atlanta, GA

Registration Link: 
Webcast Link: 



For all levels, from beginners to experts - Craig Warman will be covering: 

  • Apache Spark - High-level overview, Spark SQL architecture, BI tool access via ODBC/JDBC
  • Machine Learning - General workflow, Collaborative Filtering basics, Spark MLlib introduction


This presentation explores how developers can deliver powerful machine learning applications by leveraging Spark's SQL and MLlib libraries. A brief overview covering Spark components and architecture kicks things off, and then we dive right in with a live demonstration of loading and querying data using Spark SQL. Next, we'll examine the basics of machine learning algorithms and workflows before getting under the hood of a Spark MLlib-based recommendation engine. Our final demonstration looks at how familiar tools can be used to query our recommendation data before we wrap up with a survey of real-world use cases.



  • Spark Background/Overview - The Spark+Hadoop team, Spark's five main components
  • Spark SQL Architecture - How DataFrames work, The SQLContext, Data sources
  • Demo #1: Loading And Querying a Dataset with Spark SQL
  • Machine Learning with Spark MLlib - Collaborative filtering basics, Alternating Least Squares (ALS) algorithm, General machine learning workflow
  • Demo #2: Under The Hood With A Spark MLlib Recommendation Engine, Recommender model code review and live demonstration of training-test loop iterations
  • Demo #3 Putting It All Together - Tableau with Spark SQL ODBC/JDBC
  • Some Real-World Use Cases



IF YOU WILL JOIN IN PERSON - Please arrive at 6:30pm. We'll have soft drinks, water, and light snacks on-hand, so plan to eat dinner before you get there!

Emory has asked us to wrap up by 8:30pm since their building closes at 9:00. Directions to the location are here:

IF YOU WILL JOIN IN REMOTELY - Please register now for the webcast at this link: There's a very short registration form, plus an opportunity to test your browser to make sure you can join when the event starts.



On apache spark


1 person found this helpful