Spark SQL & Machine Learning - A Practical Demonstration - Oct 11, 2016 - GA - Remote Attendees Welcome!

Document created by cwarman on Oct 10, 2016. Last modified by aalvarez on Nov 9, 2016.

SUMMARY

Date: Tuesday, October 11, 2016
Time: 6:30pm-8:30pm ET
Location: Emory Continuing Education - Room 116, 6 Executive Park Dr., Atlanta, GA

Registration Link: https://www.meetup.com/Atlanta-Apache-Spark-User-Group/events/233622127/
Webcast Link: http://bit.ly/ATLsparkWebcast

 

ABOUT THE EVENT

For all levels, from beginners to experts - Craig Warman will be covering: 

  • Apache Spark - High-level overview, Spark SQL architecture, BI tool access via ODBC/JDBC
  • Machine Learning - General workflow, Collaborative Filtering basics, Spark MLlib introduction

 

This presentation explores how developers can deliver powerful machine learning applications by leveraging Spark's SQL and MLlib libraries. A brief overview covering Spark components and architecture kicks things off, and then we dive right in with a live demonstration of loading and querying data using Spark SQL. Next, we'll examine the basics of machine learning algorithms and workflows before getting under the hood of a Spark MLlib-based recommendation engine. Our final demonstration looks at how familiar tools can be used to query our recommendation data before we wrap up with a survey of real-world use cases.

 

Outline:

  • Spark Background/Overview - The Spark+Hadoop team, Spark's five main components
  • Spark SQL Architecture - How DataFrames work, The SQLContext, Data sources
  • Demo #1: Loading And Querying a Dataset with Spark SQL
  • Machine Learning with Spark MLlib - Collaborative filtering basics, Alternating Least Squares (ALS) algorithm, General machine learning workflow
  • Demo #2: Under The Hood With A Spark MLlib Recommendation Engine - Recommender model code review and live demonstration of training-test loop iterations
  • Demo #3: Putting It All Together - Tableau with Spark SQL ODBC/JDBC
  • Some Real-World Use Cases

 

LOGISTIC DETAILS

IF YOU WILL JOIN IN PERSON - Please arrive at 6:30pm. We'll have soft drinks, water, and light snacks on hand, so plan to eat dinner before you get there!

Emory has asked us to wrap up by 8:30pm since their building closes at 9:00. Directions to the location are here: http://www.ece.emory.edu/new_location/directions.html

IF YOU WILL JOIN REMOTELY - Please register now for the webcast at this link: http://bit.ly/ATLsparkWebcast. There's a very short registration form, plus an opportunity to test your browser to make sure you can join when the event starts.

 

RELATED MATERIALS

On Apache Spark

 

SPONSORS