Mid-Ohio Hadoop User Group - August 30, 2016 - OH

Document created by aalvarez on Aug 15, 2016
Last modified by aalvarez on Aug 26, 2016
Version 4

SPEAKER SLIDES & RELATED MATERIAL AVAILABLE NOW

Scroll down this page to find them all.

 

SUMMARY

Date: Aug 30, 2016

Location: Cardinal Health FUSE, 4305 W. Dublin Granville Rd., Dublin, OH

Time: 17:45 - 20:00

Registration Link: http://www.meetup.com/MOHUG-Mid-Ohio-Hadoop-User-Group/events/232891865/

Ticket Price: Free

    

ABOUT THE EVENT

The MOHUG is a cross-industry community actively involved in leveraging big data to solve real-world business problems. The goal of the meetup is to focus on the practical aspects of using big data to solve business problems in the enterprise.

 

Join the August edition to celebrate the Meetup's 1st birthday.

 

AGENDA

  • Doing Data Science with Apache Spark - Dong Meng from MapR Technologies.

Spark is a distributed computing framework that makes data science practical on huge datasets. This presentation opens with an introduction to Spark core, then dives into use cases: running ad-hoc analytical queries with SparkSQL, building machine learning pipelines with MLlib, and doing graph modeling with GraphX.

 

  • Impala performance benchmarks and use cases - Derek Kane from Cloudera.

 

  • Security in the cluster - Erik Nor from Moser Consulting.

As data in enterprise Hadoop clusters continues to grow, securing that data remains an important part of any implementation, yet it is often an afterthought. This presentation will cover best practices for securing a cluster, including authentication via Kerberos, authorization, ongoing administration, auditing via Ranger, access via Knox, and encryption via TDE, SASL, and SSL. It will demonstrate why each aspect of security is needed, how it is implemented, and what each tool does to protect the data. If time allows, live examples of how the tools are configured and how they protect your data will be shown.

 

MORE INFORMATION

Visit: http://www.meetup.com/MOHUG-Mid-Ohio-Hadoop-User-Group/events/232891865/

 

 

SPEAKER SLIDES

Dong Meng - Data Science with Spark

 

OTHER RELATED MATERIALS

On Apache Spark

     Find more on: spark

 

On analytics

 
