Seattle Data/Analytics/Machine Learning- June 23, 2016 - WA, Seattle

Document created by aalvarez on Jun 9, 2016Last modified by aalvarez on Jun 23, 2016
Version 3Show Document
  • View in full screen mode


Date:June 23, 2016

Microsoft Bldg 27

3009 157th Pl Ne, Redmond, WA

Registration Link:

Ticket PriceFree



This event is not to be missed! Ted Dunning, Chief Application Architect at MapR, will be giving a talk on anomaly detection. Ted is an amazing speaker. If there is one talk you don't want to miss this year, this would be it. (And we have some seriously good speakers and topics--that's how awesome Ted is!)

Registration is open to all DAML members. Due to limited space, we are disabling guests for this event. Friends are welcome, they'll just need to register for the group first.

This event will be hosted on the Microsoft main campus, Building 27. See map at the bottom of the page. Food is generously sponsored by MapR.

Arrive early for food. The talk will start around 6:30pm. There will be plenty of time for questions after.

The Seattle Data/Analysis/Machine Learning Meetup Group  is a relatively informal group of Seattle-area data scientists, software engineers, and related people.  They meet monthly at tech companies who are willing to host them in downtown Seattle and Bellevue, and have a format which varies from a series of lightning talks on different topics, to having visiting or local speakers give hour-long tutorials or tech talks.



- Ted Dunning - How to Find What You Didn't Know to Look For, Practical Anomaly Detection

Anomaly detection is the art of automating surprise. To do this, we have to be able to define what we mean by normal and recognize what it means to be different from that. The basic ideas of anomaly detection are simple. You build a model and you look for data points that don’t match that model. The mathematical underpinnings of this can be quite daunting, but modern approaches provide ways to solve the problem in many common situations.

We will describe these modern approaches with particular emphasis on several real use-cases including:

  1. rate shifts to determine when events such as web traffic, purchases or process progress beacons shift rate.
  2. time series generated by machines or biomedical measurements.
  3. topic spotting to determine when new topics appear in a content stream such as Twitter.
  4. network flow anomalies to determine when systems with defined inputs and outputs act strangely.

In building a practical anomaly detection system you have to deal with practical details starting with algorithm selection, data flow architecture, anomaly alerting, user interfaces and visualizations.

We will show how to deal with each of these aspects of the problem with an emphasis on realistic system design.


Ted Dunning is Chief Application Architect at MapR Technologies and committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects . Ted has been very active in mentoring new Apache projects and is currently serving as vice president of incubation for the Apache Software Foundation .  Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems. He built fraud detection systems for ID Analytics (LifeLock) and he has 24 patents issued to date and a dozen pending. Ted has a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting.






For exact directions and more details visit: