Practical Uses and Methods for Synthetic Data

Video created by slimbaltagi on Aug 14, 2017

    A 'Free Code Friday' talk by Ted Dunning. Synthetic data is remarkably useful for many data science tasks and can even improve security. Ted Dunning uses log-synth, an open-source program, to generate interesting randomized data. Watch this 30-minute demo to see how you can use log-synth to:

    • Make up names and addresses or sample from realistically perverse numerical distributions
    • Build data sets that can join cleanly but have long-tailed frequency distributions
    • Build fairly realistic session histories
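
As a rough illustration of the second point, the sketch below builds two small tables that join cleanly on a shared key while the key frequencies follow a long tail. It is plain Python and does not use log-synth or its schema format; the function name `make_tables`, its parameters, and the Zipf-like weighting are assumptions chosen only for this example.

```python
import random
from collections import Counter

def make_tables(n_users=100, n_events=5000, skew=1.2, seed=42):
    """Hypothetical helper: a user table plus an event table keyed on user_id."""
    rng = random.Random(seed)
    users = [{"user_id": i, "name": f"user-{i:03d}"} for i in range(n_users)]
    # Zipf-like weights: the user at rank r gets weight 1 / r**skew,
    # so a few users produce most events and the rest form a long tail.
    weights = [1.0 / (rank ** skew) for rank in range(1, n_users + 1)]
    ids = rng.choices(range(n_users), weights=weights, k=n_events)
    events = [{"event_id": j, "user_id": uid} for j, uid in enumerate(ids)]
    return users, events

users, events = make_tables()
counts = Counter(e["user_id"] for e in events)   # events per user
known_ids = {u["user_id"] for u in users}        # keys available for the join
```

Because every event's `user_id` is drawn from the user table, a join between the two never produces orphaned rows, while `counts` shows the heavy skew toward the first few users.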