MediaMath Developer Blog Authors

A Picture of Seth Wiesman
SETH WIESMAN

Seth Wiesman is a Data Engineer on the reporting team at MediaMath, where he works on real-time distributed systems and ‘Big Data’ technologies to provide analytics and insights to the users of T1.

A Picture of Seth Wiesman
SETH WIESMAN

Seth Wiesman is a Data Engineer on the reporting team at MediaMath, where he works on real-time distributed systems and ‘Big Data’ technologies to provide analytics and insights to the users of T1.

articles by this author:

Apache Flink® at MediaMath: Rescaling Stateful Applications in Production

// 06.13.2017 // Data Science

This article was originally posted by DataArtisans, on June 12, 2017. Every once in awhile, Amazon Web Services experiences a service disruption, and millions of internet users around the globe panic as their favorite apps and websites cease to function. A short time later, the issue is resolved, and it’s back to business as usual. Most people move along with their day, eventually forgetting the micro-crisis altogether. But it’s not so simple for the software engineers whose companies are built on top of AWS and who are responsible for recovering from the disruption. Such was the case for MediaMath, a programmatic marketing company […]

Counting at Scale: HyperLogLog to the Rescue

MediaMath processes many terabytes of data each day for the various reports available in T1. One metric we show is the number of unique impressions for each campaign, there is a big difference between showing an ad to 100 different people and showing the same ad to one person 100 times. While this is conceptually a simple problem, solving it at scale is not quite as straightforward. The canonical way of solving this problem would be for any given campaign to put the id of each person who saw an ad for that campaign into a set and then check […]