MediaMath processes many terabytes of data each day for the various reports available in T1. One metric we show is the number of unique impressions for each campaign, there is a big difference between showing an ad to 100 different people and showing the same ad to one person 100 times. While this is conceptually a simple problem, solving it at scale is not quite as straightforward. The canonical way of solving this problem would be for any given campaign to put the id of each person who saw an ad for that campaign into a set and then check […]
Seth Wiesman was the 2015 summer intern on the Data Platform team at MediaMath, where he worked distributed analytics systems. He recently graduated with a B.S. in Computer Science and a BS Information Technology from the University of Missouri. He will be returning to the University of Missouri were he will be returning to pursue his M.S. in Computer Science.