Search results for: May 2016

Cassandra War Stories: Part 1

// 05.17.2016 // Data

This is part one of a multi-part series exploring the successes (and scars) that we’ve had while tuning Cassandra to perform well in MediaMath’s Data Management Platform.

Fast reads on time series data

We use Cassandra as the backend data store for our Data Management Platform (DMP) system here at MediaMath. Advertisers use DMPs to store their first-party data, as well as the third-party data segments they buy, so that they can deploy both to bid on ad opportunities for targeted audiences. This requires hardware that can handle extremely large volumes of data and search them very quickly. We chose to […]

Video: Fast Distributed Online Classification and Clustering

// 05.05.2016 // Data Science

Thousands of software developers, full-stack engineers, consultants and systems architects flocked to Dublin, Ireland this past April for the 2016 Hadoop Summit. Hosted by Hortonworks – a major Apache Hadoop distributor – and Yahoo, the Hadoop Summit was home to 3 full days packed with Hadoop and big data innovations – straight from the elephant’s mouth. The first day featured our own Prasad Chalasani, SVP of Data Science at MediaMath, and his talk on Fast Distributed Online Classification and Clustering. He outlines how he and Ram Sriharsha at Databricks leveraged recent machine-learning research to develop a fast, practical, scalable, online, distributed, […]