Tuesday, August 18 • 11:20am - 11:40am
Creating an index of all US Small and Medium Businesses with Scala and Spark

Radius Intelligence (www.radius.com) uses data science to deliver a unique marketing intelligence platform used by hundreds of US companies. At Radius we have moved our entire data processing platform from Hadoop to Spark. This presentation will discuss how data scientists, data engineers and product managers come together to explore data and build new data-processing and predictive models on top of our database of tens of millions of US businesses. It will also explain how Scala and Spark are used to deliver high-speed matching across hundreds of millions of records, and how the MLlib machine learning library is used to resolve and impute values for the Index.

Thomas Gerber

Thomas Gerber is a lead Big Data Engineer at Radius, where he crunches lots of data on lots of machines using Spark and Scala. He was a Solution Architect for 6 years at search engine software vendor Exalead (acquired by Dassault Systemes), which gave him a passion for distributed systems. Thomas was also cofounder and CTO of AODocs, which provides Smart Document Management as a service on top of Google Drive.

Tuesday August 18, 2015 11:20am - 11:40am
Track B
