2017-02-13 Keeping Spark on Track: Best Practices Using Apache Spark in Production - Miklos Christine

A joint Chicago Chapter ACM / Loyola University Computer Science Department meeting

Keeping Spark on Track: Best Practices Using Apache Spark in Production

Speaker: Miklos Christine

Monday, February 13, 2017

5:45 pm (Social Hour, light refreshments)

6:30 pm Presentation

Loyola University Water Tower Campus (Chicago/Michigan Area)

111 E. Pearson Street, Chicago IL 60611

Beane Ballroom (13th Floor, Lewis Towers) Campus map

Admission: Free, General Admission, open to the public

The purpose of this talk is to share best practices learned while developing Apache Spark workflows in production across various industries. Apache Spark is a popular distributed processing framework that allows organizations to analyze multiple streams of data for machine learning and exploratory SQL workloads. We will discuss debugging tips, best practices for transforming datasets, and discuss integrating with existing libraries such as numpy and pandas. Code examples will be shared in python.

Miklos Christine is a solutions architect for Databricks where he helps customers deploy and use Apache Spark to build batch and streaming applications. Miklos was previously a systems engineer at Cloudera where he helped strategic customers deploy and use the Apache Hadoop ecosystem in production. He has contributed to several projects in the open source community and holds a BS in electrical engineering and computer sciences from the University of California-Berkeley.

There will be refreshments catered thanks to the good people @databricks.

Reservations:

Click here to Reserve for Monday, February 13

or send an e-mail to greg@neumarke.net

Proposed Future

Meeting Dates:

4/19/2017

5/24/2017

6/21/2017

9/13/2017

10/18/2017

11/8/2017

12/13/2017

Subscribe to the Chicago Chapter ACM e-mail list. (Look for an e-mail after pressing the button)