Chicago Data Discovery Virtual Summit
April 22nd, 11:30 am – 3:30 pm
April 23rd, 10:00 am – 12:00 pm
Many organizations have a wealth of data but lack the infrastructure needed to extract value from these data sources. This virtual event will bring together researchers and practitioners to discuss how to overcome the technical, organizational, and regulatory barriers that hinder data science. The summit will consist of talks, panels, and a demo of a prototype software stack developed at the University of Chicago.
Note: this workshop will include the CDAC Data & Technology Outlook, “Principles for AI in the Real World,” featuring Martin Casado, General Partner at Andreessen Horowitz, and Nick Feamster, Neubauer Professor of Computer Science at the University of Chicago.
Agenda
Thursday, April 22nd
11:30 am – 12:30 pm Center for Data and Computing Fireside Chat with Nick Feamster (CS, University of Chicago) and Martin Casado (Andreessen Horowitz)
Despite their immense possibilities, many machine learning techniques present barriers to entry that make them difficult, if not prohibitive, to implement, deploy, and maintain for all but the very largest cloud providers and content delivery networks. Professor Nick Feamster and Martin Casado present practical advice to help industrial practitioners, researchers, and students overcome or mitigate these challenges.
1:30 pm – 2:30 pm Intro to Data Stations: Lightning talks from Graduate Students moderated by Raul Castro Fernandez
Three barriers hinder effective data sharing: easy access to sensitive data, data discovery and integration, and data governance and compliance. Each has both technical and human components. Professor Raul Castro Fernandez and his students present a new data architecture that facilitates secure access to sensitive data, data discovery and integration across organizational boundaries, and governance and compliance. This session is intended to highlight state-of-the-art research for industrial practitioners and researchers.
2:30 pm – 3:30 pm Industry Panel on Data Discovery (led by Mike Franklin)
- Sarah Catanzaro (Amplify Partners)
- Kent Graziano (Snowflake)
- Shant Hovsepian (Databricks)
What is the future of the “data lake”? This panel brings together technical, business, and venture capital perspectives on the future of enterprise data management and data discovery. The panel is aimed at a diverse audience of researchers, students, and industrial practitioners who want to understand this emerging area of data management.
Friday, April 23rd
10:00 am – 11:00 am Data Stations Tutorial
Professor Raul Castro Fernandez and his students demonstrate the software artifacts developed from their research. This session is intended for industrial practitioners interested in deploying these technologies at their own organizations.
11:00 am – 12:00 pm Partner Feedback Session
The Data Stations team will host an open forum for all attendees to understand how our research and software can best address their organizations' needs. This session is intended for industrial practitioners and future project partners.