EVENT DETAILS
Wednesday / CS Seminar
January 15th / 12:00 PM
Hybrid / Mudd 3514
Speaker
Sainyam Galhotra, Cornell University
Talk Title
Context-aware Responsible Data Science
Abstract
"Data-based systems are increasingly used in applications that have far-reaching consequences and long-lasting societal impact. However, the development process remains highly specialized, tedious, and unscalable. This produces a manually fine-tuned rigid solution that works only for one specific problem in one specific context. The system fails to adapt to the changing world and severely limits the full utilization of valuable data.
So, how can you avert this fate for your systems?
In this talk, I present my vision of context-aware systems that enable even non-expert users to develop correct, explainable, and equitable data-science pipelines. To achieve this, I will focus on i) re-thinking the design of data science pipelines, and ii) the importance of causal inference for trustworthy data analysis. I will present a data discovery framework that automatically identifies useful data on behalf of end-users for various tasks. Lastly, I will discuss my proposal of leveraging counterfactual reasoning and causal inference to quantify the impact of an input on the outcome. These topics are the pieces of the puzzle that come together to create the Data Scientists' holy grail - an easily deployable, scalable, and robust system that you can trust even as everything around it evolves."
Biography
Sainyam Galhotra is an Assistant Professor in Computer Science at Cornell University and a field member for Computer Science, Statistics and Data Science. Previously, he was a Computing Innovation Fellow pursuing postdoctoral research at the University of Chicago. He received his Ph.D. from the University of Massachusetts Amherst under the supervision of Prof. Barna Saha (currently at UC San Diego). The goal of his research is to lay the foundation of responsible data science, that enable efficient development and deployment of trustworthy data analytics applications. His research has combined techniques from Data Management, Probabilistic Methods, Causal Inference, Machine Learning, and Software Engineering. His research has been published in top-tier Data Management (SIGMOD, VLDB, PODS, & ICDE), AI (NeurIPS, AAAI & AIES) and Software Engineering (FSE) conferences. He is a recipient of the Best Paper Award in FSE 2017 and Most Reproducible Paper Award in both SIGMOD 2017 and 2018, and Best Artifact Paper Honorable Mention Award in SIGMOD 2023. He was recognized as a Data Science rising star, a DAAD AInet Fellow, and as the first recipient of the Krithi Ramamritham Award at UMass for contribution to database research.
Research/Interest Areas
Data Management
---
Zoom: https://northwestern.zoom.us/j/92345079181?pwd=4K3AzzUtPHxoMnaB97EZqRXJ5s4vva.1
Panopto: https://northwestern.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=07420843-9285-40df-81c0-b26001574825
DEI Minute: Diversity tinyurl.com/cspac-dei-minute
TIME Wednesday January 15, 2025 at 12:00 PM - 1:00 PM
LOCATION 3514, Mudd Hall ( formerly Seeley G. Mudd Library) map it
ADD TO CALENDAR&group=&location=&pipurl=" class="button_outlook_export">
CONTACT Wynante R Charles wynante.charles@northwestern.edu
CALENDAR Department of Computer Science (CS)