Date & Time:
February 18, 2026, 2:30 pm – 3:30 pm (America/Chicago)
Location:
DSI 105, 5460 S University, Chicago, IL
Title:
Shreya Shankar (UC Berkeley): Building Effective Unstructured Data Systems

Abstract: Databases and other data systems have successfully democratized data-oriented computation across domains, thanks to decades of research in system internals and end-user interfaces. However, such systems center on structured (i.e., tabular) data; unstructured data—the vast majority of data—has largely been ignored. Large language models (LLMs) now give us a building block for unstructured data analysis, and we face the same questions as in the early days of data systems—e.g., how should users author queries? How do we efficiently execute queries at scale?—but many well-established tenets from traditional data systems no longer hold. In my talk, I will present DocETL, a system I developed for unstructured data analysis. I will discuss how we had to rethink query optimization under these new assumptions, optimizing user-written pipelines for both accuracy and efficiency—as well as end-user interfaces for authoring, iterating on, and debugging pipelines. DocETL is open-source with 3.5k+ GitHub stars; our hosted interface has supported 4.1k+ pipelines across 30+ S&P-500 industries. Query optimization ideas from our work have been adopted in databases such as Snowflake and BigQuery, and our interface design principles have been adopted by companies like LangChain and OpenAI.

Speakers

Shreya Shankar

PhD student, UC Berkeley

Shreya Shankar is a fifth and final-year PhD student in the Data Systems and Foundations group at UC Berkeley, advised by Dr. Aditya Parameswaran. She is broadly interested in data systems, large language models, and human-computer interaction. Her PhD has been supported by an NDSEG Fellowship and a Bridgewater Research Fellowship, and her work has been recognized with EECS Rising Stars (2025) and a best paper honorable mention award at UIST. Beyond her research, Shreya authored the curriculum and companion book for AI Evals for Engineers and PMs, an industry course on evaluating AI applications taken by 4,000+ professionals from 500+ companies, including 50+ students each from Google, Microsoft, OpenAI, Meta, Amazon, Intuit, and First American. Before her PhD, Shreya worked as the first data/ML engineer at a startup after her undergraduate degree in CS at Stanford.
