CLOUD COMPUTING SEMINAR SERIES
Speaker : Alok Pareek, Striim Inc.
Title : Striim: a Next generation distributed streaming platform.
Date & Time : January 17, 2025 (Friday), 03:00 PM
Venue : # 102, CDS Seminar Hall
Abstract: Striim is a novel end-to-end distributed streaming ETL and intelligence platform that enables rapid development and deployment of real time streaming applications. A key area of innovation is to process database events in a continuous streaming manner while accommodating for schema drift, and failures. The real-time continuous query processing engine has been architected from ground-up to enable simple, distributed, stateful management of events with E1P semantics. This talk will cover core architectural choices of the real time engine (i) built-in adapters, and caching structures to extract and load data in real-time from legacy and new cloud endpoints (ii) an extensible SQL-based transformation engine to transform events; users can inject custom logic via a component called Open Processor (iv) New primitives like MODIFY, BEFORE and AFTER to manage database change records and (v) built-in data validation that continuously checks if everything is continually making it to the destination. Finally, we will cover the platforms novel LLM extensions for continuous sensitive data detection. Real world implementation challenges and future open problems will be presented.
Guest Speaker Bio: Alok Pareek is Founder and head of engineering at Striim, a Palo Alto based software startup. Alok started his career as a developer in the Oracle kernel development team where he contributed to core redo generation algorithms, point in time media recovery, and high-speed data movement algorithms for ten years. Subsequently, Alok served as CTO and software architect at GoldenGate software, the industry leader in heterogeneous database replication. GoldenGate was acquired by Oracle in 2009. Post-acquisition, Alok led the product strategy for Oracle’s data integration software product portfolio including GoldenGate. Alok also led the engineering and performance teams that collaborated with strategic customers on architecture, and real-world implementations. He holds multiple patents in data management and has presented at numerous academic and industry conferences. Alok holds a graduate degree in Computer Science from Stanford University.
Host Faculty: Prof. Yogesh Simmhan
About: The IBM-IISc Hybrid Cloud Lab (IIHCL) hosted at IISc is curating the Cloud Computing Seminar series with guest speakers from Industry and Academia speaking about the latest technologies and research on Cloud and edge computing, distributed computing systems, and AI/ML/Big Data platforms.
ALL ARE WELCOME