Welcome to the Webinar Series: Towards Data Lakehouse Architecture
The topic for the third webinar of the TDLA series is: Data Lakehouse Storage Layer - openness, interoperability, and performance.
Open-source technologies have shaped the modern data ecosystem: file formats like Apache Parquet, table formats like Apache Iceberg and Delta Lake, and query engines such as Apache Spark and Apache Flink are now widely recognized as industry standards. But what about the core of the Open Data Lakehouse, the data catalog? Are there production-ready open-source projects that we can confidently rely on when building a platform?
In this webinar, we will break down the concept of a data catalog, highlight emerging open-source projects that are gaining momentum, and explore whether they are ready to serve as the backbone of a modern data platform.
During the webinar, you will find out:
- What is a data catalog in the context of a Data Lakehouse?
- What are its key features and main components?
- What are the most popular open-source examples of data catalogs?
- What is the role of data catalogs in data management?
- How does the data catalog secure access to data?
When: October 07, 15:00 CEST; 14:00 BST; 09:00 EDT
Duration: 1h, Online on Zoom
Meet the Speakers:
Marek Wiewiórka
Chief Data Architect, Xebia
Assistant at Warsaw University of Technology
Marek is a seasoned Big Data and Cloud Architect with 15+ years of experience in designing and implementing modern data and MLOps platforms. Currently, he is the Chief Data Architect at Xebia and a Research Assistant at Warsaw University of Technology, putting the finishing touches on his PhD dissertation. Privately, he is a keen long-distance runner and gravel bike enthusiast, and is absolutely in love with the Italian Lakes!
Radosław Szmit
Data Platform Architect, Xebia
Experienced Data Platform Architect with over 11 years in designing and implementing scalable data solutions. Proficient in data transformation and advanced analytics, database management, stream processing and cloud solutions. Key achievements include leading successful data platform implementations and driving data migration projects. Big Data trainer, blogger and conference speaker.
Towards Data Lakehouse Architecture
Data Lakehouse seems to be the new buzzword: everyone wants it, but does everyone need it? In many cases, a Data Lakehouse will be a perfect solution, but not always, and not just any Data Lakehouse. In this Webinar Series, we take a closer look at the Data Lakehouse concept: what problems it addresses and how to design its architecture so that it lives up to its holy-grail reputation.
Watch previous TDLA Webinars: