Webinar | Towards Data Lakehouse Architecture

03 Clash of The Data Catalogs - Market Leaders vs. Challengers

October 07, 2025
-

Welcome to the Webinar Series: Towards Data Lakehouse Architecture

The topic for the third webinar of the TDLA series is: Data Lakehouse Storage Layer - openness, interoperability, and performance.

Open-source technologies have shaped the modern data ecosystem, with files like Apache Parquet, table formats like Apache Iceberg and Delta Lake, and query engines such as Apache Spark and Apache Flink, now widely recognized as industry standards. But what about the core of the Open Data Lakehouse - the data catalog? Are there production-ready open-source projects that we can confidently rely on when building a platform?

In this webinar, we will break down the concept of a data catalog, highlight emerging open-source projects that are gaining momentum, and explore whether they are ready to serve as the backbone of a modern data platform.

During the webinar, you will find out:

  • What is a data catalog in the context of a Data Lakehouse?
  • What are the key features and main components of it?
  • What are the most popular open-source examples of data catalogs?
  • What is the role of data catalogs in data management?
  • How does the data catalog secure access to data?

When: October 07, 15:00 CEST; 14:00 BST; 09:00 EDT

Duration: 1h, Online on Zoom

 

Marek Wiewiórka

Meet the Speakers:
Marek Wiewiórka

Chief Data Architect, Xebia
Assistant at Warsaw University of Technology

Marek is a seasoned Big Data and Cloud Architect with 15+ years of experience in designing and implementing modern data and MLOps platforms. Currently, he is the Chief Data Architect at Xebia, and a Research Assistant at Warsaw University of Technology, putting the finishing touches to his PhD dissertation. Privately - a keen long-distance runner, gravel bikes enthusiast, and absolutely in love with the Italian Lakes!

 

Radoslaw Szmit

Radosław Szmit

Data Platform Architect, Xebia

Experienced Data Platform Architect with over 11 years in designing and implementing scalable data solutions. Proficient in data transformation and advanced analytics, database management, stream processing and cloud solutions. Key achievements include leading successful data platform implementations and driving data migration projects. Big Data trainer, blogger and conference speaker.

Toward Data Lakehouse Architecture 

Data Lakehouse seems to be a new buzzword; everyone wants it, but does everyone need it? In many cases, Data Lakehouse will be a perfect solution, but not always, and not just any Data Lakehouse. In this Webinar Series, we want to get closer to the Data Lakehouse concept, what problems it addresses, and how to design its architecture to make it the holy grail.

Watch previous TDLA Webinars:

Data Lakehouse Architecture

data lakehouse storage layer

 

 

 

 

 

 

 

 

 

Watch the Webinar