Day 2 - Rhapsody

Morning break & Poster sessions

11:10 - 11:40.

Architecting Applications With Multiple Open-Source Big Data Technologies

Photo of images/speakers/paul-brebner.jpg

Paul Brebner

Instaclustr Open Source Technology Evangelist

11:50 - 12:20.

Gravitino: A multi-regional, geo-distributed meta datalake

Photo of images/speakers/justin-mclean.jpg

Justin Mclean

ASF Director, VP ASF Incubator, Datastrato Community Manager

12:30 - 13:00.

Enhancing Flexibility and Productivity with Access Patterns and Storage-Agnostic Abstractions

Photo of images/speakers/jan-lukavsky.jpg

Jan Lukavský

Freelance Data Engineer, Apache Beam committer and PMC member

Lunch

14:00 - 14:30.

Modern Data Orchestrators

Photo of images/speakers/riccardo-amadio.jpg

Riccardo Amadio

Agile Lab, Senior Data Engineer

14:40 - 15:10.

Apache SIS library for geospatial applications

Photo of images/speakers/martin-desruisseaux.jpg

Martin Desruisseaux

Geomatys

15:20 - 15:50.

Hive-Iceberg - Breaking the Ice: A Closer Look at Hive-Iceberg

Photo of images/speakers/simhadri-govindappa.jpg

Simhadri Govindappa

Senior Software Engineer at Cloudera
Photo of images/speakers/attila-turoczy.jpg

Attila Turóczy

Senior Director of Engineering at Cloudera

Afternoon break

16:10 - 16:40.

Orchestrating Scalable Data Pipelines with Apache Toree, YuniKorn, Spark, and Airflow

Photo of images/speakers/luciano-resende.jpg

Luciano Resende

AI Platform Architect
Photo of images/speakers/hongyue-zhang.jpg

Hongyue Zhang

Software Engineer at Apple

16:50 - 17:20.

Anatomy of reading Apache Parquet files (from the Apache Impala perspective)

Photo of images/speakers/csaba-ringhofer.jpg

Csaba Ringhofer

Software engineer at Cloudera
Photo of images/speakers/daniel-becker.jpg

Daniel Becker

Software engineer at Cloudera

Birds of a Feather