11:10-11:40 | Architecting Applications With Multiple Open-Source Big Data Technologies by Paul Brebner |
11:50-12:20 | Gravitino: A multi-regional, geo-distributed meta datalake by Justin Mclean |
12:30-13:00 | Enhancing Flexibility and Productivity with Access Patterns and Storage-Agnostic Abstractions by Jan Lukavský |
14:00-14:30 | Modern Data Orchestrators by Riccardo Amadio |
14:40-15:10 | Apache SIS library for geospatial applications by Martin Desruisseaux |
15:20-15:50 | Hive-Iceberg - Breaking the Ice: A Closer Look at Hive-Iceberg by Simhadri Govindappa & Attila Turóczy |
16:10-16:40 | Orchestrating Scalable Data Pipelines with Apache Toree, YuniKorn, Spark, and Airflow by Luciano Resende & Hongyue Zhang |
16:50-17:20 | Anatomy of reading Apache Parquet files (from the Apache Impala perspective) by Csaba Ringhofer & Daniel Becker |