FOSDEM 2021
/
Schedule
/
Tracks
/
Developer rooms
/
HPC, Big Data and Data Science

HPC, Big Data and Data Science devroom

Room: D.hpc
Calendar: iCal, xCal
Video with Q&A: D.hpc
Video only: D.hpc
Chat: Join the conversation!

	09												10												11												12												13												14												15												16												17												18
Saturday													Accelerating HPC applications with Out-of-Order Commit Processors						HPC Container Engine State-of-Art Base-line our understanding of what the execution of HPC containers looks like in 2021.						Faster Spark SQL: Adaptive Query Execution in Spark v3						DiscoPoP: A tool to identify parallelization opportunities in sequential programs and suggest OpenMP constructs and clauses																																										Lessons in Programming Model Comparisons Using OpenMP and CUDA for Targetting GPUs						EESSI: One Scientific Software Stack to Rule Them All						XALT: Lessons from attaching to almost every program in Linux						Hitchhiker's guide from traditional HPC Cluster to containerized ensemble run at scale How to lift GROMACS into a cloudy SLURM cluster and evolve to run GROMACS globally using containers.						Guix Workflow Language Extending a reproducible software deployment system for HPC
Sunday													Deploying Containerized Applications on Secure Large Scale HPC Production Systems.						Scalable, Automated ML Model Monitoring with KFServing and Hopsworks						Make life easier for big data users on Arm platform						Getting ready for the AMD GPUs Introduction to AMD ecosystem						GPU Computing Made Simple with the C++ Vulkan SDK & the C++ Kompute Framework (AMD, Qualcomm, NVIDIA & Friends)																																				An Environment for Interactive Parallel Programming with MPI and OpenMP						Analyzing Performance Profiles using Hatchet						buildtest: HPC Testing Framework for Acceptance Testing						Open Source HPC Research Tools at the Institute for Scientific Computing						Flux: Solving Exascale Workflow and Resource Challenges Plus - How Open-Source Drives Our Project Design

Read the Call for Papers at https://lists.fosdem.org/pipermail/fosdem/2020q4/003112.html.

High Performance Computing (HPC) and Big Data are two important approaches to scientific computing. HPC typically deals with smaller, highly structured data sets and huge amounts of computation while Big Data, not surprisingly, deals with gigantic, unstructured data sets and focuses on the I/O bottlenecks. With the Big Data trend unlocking access to an unprecedented amount of data, Data Science has emerged to tackle the problem of creating processes and approaches to extracting knowledge or insights from these data sets. Machine learning and predictive analytics algorithms have joined the family of more traditional HPC algorithms and are pushing the requirements of cluster and data scalability.

Free and Open Source communities have been the foundation of the HPC and Big Data communities for some time. In the HPC community, it should be no surprise that currently 100% of the Top500 supercomputers in the world run (some variant of) Linux. On the Big Data side, the Hadoop ecosystem has had a tremendous amount of Open Source contributions from a wide range of organizations coming together under the Apache Software Foundation.

Our goal is to bring the communities together, share expertise, learn how we can benefit from each other's work and foster further joint research and collaboration. We welcome talks about Free and Open Source solutions to the challenges presented by large scale computing, data management and data analysis.

Event		Speakers	Start	End
Saturday
	Accelerating HPC applications with Out-of-Order Commit Processors	Ali Hajiabadi	10:00	10:30
	HPC Container Engine State-of-Art Base-line our understanding of what the execution of HPC containers looks like in 2021.	Christian Kniep	10:30	11:00
	Faster Spark SQL: Adaptive Query Execution in Spark v3	Nicolas Poggi	11:00	11:30
	DiscoPoP: A tool to identify parallelization opportunities in sequential programs and suggest OpenMP constructs and clauses	Mohammad Norouzi	11:30	12:00
	Lessons in Programming Model Comparisons Using OpenMP and CUDA for Targetting GPUs	Alaina Edwards	15:00	15:30
	EESSI: One Scientific Software Stack to Rule Them All	Bob Dröge	15:30	16:00
	XALT: Lessons from attaching to almost every program in Linux	Robert McLay	16:00	16:30
	Hitchhiker's guide from traditional HPC Cluster to containerized ensemble run at scale How to lift GROMACS into a cloudy SLURM cluster and evolve to run GROMACS globally using containers.	Carsten Kutzner	16:30	17:00
	Guix Workflow Language Extending a reproducible software deployment system for HPC	Ricardo Wurmus	17:00	17:30
Sunday
	Deploying Containerized Applications on Secure Large Scale HPC Production Systems.	David Brayford	10:00	10:30
	Scalable, Automated ML Model Monitoring with KFServing and Hopsworks	Javier de la Rúa Martínez	10:30	11:00
	Make life easier for big data users on Arm platform	Zhenyu Zheng	11:00	11:30
	Getting ready for the AMD GPUs Introduction to AMD ecosystem	Georgios Markomanolis	11:30	12:00
	GPU Computing Made Simple with the C++ Vulkan SDK & the C++ Kompute Framework (AMD, Qualcomm, NVIDIA & Friends)	Alejandro Saucedo	12:00	12:30
	An Environment for Interactive Parallel Programming with MPI and OpenMP	Christian Terboven, Jonas Hahnfeld	15:00	15:30
	Analyzing Performance Profiles using Hatchet	Abhinav Bhatele	15:30	16:00
	buildtest: HPC Testing Framework for Acceptance Testing	Shahzeb Siddiqui	16:00	16:30
	Open Source HPC Research Tools at the Institute for Scientific Computing	Jan-Patrick Lehr, Tim Jammer, Michael Burger, Alexander Hück	16:30	17:00
	Flux: Solving Exascale Workflow and Resource Challenges Plus - How Open-Source Drives Our Project Design	Stephen Herbein	17:00	17:30

FOSDEM21

Online / 6 & 7 February 2021

HPC, Big Data and Data Science devroom

Saturday

Sunday

FOSDEM

This year

Practical information

Media and press