Distributed Data Management (ST 2021) - tele-TASK

tele-TASK

An Education podcast

The free lunch is over! Until the turn of the century, computer systems became steadily faster without any particular effort, simply because the hardware they ran on increased its clock speed with every new release. This trend has ended, and today's CPUs stall at around 3 GHz. The size of modern computer systems in terms of contained transistors (cores in CPUs/GPUs, CPUs/GPUs in compute nodes, compute nodes in clusters), however, still increases constantly. This has caused a paradigm shift in writing software: instead of optimizing code for a single thread, applications now need to solve their tasks in parallel to achieve noticeable performance gains. Distributed computing, i.e., the distribution of work across (potentially) physically isolated compute nodes, is the most extreme method of parallelization.

Big data analytics and management are multi-million dollar markets that grow constantly! The ability to control and utilize large amounts of data is the most valuable ability of today's computer systems. Because data volumes grow so rapidly, and with them the complexity of the questions they should answer, data analytics, i.e., the ability to extract any kind of information from the data, becomes increasingly difficult. As data analytics systems cannot count on faster hardware to solve their performance problems, they need to embrace new software trends that let their performance scale with the still increasing number of processing elements.

In this lecture, we take a look at various technologies involved in building distributed, data-intensive systems. We start by discussing fundamental concepts in distributed computing, such as data models, encoding formats, messaging, data replication and partitioning, fault tolerance, and batch and stream processing. In between, we consider different practical systems from the Big Data landscape, such as Akka and Spark. In the end, we concentrate on data management aspects, such as distributed database management system architectures and distributed query optimization.

Podcast Details

Created by: tele-TASK
Podcast Status: Idle
Started: Apr 12th, 2021
Latest Episode: Jul 21st, 2021
Release Period: 3 per week
Episodes: 26
Avg. Episode Length: About 1 hour
Explicit: No
Language: English
