Back
December 19, 2024
5
min read

Checkpoint Chronicle - December 2024

By
Robin Moffatt
Share this post

Welcome to the Checkpoint Chronicle, a monthly roundup of interesting stuff in the data and streaming space. Your hosts and esteemed curators of said content are Gunnar Morling, Robin Moffatt (your editor-in-chief for this edition), and Hans-Peter Grahsl. Feel free to send our way any choice nuggets that you think we should feature in future editions.

Stream Processing, Streaming SQL, and Streaming Databases

Event Streaming

  • Excellent research and analysis from RedMonk’s Kate Holterhoff in Why Message Queues Endure: A History
  • Strimzi 0.44 added auto-rebalancing on cluster scaling—this article from Paolo Patierno has a good overview of how to use it. If you want to know more also check out this KubeCon NA 2024 conference talk from Jakub Scholz.
  • KIP-853 added dynamic controller quorum to Kafka in 3.9—this article from Federico Valeri and Luke Chen at RedHat explains how KRaft clusters can now scale controller nodes without downtime.
  • An excellent exploration of Apache Kafka Internals from Moncef Abboud. I love this kind of approach to learning more about a technology—writing about it in public is not only a great way to improve one’s own understanding, but to help others too.
  • Gohlay is a tool for Kafka for scheduling messages. I was pretty intrigued by it, and its author gave some really useful background about it in this Reddit post.
  • Apache Kafka 4.0 is scheduled for release late January/early February, and the feature freeze just passed. One of the most significant changes is the removal of ZooKeeper and marking KRaft as production ready (KIP-833). You can find the release plan here, and also see a list of included JIRAs.

Data Ecosystem

Data Platforms and Architecture

RDBMS and Change Data Capture

  • Analysis of how Postgres’s OLAP performance has improved over time, by Tomas Vondra. It uses TPC-H and shows a 4x improvement between Postgres 8 and 18.
  • I had fun exploring Flink CDC—it’s a pretty cool thing! Also check out details of WikiMedia’s plans for using it too.
  • A useful look at how Postgres and Debezium work together, from Arijit Mazumdar.
  • Gunnar wrote up details of using failover replication slots in Postgres 17, including a nice demo of their use with Decodable’s Postgres CDC connector.

Papers of the Month

A couple of papers from Amazon to look at this month. First up, their 2022 paper describing improvements made to Redshift since its launch in 2013, and then their paper from earlier this year analysing the workloads seen on Amazon Redshift and looking at how they differ from the canonical TPC simulated workloads often used for benchmarking.

Events & Call for Papers (CfP)

  • NDC London (London, UK) January 27-31
  • Current 2025 (Bangalore, India) March 19 (CfP open until December 19)
  • Current 2025 (London, UK) May 20-21
  • Current 2025 (New Orleans, LA) October 29-30

New Releases


That’s all for this month! We hope you’ve enjoyed the newsletter and would love to hear about any feedback or suggestions you have.

Gunnar (LinkedIn / Bluesky / X / Mastodon / Email)
Robin (LinkedIn / Bluesky / X / Mastodon / Email)
Hans-Peter (LinkedIn / Bluesky / X / Mastodon / Email)

📫 Email signup 👇

Did you enjoy this issue of Checkpoint Chronicle? Would you like the next edition delivered directly to your email to read from the comfort of your own home?

Simply enter your email address here and we'll send you the next issue as soon as it's published—and nothing else, we promise!

👍 Got it!
Oops! Something went wrong while submitting the form.
Robin Moffatt

Robin is a Principal DevEx Engineer at Decodable. He has been speaking at conferences since 2009 including QCon, Devoxx, Strata, Kafka Summit, and Øredev. You can find many of his talks online and his articles on the Decodable blog as well as his own blog.

Outside of work, Robin enjoys running, drinking good beer, and eating fried breakfasts—although generally not at the same time.

Welcome to the Checkpoint Chronicle, a monthly roundup of interesting stuff in the data and streaming space. Your hosts and esteemed curators of said content are Gunnar Morling, Robin Moffatt (your editor-in-chief for this edition), and Hans-Peter Grahsl. Feel free to send our way any choice nuggets that you think we should feature in future editions.

Stream Processing, Streaming SQL, and Streaming Databases

Event Streaming

  • Excellent research and analysis from RedMonk’s Kate Holterhoff in Why Message Queues Endure: A History
  • Strimzi 0.44 added auto-rebalancing on cluster scaling—this article from Paolo Patierno has a good overview of how to use it. If you want to know more also check out this KubeCon NA 2024 conference talk from Jakub Scholz.
  • KIP-853 added dynamic controller quorum to Kafka in 3.9—this article from Federico Valeri and Luke Chen at RedHat explains how KRaft clusters can now scale controller nodes without downtime.
  • An excellent exploration of Apache Kafka Internals from Moncef Abboud. I love this kind of approach to learning more about a technology—writing about it in public is not only a great way to improve one’s own understanding, but to help others too.
  • Gohlay is a tool for Kafka for scheduling messages. I was pretty intrigued by it, and its author gave some really useful background about it in this Reddit post.
  • Apache Kafka 4.0 is scheduled for release late January/early February, and the feature freeze just passed. One of the most significant changes is the removal of ZooKeeper and marking KRaft as production ready (KIP-833). You can find the release plan here, and also see a list of included JIRAs.

Data Ecosystem

Data Platforms and Architecture

RDBMS and Change Data Capture

  • Analysis of how Postgres’s OLAP performance has improved over time, by Tomas Vondra. It uses TPC-H and shows a 4x improvement between Postgres 8 and 18.
  • I had fun exploring Flink CDC—it’s a pretty cool thing! Also check out details of WikiMedia’s plans for using it too.
  • A useful look at how Postgres and Debezium work together, from Arijit Mazumdar.
  • Gunnar wrote up details of using failover replication slots in Postgres 17, including a nice demo of their use with Decodable’s Postgres CDC connector.

Papers of the Month

A couple of papers from Amazon to look at this month. First up, their 2022 paper describing improvements made to Redshift since its launch in 2013, and then their paper from earlier this year analysing the workloads seen on Amazon Redshift and looking at how they differ from the canonical TPC simulated workloads often used for benchmarking.

Events & Call for Papers (CfP)

  • NDC London (London, UK) January 27-31
  • Current 2025 (Bangalore, India) March 19 (CfP open until December 19)
  • Current 2025 (London, UK) May 20-21
  • Current 2025 (New Orleans, LA) October 29-30

New Releases


That’s all for this month! We hope you’ve enjoyed the newsletter and would love to hear about any feedback or suggestions you have.

Gunnar (LinkedIn / Bluesky / X / Mastodon / Email)
Robin (LinkedIn / Bluesky / X / Mastodon / Email)
Hans-Peter (LinkedIn / Bluesky / X / Mastodon / Email)

📫 Email signup 👇

Did you enjoy this issue of Checkpoint Chronicle? Would you like the next edition delivered directly to your email to read from the comfort of your own home?

Simply enter your email address here and we'll send you the next issue as soon as it's published—and nothing else, we promise!

Robin Moffatt

Robin is a Principal DevEx Engineer at Decodable. He has been speaking at conferences since 2009 including QCon, Devoxx, Strata, Kafka Summit, and Øredev. You can find many of his talks online and his articles on the Decodable blog as well as his own blog.

Outside of work, Robin enjoys running, drinking good beer, and eating fried breakfasts—although generally not at the same time.