Amazon DynamoDB CDC


Amazon DynamoDB CDC Overview

Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability. DynamoDB lets you offload the administrative burdens of operating and scaling a distributed database so that you don't have to worry about hardware provisioning, setup and configuration, replication, software patching, or cluster scaling. DynamoDB also offers encryption at rest, which eliminates the operational burden and complexity involved in protecting sensitive data. With DynamoDB, you can create database tables that can store and retrieve any amount of data and serve any level of request traffic. You can scale up or scale down your tables' throughput capacity without downtime or performance degradation. You can use the AWS Management Console to monitor resource utilization and performance metrics.

The traditional approach to syncing DynamoDB with complementary data stores is batch-based: pipelines periodically extract data from DynamoDB and send it to downstream data stores. Change data capture (CDC) is a modern alternative to these inefficient bulk imports. CDC extracts change events (INSERTs, UPDATEs, and DELETEs) from data stores such as DynamoDB in real time and delivers them to downstream systems, where they can be applied to a copy of the data.
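To make the idea of applying change events to a copy concrete, here is a minimal Python sketch. The event shape (`op`, `key`, `item`) and the sample user records are assumptions made up for illustration; they are not the format DynamoDB or any particular CDC tool emits.

```python
from typing import Any


def apply_change(replica: dict[str, dict[str, Any]], event: dict[str, Any]) -> None:
    """Apply one change event to an in-memory copy keyed by primary key.

    The event layout used here is hypothetical, chosen only for illustration.
    """
    op = event["op"]
    key = event["key"]
    if op in ("INSERT", "UPDATE"):
        # Inserts and updates both carry the full new item in this sketch.
        replica[key] = event["item"]
    elif op == "DELETE":
        # Deleting a key that is already absent is treated as a no-op.
        replica.pop(key, None)
    else:
        raise ValueError(f"unknown operation: {op}")


replica: dict[str, dict[str, Any]] = {}
events = [
    {"op": "INSERT", "key": "user#1", "item": {"name": "Ada", "plan": "free"}},
    {"op": "INSERT", "key": "user#2", "item": {"name": "Lin", "plan": "pro"}},
    {"op": "UPDATE", "key": "user#1", "item": {"name": "Ada", "plan": "pro"}},
    {"op": "DELETE", "key": "user#2"},
]

# Events must be applied in the order they occurred for the copy to converge.
for event in events:
    apply_change(replica, event)

print(replica)
```

Only the four changes travel downstream, rather than a full re-export of the table, which is the efficiency gain over batch extraction.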

Decodable + DynamoDB CDC

Decodable can use a DynamoDB database as a source by way of a Debezium server, which converts table changes into a stream of change records. This makes Decodable a convenient way to replicate the contents of a DynamoDB database into a range of systems, including messaging platforms such as Kafka, analytical databases such as Apache Pinot, and machine learning feature stores. Decodable's SQL-on-the-stream transformation process also includes data virtualization on change streams, so your queries run against the current state of the table rather than the raw change records.
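The difference between processing the table and processing the change records can be sketched in a few lines of Python. This is not Decodable's implementation. The records follow the general Debezium envelope (`op` of `c`, `r`, `u`, or `d`, with `before` and `after` row images), but the `order_id` and `status` fields are hypothetical, and the exact record shape for a DynamoDB source is an assumption.

```python
from collections import Counter
from typing import Any, Optional

Row = dict[str, Any]


def materialize(records: list[dict[str, Optional[Row]]], key_field: str = "order_id") -> dict[Any, Row]:
    """Fold Debezium-style change records into the current table state."""
    table: dict[Any, Row] = {}
    for rec in records:
        op = rec["op"]
        if op in ("c", "r", "u"):
            # create, snapshot read, update: the 'after' image is the new row
            row = rec["after"]
            table[row[key_field]] = row
        elif op == "d":
            # delete: only the 'before' image is present
            table.pop(rec["before"][key_field], None)
    return table


change_records = [
    {"op": "c", "before": None, "after": {"order_id": 1, "status": "pending"}},
    {"op": "c", "before": None, "after": {"order_id": 2, "status": "pending"}},
    {"op": "u", "before": {"order_id": 1, "status": "pending"},
     "after": {"order_id": 1, "status": "shipped"}},
    {"op": "d", "before": {"order_id": 2, "status": "pending"}, "after": None},
]

# Query against the virtualized table: one order remains, and it is shipped.
table = materialize(change_records)
status_counts = Counter(row["status"] for row in table.values())

# The same query run naively over the raw change records counts stale versions.
naive_counts = Counter(rec["after"]["status"] for rec in change_records if rec["after"])

print(status_counts)
print(naive_counts)
```

Counting over the raw records reports two pending orders that no longer exist in that state, while the materialized view reflects only the rows currently in the table.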