1,143 questions
1
vote
1
answer
76
views
Understanding Debezium Snapshot
Debezium’s PostgreSQL snapshot docs note :
always
The connector performs a snapshot every time that it starts. The snapshot includes the structure and data of the captured tables. Specify this value ...
0
votes
1
answer
69
views
Capture Oracle metadata via Debezium connector
I'm sourcing Oracle data into Kafka topics using Debezium Oracle connector.
I have a source Oracle table, as below:
CREATE TABLE MYDATA.MY_USER
(
USER_ID RAW(16) DEFAULT SYS_GUID() NOT ...
0
votes
1
answer
90
views
Setup Kafka Consumers to process Debezium snapshot before current events
I am working with Java/Spring/Kafka on a new system that will consume events from both compacted topics and standard non compacted CDC topics. The events on these topics are published by a Debezium ...
0
votes
1
answer
62
views
Google Cloud PubSub Synchronous Pull: Getting One Message at a Time Instead of Batches
Problem
I'm using Google Cloud PubSub with synchronous pull and ordering enabled to consume messages from a Debezium Server setup, but I'm only receiving one message at a time instead of getting ...
0
votes
1
answer
127
views
Custom Debezium image falling with FileNotFoundException: /opt/kafka/custom-config/log4j.properties
I am trying to get debezium up and running on EKS. However i keep running into errors.
Below is the dockerfile which i have created to build the image and push it to ECR
FROM quay.io/strimzi/kafka:...
0
votes
1
answer
126
views
Cannot create second debezium connector to the same oracle DB
First, I know it is not suggested to run multiple connectors to the same DB but we have to do that in current ciscumstances, at least for a while.
Soo in current scenario, we have two separate Azure ...
0
votes
0
answers
37
views
Debezium initial snapshot on demand
I have a debezium 3.0 in my python project. Postgres-debezium connector has the following structure:
{
"name": "dbz_name",
"config": {
"connector.class&...
0
votes
0
answers
29
views
Confluent S3 Sink Connector producing millions of small files despite large flush.size and rotate.interval
The S3 connector creates approximately 1 million tiny files per hour instead of the expected larger files. Each file contains only a few records (~3KB)
RootCause :
Two Debezium CDC connectors (owned ...
0
votes
0
answers
47
views
Restart Debezium by clicking a button on the web application In the Kubernetes environment
I have 3 pods running in a Kubernetes environment. The three pods are as follows:
(1) Web application pod
(2) Oracle database
(3) Debezium
The web application is connected to an Oracle database, ...
0
votes
1
answer
70
views
How to solve undo_retention error while migration data via Debezium from table without primary key?
We are using Debezium to process migration from Oracle database to PostgreSQL.
Oracle contains a large table with hundreds of millions of rows without a primary key.
After 2-3 days of collecting data ...
2
votes
1
answer
138
views
Debezium + Flink Oracle CDC - "db history topic or its content is fully or partially missing" for some tables
I am using Flink with Debezium to consume CDC changes from Oracle DB tables via LogMiner.
For some tables, everything works fine. For example, the following table works without issues:
CREATE TABLE ...
0
votes
0
answers
66
views
Flink SQL Job: com.starrocks.data.load.stream.exception.StreamLoadFailException: Could not get load state because
I'm encountering a Flink job failure and would appreciate any input on what might be misconfigured:
2025‑07‑28 17:30:52
org.apache.flink.runtime.JobException: Recovery is suppressed by ...
0
votes
0
answers
53
views
Debezuim ui create connector
I have the following docker-compose yaml:
version: '3'
networks:
kafka-network:
driver: bridge
services:
zookeeper:
image: confluentinc/cp-zookeeper:7.6.0
environment:
...
0
votes
2
answers
88
views
how have CDC on redis?
I'm using CDC services like (Debezium) on my Mongo or Postgres but somehow I came up with situation that I need have to CDC on Redis . for example get streams of event occur in Redis like adding new ...
0
votes
0
answers
67
views
Unable to ingest MongoDB data with its schema in s3 using Kafka debezium connectors
I am trying to ingest data from MongoDB to S3 in parquet format, along with the schema registered in the schema registry. The Setup is as follows:
MSK -> EC2(EKS)
Below is my MongoDB Source ...
2
votes
0
answers
68
views
Getting prefixed header when I after data is loaded to questdb using kafka and debezium from postgresql
I successfully load data to questdb from kafka topic, however my columns are prefixed with source_, after_. how can I remove these prefix and get only values from the after key of the message from the ...
0
votes
0
answers
102
views
how to solve debezium Kafka Connect vulnerability
Our team noticed that the current Debezium images (e.g., debezium/connect) still include Apache Kafka 3.9.0, which is affected by several recently disclosed security vulnerabilities:
CVE-2025-27817
...
0
votes
0
answers
58
views
how to add NEW tables to the mysql source kafka debezium connector
I am using AWS MSK cluster and I have created a MySQL debezium connector that runs on EC2 instance and reads from a specific table, and it is working fine, but it could be the use case where the user ...
0
votes
1
answer
143
views
KafkaConnector fails to auto-create topics while KafkaConnect creates its own successfully (Strimzi + Debezium + Avro + Oracle)
I'm working in a Kubernetes environment where we deploy Kafka and related services using a custom Helm chart. Here's the stack:
Kafka via Strimzi Operator: 0.46.0, deployed in KRaft mode, version 4.0....
0
votes
0
answers
62
views
Docker Yugabyte CDC with Debezium not able to start the connector
I am trying to setup a sample CDC for YugabyteDB using Debezium connector in my local.
The connect and DB containers are able to start but when I try to register the connector it fails
On trying to ...
0
votes
2
answers
113
views
MongodDB Debezium connector issue with partial avro schema
I am encountering issue when trying to CDC data from MongoDB source to kafka using Debezium connector with confluent schema registry. I only want the CDC data to have some of the fields which are ...
0
votes
0
answers
59
views
CDC PostgreSQL with Debezium
We have a requirement where we want to sync changes occurring in specific tables of 2 separate systems. For one of the system we are looking to use Debezium PostgreSQL source.
We wanted to check for a ...
0
votes
1
answer
210
views
Kafka Connect 4.0.0 and Debezium 2.7
I get the following error when I run kafka when i run connect-standalone.sh. I've ensured that I'm using Kafka 4.0.0, as well as having the correct jars for debezium in the plugins directory. I've ...
0
votes
0
answers
44
views
Stop automatic conversion of empty date values or null values to EPOCH in debezium
I am using mysql as the source database and Debezium mysql connector to get the values to kafka.
The issue is that there are SMT's written for date conversion but theres one specific case where the ...
0
votes
0
answers
69
views
Dead letter storage and failure recovery in Debezium standalone server with HTTP sink
I have set up Debezium's standalone server and configured it to use HTTP client as sink and SQL Server as source, the problem is that if the HTTP server is not responding, the entire Debezium server ...
0
votes
0
answers
10
views
Debezium Connector 2.7.1 version - Communication link failure [duplicate]
I get this error from my debezium connector when the database host reboots
'trace':
'io.debezium.DebeziumException: Error reading MySQL variables: Communications link failure\n\nThe last packet sent ...
0
votes
1
answer
166
views
Debezium convert types
I have a MySQL column which should store a boolean value. My Django driver models this with a TINYINT value. I want to replicate this data in a PostgreSQL but there this column is a BOOLEAN.
The ...
1
vote
1
answer
98
views
Debezium JDBC sink Connector and MSSQL varbinary
I have Debezium 3.0.8. I setup source SQL connector, which produce data from CDC SQL Server (MSSQL). Trying to setup io.debezium.connector.jdbc.JdbcSinkConnector for a table with varbinary field
...
1
vote
0
answers
170
views
Debezium ad hoc snapshot not working because of no maximum key
I'm having problem trying ad hoc snapshot in debezium postgres source connector. I already created the signalling table and add the signal table to my debezium config.
When I tried to insert new row ...
0
votes
1
answer
96
views
Debezium connector failing with `Unexpected Kafka request of type METADATA during SASL handshake` error
We have a Kafka cluster that we're trying to connect Debezium to. We are able to successfully deploy a Producer or Consumer using the following producer.config/consumer.config (these are temporary ...
0
votes
0
answers
109
views
How to Set Table-Specific Ordering Key in Debezium for Google Pub/Sub Sink?
I'm using Debezium Server to capture changes from a PostgreSQL database and publish them to Google Cloud Pub/Sub. I want to set a table-specific ordering key so that messages related to the same ...
0
votes
1
answer
73
views
Debezium MongoDB: Recover after MongoDB is down
We're using Debezium MongoDB-connector with the outbox-router, which works very well. But when the database becomes unavailable, the connector fails without trying to recover.
Task threw an uncaught ...
0
votes
0
answers
186
views
Debezium RabbitMQ outbox event static routing key
I am trying to implement outbox pattern by reading inserts in a table from a MySQL and transforming them into events in RabbitMQ.
My Outbox table contains:
id
uuid
payload
exchange
routing_key
...
0
votes
1
answer
91
views
How to define timestamp in Flink SQL based on Kafka connector with Avro format
I have a Kafka topic that uses messages with value in Avro format with debezium types.
It contains fields defined in Avro format in the following way:
{
"name": "updated",
&...
0
votes
2
answers
341
views
Postgres CDC failure and WAL filling up disk space?
Suppose a CDC data loading tool (Fivetran, Airbyte, Debezium, etc) gets into a failure state and is not successfully running its incremental syncs of a Postgres data source. Also suppose that ...
2
votes
0
answers
116
views
Debezium MySQL Source Connector takes too long to resume streaming on restart (in schema_only mode)
I am using the Debezium MySQL Source Connector (v2.2.1.Final) in a multi-tenant application. As the number of tenant databases increased, the size of the schema history topic grown significantly. ...
0
votes
0
answers
113
views
Debezium mysql connector is capturing schema changes for tables that are not in the table include list
I am using Debebezium CDC with the outbox transformer and we need it to start up as fast as possible. Since we are using the outbox, there is only ever going to be one table that we are interested in (...
2
votes
0
answers
22
views
Can I merge handlebatch in debezium-embedded?
I'm using debezium-embedded 2.7.0(without kafka) to listen mysql binlog data and put in other storage system.
but the handlebatch method of ChangeConsumer excuted too often(every 5 seconds) and just ...
1
vote
1
answer
89
views
Custom transform for Kafka Connect
I get the below error:
[2025-02-10 14:54:53,530] INFO AbstractConfig values:
(org.apache.kafka.common.config.AbstractConfig:370)
[2025-02-10 14:54:53,639] ERROR Failed to create connector for .\kafka-...
0
votes
0
answers
90
views
Error configuring an instance of JdbcSinkConnectorConfig with Postgres-sink-connector
I want to run CDC (Oracle - Kafka - Postgres) using Debezium. I set up kafka.yaml
services:
oracle:
image: gvenzl/oracle-xe:21.3.0
container_name: oracle
ports:
- "1522:1521&...
0
votes
0
answers
68
views
Debezium problem with Array bidimensional
I have a table with an _integer field. It is a two-dimensional array, and when Debezium reads all the fields, this field returns null. For example, the data:
{{23966},{8301}}
How can I solved this ...
0
votes
1
answer
48
views
Event Ordering Across Tables in Kafka
In Kafka, it's generally recommended to map one table to one topic. However, what if the order of events across multiple tables is important for a specific key, such as CompanyId?
For example, ...
0
votes
0
answers
154
views
Kafka Connect Sink Connector
I am using kafka connect with debezium jdbc sink.
My config is
"config": {
"connector.class": "io.debezium.connector.jdbc.JdbcSinkConnector",
"table.name.format&...
1
vote
1
answer
771
views
Debezium connector for Postgres doest not work for Partitioned tables
I set up Postgres->kafka CDC using Debezium connector. Debezium CDC is able publish all changes to Kafka topic public.some_topic for regular postgres tables.
But it is not publishing changes to ...
0
votes
0
answers
57
views
How to Reduce Metadata Queries in Debezium JDBC Sink Connector?
I’m using the Debezium JDBC Sink Connector to ingest data from a Kafka topic into PostgreSQL. The database is experiencing high CPU utilization, primarily due to frequent metadata queries made by the ...
0
votes
0
answers
52
views
Prioritization of Live Changes During Incremental Snapshot in Debezium (PostgreSQL)
During the incremental snapshot process, it is critical for me to ensure that live change events are prioritized and processed near real-time. However, since my Kafka Consumer operates with a fixed ...
0
votes
0
answers
739
views
DebeziumException: Error processing row in XXXX, internal schema size 39, but row size 2 , restart connector with schema recovery mode
I have an set-up with Kafka connector using debezium(2.7) and After updating into debezium as 3.0.
A insertion of new row into table working as expected but Facing below errors, while tries to update ...
0
votes
1
answer
334
views
Why Debezium Mongo Source Kafka Connector produces string `after` field instead of a Json Object?
Here is my configuration I am using -
{
"name": "mongo-debezium-connector",
"config": {
"connector.class": "io.debezium.connector.mongodb....
0
votes
0
answers
163
views
Getting Error: trying to setup connection with GCP managed kafka service with Debezium using docker via compose.yaml?
[org.apache.kafka.clients.admin.AdminClientConfig]
debezium-connect-managed-new | 2024-12-20 18:01:24,017 INFO || Retrieved token with principal sub [org.apache.kafka.common.security.oauthbearer....
0
votes
0
answers
144
views
Debezium Columns with user defined types not available
After configuring debezium to capture events from my table T1 whenever I change a row I get a warning like the following
WARN Column 'myCol1' available in capture table not found among source table ...