You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/integrations/data-ingestion/kafka/kafka-clickhouse-connect-sink.md
+79-6Lines changed: 79 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -310,20 +310,93 @@ For additional details check out the official [tutorial](https://docs.confluent.
310
310
311
311
ClickHouse Kafka Connect reports runtime metrics via [Java Management Extensions (JMX)](https://www.oracle.com/technical-resources/articles/javase/jmx.html). JMX is enabled in Kafka Connector by default.
The connector integrates with the Kafka Connect framework and exposes metrics for task lifecycle and error tracking.
362
+
363
+
**Task Status Metrics:**
364
+
-`task-count`: Total number of tasks in the connector
365
+
-`running-task-count`: Number of tasks currently running
366
+
-`paused-task-count`: Number of tasks currently paused
367
+
-`failed-task-count`: Number of tasks that have failed
368
+
-`destroyed-task-count`: Number of destroyed tasks
369
+
-`unassigned-task-count`: Number of unassigned tasks
370
+
371
+
Task status values include: `running`, `paused`, `failed`, `destroyed`, `unassigned`
372
+
373
+
**Error Metrics:**
374
+
-`deadletterqueue-produce-failures`: Number of failed DLQ writes
375
+
-`deadletterqueue-produce-requests`: Total DLQ write attempts
376
+
-`last-error-timestamp`: Timestamp of the last error
377
+
-`records-skip-total`: Total number of records skipped due to errors
378
+
-`records-retry-total`: Total number of records that were retried
379
+
-`errors-total`: Total number of errors encountered
380
+
381
+
**Performance Metrics:**
382
+
-`offset-commit-failures`: Number of failed offset commits
383
+
-`offset-commit-avg-time-ms`: Average time for offset commits
384
+
-`offset-commit-max-time-ms`: Maximum time for offset commits
385
+
-`put-batch-avg-time-ms`: Average time to process a batch
386
+
-`put-batch-max-time-ms`: Maximum time to process a batch
387
+
-`source-record-poll-total`: Total records polled
388
+
389
+
#### Monitoring Best Practices {#monitoring-best-practices}
390
+
391
+
1.**Monitor Consumer Lag**: Track `records-lag` per partition to identify processing bottlenecks
392
+
2.**Track Error Rates**: Watch `errors-total` and `records-skip-total` to detect data quality issues
393
+
3.**Observe Task Health**: Monitor task status metrics to ensure tasks are running properly
394
+
4.**Measure Throughput**: Use `records-send-rate` and `byte-rate` to track ingestion performance
395
+
5.**Monitor Connection Health**: Check node-level connection metrics for network issues
396
+
6.**Track Compression Efficiency**: Use `compression-rate` to optimize data transfer
397
+
398
+
For detailed JMX metric definitions and Prometheus integration, see the [jmx-export-connector.yml](https://github.com/ClickHouse/clickhouse-kafka-connect/blob/main/jmx-export-connector.yml) configuration file.
0 commit comments