Skip to main content
Version: 5.0

Troubleshooting

Description

The Troubleshooting section of the module is intended to provide data capable of assisting with identifying potential points of failure on the cluster.

Kafka monitoring troubleshooting

Kafka monitoring troubleshooting

Kafka monitoring troubleshooting

Displayed Data

  • metric charts for failed requests
    • failed requests from producers for every broker per second
    • failed sampling requests for every broker per second
  • metric charts for general cluster load
    • overall request rates for every broker per second
    • consumer request rates per every broker per second
    • controller request rates per every broker per second (controller and broker-side)
  • metric charts for operations in queue
    • sampling requests waiting in queue
    • producer requests waiting in queue
    • shrink and expansion rates of in-sync replicas per second
  • general metric charts
    • rate and time of log flush operations in ms
    • leader election rates on the cluster

Dashboard List

  • Failed Requests from Producers per Broker, rqtps
  • Failed Sampling Requests per Broker, rqtps
  • Total Requests per Broker
  • Consumer Requests per Broker, rqtps
  • Follower Requests per Broker, rqtps (Broker Side)
  • Follower Requests per Broker, rqtps (Controller Side)
  • Requests Waiting in Sampling Queue
  • Requests Waiting in Producer Queue
  • ISR Operations per Second, opps
  • Log Flush Stats
  • Leader Election Rate