HiveBrain v1.2.0
Get Started
← Back to all entries
patternMinor

Cassandra cluster monitoring

Submitted by: @import:stackexchange-dba··
0
Viewed 0 times
cassandramonitoringcluster

Problem

We using Nagios for our servers monitoring.

Each web-environment have 1 cluster as backend - Cassanra 2.0. Each cluster have 3-5 nodes.

Question is: I want write a plugin for Nagios, for Cassandra's nodes monitoring.
Unfortunately - I'm not so good familiar with Cassandra, and don't sure - what parameters need exactly to be checked?

I'm planing use Nodetool utility to grab data from nodes - but it have a lot of commands, and each provides lot of information (e.g. - cfstats, info, status etc).

So - for monitoring I need get some data about memory each node's usage, used disk space, may be - something else?

Solution


  • You can use check Java Heap memory for each node. Like Total Java Heap Memory and Using Java Heap Memory.



  • One most important is CPU Utilization for each node.



  • Set alerts for errors. system.log have lots of information about.



  • You may set alerts for data disk and log disk.



  • Heartbeat check of servers like if you dont receive in few min it will get alerts to you.



  • Also dropped mutations and hinted hindoff clear alerts.



Basically you need to start observe system.log and will get more and more error for monitoring.

Context

StackExchange Database Administrators Q#107597, answer score: 5

Revisions (0)

No revisions yet.