Cassandra cluster monitoring
cassandra, monitoring, cluster
Problem
We use Nagios to monitor our servers.
Each web environment has one Cassandra 2.0 cluster as its backend, and each cluster has 3-5 nodes.
I want to write a Nagios plugin to monitor the Cassandra nodes.
Unfortunately, I'm not very familiar with Cassandra and am not sure exactly which parameters need to be checked.
I'm planning to use the nodetool utility to collect data from the nodes, but it has a lot of commands, and each one provides a lot of information (e.g. cfstats, info, status).
So for monitoring I need data about each node's memory usage and used disk space. Is there anything else I should check?
Solution
- Check the Java heap memory on each node: total heap versus used heap.
- CPU utilization on each node is one of the most important metrics.
- Set alerts for errors. system.log contains a lot of information about them.
- Set alerts on the data disk and on the log disk.
- Add a heartbeat check for each server, so that you are alerted if no heartbeat arrives within a few minutes.
- Also set alerts for dropped mutations and hinted handoff.
Basically, start by watching system.log; the errors you see there will show you what else is worth monitoring.
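As a rough illustration of the heap and dropped-mutation checks above, here is a minimal Python sketch of a Nagios plugin built on nodetool. It is not a finished plugin. It assumes that `nodetool info` prints a line shaped like `Heap Memory (MB) : used / total` and that `nodetool tpstats` lists a `MUTATION` row with a dropped count; verify both against the output of your Cassandra version. The function names and the 75%/90% thresholds are made up for the example.

```python
import re
import subprocess

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def parse_heap_mb(info_text):
    """Return (used_mb, total_mb) from `nodetool info` output.

    Assumption: the output has a line like
    'Heap Memory (MB) : 466.97 / 1956.00'.
    """
    match = re.search(r"Heap Memory \(MB\)\s*:\s*([\d.]+)\s*/\s*([\d.]+)", info_text)
    if match is None:
        raise ValueError("no 'Heap Memory (MB)' line found")
    used_mb, total_mb = float(match.group(1)), float(match.group(2))
    if total_mb <= 0:
        raise ValueError("total heap reported as zero")
    return used_mb, total_mb


def parse_dropped_mutations(tpstats_text):
    """Return the dropped MUTATION count from `nodetool tpstats` output.

    Assumption: the dropped-messages table has a row like 'MUTATION   0'.
    """
    match = re.search(r"^MUTATION\s+(\d+)", tpstats_text, re.MULTILINE)
    if match is None:
        raise ValueError("no 'MUTATION' row found")
    return int(match.group(1))


def evaluate(used_mb, total_mb, dropped, warn_pct=75.0, crit_pct=90.0):
    """Map the readings to an (exit_code, message) pair. Thresholds are illustrative."""
    pct = 100.0 * used_mb / total_mb
    message = "heap %.0f%% (%.0f/%.0f MB), dropped mutations %d" % (
        pct, used_mb, total_mb, dropped)
    if pct >= crit_pct:
        return CRITICAL, "CRITICAL - " + message
    if pct >= warn_pct or dropped > 0:
        return WARNING, "WARNING - " + message
    return OK, "OK - " + message


def run_nodetool(command, host="127.0.0.1"):
    """Run one nodetool subcommand against a host and return its stdout as text."""
    output = subprocess.check_output(["nodetool", "-h", host, command])
    return output.decode("utf-8", "replace")


def main(host="127.0.0.1"):
    """Collect both readings, print one status line, return the exit code."""
    try:
        used_mb, total_mb = parse_heap_mb(run_nodetool("info", host))
        dropped = parse_dropped_mutations(run_nodetool("tpstats", host))
    except (OSError, subprocess.CalledProcessError, ValueError) as exc:
        print("UNKNOWN - %s" % exc)
        return UNKNOWN
    code, message = evaluate(used_mb, total_mb, dropped)
    print(message)
    return code
```

To use it as a plugin, end the script with `import sys; sys.exit(main())`, since Nagios reads the status from the exit code and the first line of output. The dropped count is probably cumulative since the node started, so a real check would likely compare it with the previous reading rather than warn on any non-zero value.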
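Watching system.log can be sketched in Python as well. The example below counts ERROR and WARN lines written since the last check. It assumes each log entry starts with its level (possibly padded with spaces), as in the default Cassandra 2.0 log layout; check this against your own log configuration. The log path is up to you (package installs often use /var/log/cassandra/system.log), and the helper names are hypothetical.

```python
import os
import re

# Assumption: each entry starts with its level, e.g. "ERROR [thread] timestamp ...".
LEVEL_RE = re.compile(r"^\s*(ERROR|WARN)\b")


def count_levels(lines):
    """Count lines that start with ERROR or WARN; stack-trace lines are ignored."""
    counts = {"ERROR": 0, "WARN": 0}
    for line in lines:
        match = LEVEL_RE.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def check_log(path, offset=0):
    """Scan the log from a byte offset.

    Returns (counts, new_offset). Store new_offset between runs so that each
    check only looks at lines written since the previous one.
    """
    if offset > os.path.getsize(path):
        offset = 0  # file is shorter than before: assume it was rotated
    with open(path, "rb") as handle:
        handle.seek(offset)
        data = handle.read()
    lines = data.decode("utf-8", "replace").splitlines()
    return count_levels(lines), offset + len(data)
```

A plugin built on this could return CRITICAL when the ERROR count is non-zero and WARNING when only WARN lines appeared, keeping the offset in a small state file between runs.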
Context
StackExchange Database Administrators Q#107597, answer score: 5