
graphite-metrics: metric collectors for various stuff not (or poorly) handled by other monitoring daemons

The core of the project is a simple daemon (harvestd), which collects metric values and sends them to a graphite carbon daemon (and/or other configured destinations) once per interval.

It includes separate data collection components (“collectors”) for processing of:

  • /proc/slabinfo for useful-to-watch values, not everything (configurable).

  • /proc/vmstat and /proc/meminfo in a consistent way.

  • /proc/stat for irq, softirq, forks.

  • /proc/buddyinfo and /proc/pagetypeinfo (memory fragmentation).

  • /proc/interrupts and /proc/softirqs.

  • Cron log, to produce start/finish events and duration for each job as separate metrics, mapping jobs to metric names with regexes.

  • Per-system-service accounting using systemd and its cgroups.

  • sysstat data from sadc logs (use something like sadc -F -L -S DISK -S XDISK -S POWER 60 to have more stuff logged there) via the sadf binary and its JSON export (sadf -j, supported since sysstat 10.0.x or so).

  • iptables rule “hits” - packet and byte counters, taken from ip{,6}tables-save, mapped via a separate “table chain_name rule_no metric_name” file, which should be generated along with the firewall rules (I use a script to do that); see the example mapping lines right after this list.
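
A hedged sketch of what such a mapping file might look like - the metric names here are made-up placeholders, only the “table chain_name rule_no metric_name” column layout is taken from the description above:

filter INPUT 4 network.firewall.input.ssh
filter FORWARD 2 network.firewall.forward.lan
nat PREROUTING 1 network.firewall.nat.redirects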

Additional metric collectors can be added via the setuptools/distribute graphite_metrics.collectors entry point and configured via the common configuration mechanism.

The same goes for datapoint sinks (destinations - it doesn’t have to be a single carbon host), datapoint processors (mangle/rename/filter datapoints) and the main loop, which can be replaced with an async (in the simple case - threads or gevent) or buffering loop.
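
For example, an external package could register its own collector (and, assuming sinks/processors/loops use analogous entry point group names, a sink) in its setup.py - everything below apart from the graphite_metrics.collectors group name is a made-up placeholder:

from setuptools import setup, find_packages

setup(
    name='my-harvestd-plugins',  # placeholder package name
    version='0.1',
    packages=find_packages(),
    install_requires=['graphite-metrics'],
    entry_points={
        # entry point group named in the paragraph above
        'graphite_metrics.collectors': [
            'my_collector = my_harvestd_plugins.collectors:MyCollector',
        ],
        # assumed, by analogy, to exist for sink plugins
        'graphite_metrics.sinks': [
            'my_sink = my_harvestd_plugins.sinks:MySink',
        ],
    },
)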

Currently supported backends (data destinations, sinks):

  • carbon_socket - plaintext “name value timestamp” lines sent to a graphite carbon daemon (see the -t/--destination option and the configuration example below).

  • dump - datapoints dumped to the log / stderr, mainly for debugging (see the “Running” section).

  • librato_metrics - submission to the Librato Metrics HTTP API (see the extra requirements below).

Look at the shipped collectors, processors, sinks and loops and their base classes (like graphite_metrics.sinks.Sink or loops.Basic) for API examples.
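
As a rough conceptual sketch only (the hook name and datapoint format below are assumptions, not the actual graphite_metrics API - the base classes mentioned above are the authoritative reference), a collector boils down to a class that gets its slice of the configuration and is polled for (name, value, timestamp) datapoints once per interval:

import time

class LoadAvgCollector(object):
    # A real plugin would subclass the package's collector base class;
    # a plain object is used here to keep the sketch self-contained.

    def __init__(self, conf=None):
        # Collectors receive their section of the (YAML) configuration.
        self.conf = conf or {}

    def read(self):
        # Hypothetical hook name - returns an iterable of datapoints.
        with open('/proc/loadavg') as src:
            la1, la5, la15 = src.read().split()[:3]
        ts = int(time.time())
        return [
            ('load.1min', float(la1), ts),
            ('load.5min', float(la5), ts),
            ('load.15min', float(la15), ts),
        ]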

Installation

It’s a regular package for Python 2.7 (not 3.X).

Using pip is the best way:

% pip install graphite-metrics

If you don’t have it, use:

% easy_install pip
% pip install graphite-metrics

Alternatively:

% curl https://raw.github.com/pypa/pip/master/contrib/get-pip.py | python
% pip install graphite-metrics

Or, if you absolutely must:

% easy_install graphite-metrics

But, you really shouldn’t do that.

The current-git version can be installed like this:

% pip install -e 'git+https://github.com/mk-fg/graphite-metrics.git#egg=graphite-metrics'

Requirements

Basic requirements (see “install_requires” in setup.py) should be pulled in automatically by pip or easy_install.

Some shipped modules require additional packages to function (which can be installed automatically by specifying extras on install, example: pip install 'graphite-metrics[collectors.cgacct]'):

  • collectors

    • cgacct

    • cron_log

    • sysstat

      • xattr (unless --xattr-emulation is used)

      • (optional) simplejson - for better performance than stdlib json module

  • sinks

    • librato_metrics

      • requests

      • (optional) simplejson - for better performance than stdlib json module

      • (optional) gevent - to enable constant-time (more scalable) async submissions of large data chunks via concurrent API requests

Also see requirements.txt file or “install_requires” and “extras_require” in setup.py.

Running

First run should probably look like this:

% harvestd --debug -s dump -i10

That will use the default configuration with all collectors enabled, dumping data to stderr (only the “dump” data-sink enabled), using a short (10s) interval between collected datapoints, and printing additional info about what’s being done.

After that, see the default harvestd.yaml configuration file, which contains configuration for all loaded collectors and can/should be overridden using the -c option.

Note that you don’t have to specify all the options in each override-config, just the ones you need to update.

For example, a simple configuration file (say, /etc/harvestd.yaml) just to specify the carbon host and the log line format (dropping the timestamp, since output will be piped to syslog or systemd-journal anyway) might look like this:

sinks:
  carbon_socket:
    host: carbon.example.host

logging:
  formatters:
    basic:
      format: '%(levelname)s :: %(name)s: %(message)s'

And be started like this: harvestd -c /etc/harvestd.yaml

Full CLI reference:

usage: harvestd [-h] [-t host[:port]] [-i seconds] [-e collector]
                [-d collector] [-s sink] [-x sink] [-p processor]
                [-z processor] [-c path] [-a db-path] [-n] [--debug]

Collect and dispatch various metrics to destinations.

optional arguments:
  -h, --help            show this help message and exit
  -t host[:port], --destination host[:port]
                        host[:port] (default port: 2003, can be overridden via
                        config file) of sink destination endpoint (e.g. carbon
                        linereceiver tcp port, by default).
  -i seconds, --interval seconds
                        Interval between collecting and sending the
                        datapoints.
  -e collector, --collector-enable collector
                        Enable only the specified metric collectors, can be
                        specified multiple times.
  -d collector, --collector-disable collector
                        Explicitly disable specified metric collectors, can be
                        specified multiple times. Overrides --collector-
                        enable.
  -s sink, --sink-enable sink
                        Enable only the specified datapoint sinks, can be
                        specified multiple times.
  -x sink, --sink-disable sink
                        Explicitly disable specified datapoint sinks, can be
                        specified multiple times. Overrides --sink-enable.
  -p processor, --processor-enable processor
                        Enable only the specified datapoint processors, can be
                        specified multiple times.
  -z processor, --processor-disable processor
                        Explicitly disable specified datapoint processors, can
                        be specified multiple times. Overrides --processor-
                        enable.
  -c path, --config path
                        Configuration files to process. Can be specified more
                        than once. Values from the latter ones override values
                        in the former. Available CLI options override the
                        values in any config.
  -a db-path, --xattr-emulation db-path
                        Emulate filesystem extended attributes (used in some
                        collectors like sysstat or cron_log), storing per-path
                        data in a simple shelve db.
  -n, --dry-run         Do not actually send data.
  --debug               Verbose operation mode.
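
For example, to run with the config above but collect only the cron_log and sysstat metrics, printing datapoints via the “dump” sink instead of sending them anywhere (collector and sink names as used elsewhere in this document):

% harvestd -c /etc/harvestd.yaml -e cron_log -e sysstat -s dump --debug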

Rationale

Most other tools can (in theory) collect this data, and I’ve used collectd for most of these, but it:

  • Doesn’t provide some of the most useful stuff - nfs stats, disk utilization time percentage, etc.

  • Fails to collect some other stats, producing strange values like zeroes, or unrealistic and negative values (for io, network, sensors, …).

  • General-purpose plugins like “tail” add a lot of complexity, turning configuration into a mess, while still lacking some basic functionality which 10 lines of code (a plugin) can easily provide (support is there, but see below).

  • Plugins change metric names from the ones provided by /proc, referenced in kernel documentation and on the internets, making collected data unnecessarily hard to interpret and raising questions about its meaning (which is increasingly important for low-level and calculated metrics).

Initially I tried to address these issues (implement the same collectors) with collectd plugins, but its python plugin system turned out to leak RAM, and collectd itself segfaults something like once a day, even in the latest releases (though probably because of issues in C plugins).

Plus, collectd data requires post-processing anyway - proper metric namespaces, counter handling, etc.

Given that the alternative is to just get the data and echo it as “name val timestamp” lines to a tcp socket, I decided to avoid the extra complexity and problems that collectd brings.
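
That line protocol (carbon’s plaintext receiver, tcp port 2003 by default, as in the CLI reference above) really is that simple - a minimal sketch, with the host name being a placeholder:

import socket, time

# Push a single datapoint to a carbon line-receiver: "name value timestamp\n".
sock = socket.create_connection(('carbon.example.host', 2003))  # placeholder host
sock.sendall('some.metric.name 42 %d\n' % time.time())
sock.close()

(Under Python 3 the line would have to be encoded to bytes first; the package itself targets Python 2.7.)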

Other than collectd, I’ve experimented with ganglia, munin, and some other monitoring infrastructures, but found little justification for re-using their aggregation and/or collection infrastructure, which also comes with limitations of its own (like the static data schema in ganglia).

The daemon binary is (weirdly) called “harvestd” because the “metricsd” name is already used to refer to another related daemon (also, there’s “metrics” without the “d”, and probably others), and it is too generic to be used without extra confusion, I think. That, and I seem to lack the creativity to come up with a saner name (“reaperd” sounds too MassEffect’ish these days).
