moby--moby/libcontainerd
Thomas Leonard b6c7becbfe
Add support for user-defined healthchecks
This PR adds support for user-defined health-check probes for Docker
containers. It adds a `HEALTHCHECK` instruction to the Dockerfile syntax plus
some corresponding "docker run" options. It can be used with a restart policy
to automatically restart a container if the check fails.

The `HEALTHCHECK` instruction has two forms:

* `HEALTHCHECK [OPTIONS] CMD command` (check container health by running a command inside the container)
* `HEALTHCHECK NONE` (disable any healthcheck inherited from the base image)

The `HEALTHCHECK` instruction tells Docker how to test a container to check that
it is still working. This can detect cases such as a web server that is stuck in
an infinite loop and unable to handle new connections, even though the server
process is still running.

When a container has a healthcheck specified, it has a _health status_ in
addition to its normal status. This status is initially `starting`. Whenever a
health check passes, it becomes `healthy` (whatever state it was previously in).
After a certain number of consecutive failures, it becomes `unhealthy`.

The options that can appear before `CMD` are:

* `--interval=DURATION` (default: `30s`)
* `--timeout=DURATION` (default: `30s`)
* `--retries=N` (default: `1`)

The health check will first run **interval** seconds after the container is
started, and then again **interval** seconds after each previous check completes.

If a single run of the check takes longer than **timeout** seconds then the check
is considered to have failed.

It takes **retries** consecutive failures of the health check for the container
to be considered `unhealthy`.

There can only be one `HEALTHCHECK` instruction in a Dockerfile. If you list
more than one then only the last `HEALTHCHECK` will take effect.

The command after the `CMD` keyword can be either a shell command (e.g. `HEALTHCHECK
CMD /bin/check-running`) or an _exec_ array (as with other Dockerfile commands;
see e.g. `ENTRYPOINT` for details).

The command's exit status indicates the health status of the container.
The possible values are:

- 0: success - the container is healthy and ready for use
- 1: unhealthy - the container is not working correctly
- 2: starting - the container is not ready for use yet, but is working correctly

If the probe returns 2 ("starting") when the container has already moved out of the
"starting" state then it is treated as "unhealthy" instead.

For example, to check every five minutes or so that a web-server is able to
serve the site's main page within three seconds:

    HEALTHCHECK --interval=5m --timeout=3s \
      CMD curl -f http://localhost/ || exit 1

To help debug failing probes, any output text (UTF-8 encoded) that the command writes
on stdout or stderr will be stored in the health status and can be queried with
`docker inspect`. Such output should be kept short (only the first 4096 bytes
are stored currently).

When the health status of a container changes, a `health_status` event is
generated with the new status. The health status is also displayed in the
`docker ps` output.

Signed-off-by: Thomas Leonard <thomas.leonard@docker.com>
Signed-off-by: Sebastiaan van Stijn <github@gone.nl>
2016-06-02 23:58:34 +02:00
..
windowsoci Windows: Use image version, not OS version for TTY fixup 2016-05-25 12:22:52 -07:00
client.go libcontainerd: reuse our pkg/locker 2016-03-29 09:43:12 -07:00
client_linux.go Add support for user-defined healthchecks 2016-06-02 23:58:34 +02:00
client_solaris.go Get the Docker Engine to build clean on Solaris 2016-05-23 16:37:12 -07:00
client_windows.go Add support for user-defined healthchecks 2016-06-02 23:58:34 +02:00
container.go Reset restart timeout if execution longer than 10s 2016-04-22 10:37:34 -07:00
container_linux.go Merge pull request #22125 from crosbymichael/restart-timeout 2016-04-25 19:15:32 +02:00
container_solaris.go Get the Docker Engine to build clean on Solaris 2016-05-23 16:37:12 -07:00
container_windows.go Merge pull request #23142 from Microsoft/ExtraCleanup 2016-06-01 11:09:06 -04:00
pausemonitor_linux.go Replace execdrivers with containerd implementation 2016-03-18 13:38:32 -07:00
process.go Replace execdrivers with containerd implementation 2016-03-18 13:38:32 -07:00
process_linux.go Replace execdrivers with containerd implementation 2016-03-18 13:38:32 -07:00
process_solaris.go Get the Docker Engine to build clean on Solaris 2016-05-23 16:37:12 -07:00
process_windows.go Windows: Use image version, not OS version for TTY fixup 2016-05-25 12:22:52 -07:00
queue_linux.go Replace execdrivers with containerd implementation 2016-03-18 13:38:32 -07:00
remote.go Fix misspellings 2016-03-28 16:21:02 +03:00
remote_linux.go Merge pull request #23148 from mlaventure/wait-for-containerd-before-restarting-it 2016-06-01 10:35:31 -07:00
remote_solaris.go Get the Docker Engine to build clean on Solaris 2016-05-23 16:37:12 -07:00
remote_windows.go libcontainerd: reuse our pkg/locker 2016-03-29 09:43:12 -07:00
types.go Add support for user-defined healthchecks 2016-06-02 23:58:34 +02:00
types_linux.go Adding postRunProcessing infrastructure for hanlding Windows Update. 2016-04-06 14:03:05 -07:00
types_solaris.go Get the Docker Engine to build clean on Solaris 2016-05-23 16:37:12 -07:00
types_windows.go Adding servicing update to postRunProcessing for Windows containers. 2016-04-25 12:16:26 -07:00
utils_linux.go Add ulimit support to libcontainerd addprocess 2016-03-23 19:54:32 -07:00
utils_windows.go Adding servicing update to postRunProcessing for Windows containers. 2016-04-25 12:16:26 -07:00