Application Observability

Observability is the ability to use the information a system outputs to understand how the system behaves internally.

Observability data is captured using the following signals:

Metrics - numerical measurements taken over time, used to observe change over a period of time or to compare against configured limits. For example, memory consumption, CPU usage, and available disk space.
Logs - text outputs produced by a running system/application to provide information about what is happening. For example, outputs that capture security events such as failed login attempts, or unexpected conditions such as errors.
Traces - contextual data used to follow a request's entire path through a distributed system. For example, trace data can be used to identify bottlenecks, or failure points, within a distributed system.

Margo's application observability scope is limited to the following areas:

The device's container platform
The device's workload orchestration agent
The compliant containerized applications deployed to the device.

The application observability data is intended to be used for the following purposes:

Monitoring the container platform's health and current state. This includes memory, CPU and disk usage and availability for clusters, nodes, pods and containers, the current running state of pods and containers, and configured resource limits. This enables a customer to make decisions such as whether a device can support more applications or has too many applications deployed.
Monitoring the state of the workload orchestration agent and the containerized applications to ensure they are running correctly, performing as expected and not consuming more resources than expected.
Assisting with debugging/diagnostics for applications encountering unexpected conditions that impact their ability to run as expected.

Margo's application observability is NOT intended to be used to monitor anything outside the device such as production processes, machinery, controllers, or sensors and should NOT be used for this purpose.

Observability Framework

Instead of defining our own observability framework, we have adopted the OpenTelemetry specification. OpenTelemetry is a popular open source specification that provides a common way to collect and use observability data. OpenTelemetry was chosen for several reasons, including:

OpenTelemetry is based on an open source specification and not only an implementation
OpenTelemetry is widely adopted
OpenTelemetry has a large and active open source community
OpenTelemetry provides SDKs for many popular languages if people wish to use them
The OpenTelemetry community project has a lot of useful existing tooling, such as telemetry receivers for Kubernetes, Docker and the host system
OpenTelemetry is vendor agnostic, and many popular backend solutions, both open source and commercial, are available for consuming and displaying observability data

Decision Needed: Need to determine which version(s) of the specification are supported

OpenTelemetry Collector Deployment Methods

The device owner MUST deploy and configure an OpenTelemetry collector on their device. The device owner MAY choose the deployment model they wish to follow but MUST use one of the following approaches.

For standalone and clustered devices there MUST be at least one OpenTelemetry collector deployed to collect the observability data required below. The device owner MAY choose to deploy multiple OpenTelemetry collectors, with each collector receiving different parts of the observability data required below, as long as all required observability data is collected.

Deployment Model - Multi-Node Deployment

For multi-node capable clusters the device owner MAY choose to use the DaemonSet deployment model to ensure there is an OpenTelemetry collector running on each node.

Deployment Model - DaemonSet

For multi-node capable clusters the device owner MUST ensure that communication from applications, or a collector, on one node to a collector on a different node is secure.

Action: Need to research how this is done so we can provide additional information on what is required.

The device owner MUST NOT use the sidecar deployment model at this time since this requires the pods/containers to have foreknowledge of this deployment model.

Action: Some more research needs to be done here. If there is a way to do this dynamically without requiring the application developer to include special attributes on their pods then it may be allowed.

The device owner MUST NOT pre-configure any exporters that would make any observability data available outside the device/cluster.

The device owner MUST NOT attempt to inject auto-instrumentation (by using the OpenTelemetry operator for example) into any compliant applications running on the device that are not owned by the device owner.

Container Platform Observability Requirements

To allow monitoring of the chosen container platform's state, the device owner MUST ensure the following observability data is being collected and made available for export from the OpenTelemetry collector(s) on the device/cluster.

Kubernetes

For devices running Kubernetes the following is a minimum list of observability data that MUST be provided. The device owner MAY choose to provide additional observability data if they wish.

Cluster (both single and multiple node) observability data MUST be collected.
It is recommended the Device Owner use the Kubernetes Cluster Receiver with the default configuration to collect this information but using this receiver is not required.
If the Device Owner chooses not to use the Kubernetes Cluster Receiver they MUST provide the same output as the Kubernetes Cluster Receiver's default configuration.

Note: Please see the information below for the default metrics emitted by the Kubernetes Cluster Receiver.

Cluster events observability data MUST be collected.
It is recommended the Device Owner use either the Kubernetes Objects Receiver or the Kubernetes Event Receiver with the default configuration to collect this information but using either of these receivers is not required.
If the Device Owner chooses not to use either the Kubernetes Objects Receiver or the Kubernetes Event Receiver they MUST provide the same output as the Kubernetes Event Receiver's default configuration.

Action: Need to determine which namespaces should be included. All of them, or just the ones the device owner is responsible for creating.

Action: The Kubernetes objects receiver needs to be configured to export events and other resource logs so we'll need to document something additional for this receiver.

Node, Pod and Container observability data MUST be collected.
It is recommended the Device Owner use the Kubelet Stats Receiver with the default configuration to collect this information but using this receiver is not required.
If the Device Owner chooses not to use the Kubelet Stats Receiver they MUST provide the same output as the Kubelet Stats Receiver's default configuration.

Note: Please see the information below for the default metrics emitted by the Kubelet Stats Receiver.

Metadata identifying the observability data's source MUST be added to the received observability data.
It is recommended the Device Owner use the Kubernetes Attributes Processor with the default configuration to enhance the observability data with this additional metadata but using this processor is not required.
If the Device Owner chooses not to use the Kubernetes Attributes Processor they MUST provide the same metadata as the Kubernetes Attributes Processor's default configuration.

Note: Please see the information below for the default attributes added by the Kubernetes Attributes Processor.

Standalone Device Container Platforms

For devices running non-clustered container platforms such as Docker or Podman the following is a minimum list of observability data that MUST be provided. The device owner MAY choose to provide additional observability data if they wish.

Container observability data MUST be collected.
It is recommended the Device Owner use the Docker Stats Receiver or Podman Stats Receiver with the default configuration to collect this information but using either of these receivers is not required.
If the Device Owner chooses not to use either receiver they MUST provide the same output as the receiver's default configuration.

Note: Please see the information below for the default metrics emitted by the Docker Stats and Podman Stats Receivers.

General

The collector MUST receive data using the OTLP format.
It is recommended the Device Owner use the OTLP Receiver to allow applications to send observability data to the collector.
If the Device Owner chooses not to use the OTLP Receiver they MUST provide the same functionality as the OTLP Receiver.

Action: We will need to determine if there is additional information the device owner needs to include as attributes for each message to ensure the source can be identified. For example, we may require a device Id attribute.

Host observability data MUST be collected.
It is recommended the Device Owner use the Host Metrics Receiver with the default configuration to collect this information but using this receiver is not required.
If the Device Owner chooses not to use the Host Metrics Receiver they MUST provide the same output as the Host Metrics Receiver's default configuration.

Note: Please see the information below for the default metrics emitted by the Host Metrics Receiver.

Log file observability data MUST be collected.
It is recommended the Device Owner use the File Log Receiver to receive logs from their chosen container platform and applications sending logs to stdout.
If the Device Owner chooses not to use the File Log Receiver they MUST provide the same functionality to receive the container platform and application logs.

Action: The File Log Receiver needs to be configured to capture the container logs so we'll need to figure out how we want to document this. We also need to determine if there are any other logs we should capture in addition to the container logs.
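The receiver recommendations above can be turned into a quick self-check. The following is a minimal sketch, not part of the specification: it assumes the collector configuration has already been parsed into a Python dictionary, and it assumes the component type names used by the OpenTelemetry Collector contrib distribution (`k8s_cluster`, `k8sobjects`, `k8s_events`, `kubeletstats`, `docker_stats`, `podman_stats`, `otlp`, `hostmetrics`, `filelog`). The function and table names are hypothetical.

```python
from typing import Any, Dict, List, Set

# Each entry is a set of alternatives; one of them is enough.
# Type names are assumed from the Collector contrib distribution.
RECOMMENDED_RECEIVERS: Dict[str, List[Set[str]]] = {
    "kubernetes": [{"k8s_cluster"}, {"k8sobjects", "k8s_events"}, {"kubeletstats"}],
    "standalone": [{"docker_stats", "podman_stats"}],
    "general": [{"otlp"}, {"hostmetrics"}, {"filelog"}],
}


def component_types(section: Dict[str, Any]) -> Set[str]:
    """Strip the optional '/name' suffix from component IDs such as 'otlp/apps'."""
    return {component_id.split("/", 1)[0] for component_id in section}


def missing_receivers(config: Dict[str, Any], platform: str) -> List[str]:
    """List recommended receivers that the parsed collector config does not define."""
    configured = component_types(config.get("receivers") or {})
    missing = []
    for alternatives in RECOMMENDED_RECEIVERS[platform] + RECOMMENDED_RECEIVERS["general"]:
        if not alternatives & configured:
            missing.append(" or ".join(sorted(alternatives)))
    return missing


# Hypothetical, partially configured Kubernetes device.
config = {"receivers": {"otlp": {}, "hostmetrics": {}, "k8s_cluster": {}, "kubeletstats": {}}}
print(missing_receivers(config, "kubernetes"))  # ['k8s_events or k8sobjects', 'filelog']
```

Because the receivers are recommended rather than required, an entry in the result is a prompt to confirm that equivalent data is provided some other way, not a compliance failure. The sketch also only looks at the `receivers` section; it does not confirm that a receiver is wired into a pipeline.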

Workload Orchestration Agent Observability Requirements

It is recommended the workload orchestration agent be deployed as a containerized application for a number of reasons. If it is deployed this way, the application's resource utilization observability data is captured automatically as part of the container platform observability requirements.

If the device owner chooses not to deploy their workload orchestration agent as a containerized application they MUST ensure the following resource usage observability data is available from the OpenTelemetry collector for their agent.

Action: Need to do research to determine if this makes sense, or not, when the agent is not running as a containerized application. We may have to leave it up to what is covered through device observability for this case. If it is possible, and makes sense, we need to define what should be provided.

In addition to the resource utilization data the workload orchestration agent MUST also send the following minimum set of application observability data to the OpenTelemetry collector on the device/cluster. The device owner MAY choose to provide additional observability data if they wish.

Action: We need to understand what the WOS/a is going to be doing to determine what this is.

Compliant Application Observability Requirements

Compliant applications MAY choose to expose application specific observability data by sending their observability data to the OpenTelemetry collector on the device/cluster. While this is optional, it is highly recommended in order to support distributed diagnostics.

Application developers choosing to expose application logs for consumption with OpenTelemetry MUST either write their logs to stdout or send them using OTLP so the receiver on the OpenTelemetry collector can import them.

Action: We need some research here to understand if there are any expectations on log format, and whether it's best to use stdout, log files, or sending logs directly to the collector. Also, we need to explore the option to allow people to use third-party log collection agents (e.g., FluentBit) that collect the logs and forward them to the collector.
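As an illustration of the stdout option, the following is a minimal sketch using only the Python standard library. The one-JSON-object-per-line layout and the field names are assumptions made for this example; as the action above notes, no log format has been decided. The logger name and helper names are hypothetical.

```python
import json
import logging
import sys
from datetime import datetime, timezone


class JsonLineFormatter(logging.Formatter):
    """Render each record as a single JSON line (field names are assumed)."""

    def format(self, record: logging.LogRecord) -> str:
        entry = {
            "timestamp": datetime.fromtimestamp(record.created, tz=timezone.utc).isoformat(),
            "severity": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        return json.dumps(entry)


def build_stdout_logger(name: str) -> logging.Logger:
    """Create a logger that writes to stdout, where the container platform captures it."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.propagate = False  # avoid duplicate lines through the root logger
    if not logger.handlers:
        handler = logging.StreamHandler(sys.stdout)
        handler.setFormatter(JsonLineFormatter())
        logger.addHandler(handler)
    return logger


logger = build_stdout_logger("example-app")
logger.warning("failed login attempt for user %s", "operator")
```

Writing to stdout keeps the application unaware of the collector: the container platform stores the stream in its log files, and a log receiver on the collector can then read those files.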

Application developers SHOULD NOT expect their applications to be auto-instrumented by anything outside of their control (by the OpenTelemetry operator for example).

An application developer MAY choose an alternative implementation for their observability data but it MUST be self-contained within the deployment of their application. If this is done, it is NOT recommended application developers publish their observability data outside the device/cluster by any means other than the OpenTelemetry collector. If the application developer chooses to export data without using the OpenTelemetry collector they MUST NOT do this without the customer's approval.

Action: Need to address in some form legacy applications that are not currently using OpenTelemetry and don't want to migrate their application to use it.

Connecting to the OpenTelemetry Collector

In order for an application to publish its observability data to the collector on the device/cluster the device owner MUST inject the following environment variables into each container.

Environment Variable	Description
OTEL_EXPORTER_OTLP_PROTOCOL	(Optional) "grpc" if the preferred protocol is gRPC, "http/protobuf" if the preferred protocol is HTTP + protobuf. The default is "http/protobuf" if nothing is provided for this environment variable. If the preferred protocol is "grpc" but no gRPC endpoint is provided, or the application client cannot connect via gRPC, the application client connects using "http/protobuf".
GRPC_OTEL_EXPORTER_OTLP_ENDPOINT	(Optional) The URL for the application to use to connect to the OpenTelemetry collector using gRPC.
HTTP_OTEL_EXPORTER_OTLP_ENDPOINT	(Required) The URL for the application to use to connect to the OpenTelemetry collector using HTTP + protobuf.
OTEL_EXPORTER_OTLP_CERTIFICATE	(Optional) The path to the client certificate (in PEM format) to use for secure connections to the OpenTelemetry collector. The application must connect using the certificate if it is provided.

Action: We need to do some additional research to validate the above and see if any other data is needed for things like establishing a secure connection to the collector.
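The fallback rules in the table above can be sketched as a small helper that an application might run at startup. This is an illustration under stated assumptions rather than a required implementation: the function name `resolve_otlp_target` is hypothetical, and the sketch only covers the configuration-time rules. The runtime fallback (gRPC configured but unreachable) would have to be handled where the connection is actually made.

```python
import os
from typing import Mapping, Tuple

DEFAULT_PROTOCOL = "http/protobuf"


def resolve_otlp_target(env: Mapping[str, str] = os.environ) -> Tuple[str, str]:
    """Return (protocol, endpoint) chosen from the injected environment variables."""
    http_endpoint = env.get("HTTP_OTEL_EXPORTER_OTLP_ENDPOINT")
    if not http_endpoint:
        # The HTTP endpoint is marked as required, so its absence is treated as an error.
        raise RuntimeError("HTTP_OTEL_EXPORTER_OTLP_ENDPOINT is not set")

    protocol = env.get("OTEL_EXPORTER_OTLP_PROTOCOL") or DEFAULT_PROTOCOL
    grpc_endpoint = env.get("GRPC_OTEL_EXPORTER_OTLP_ENDPOINT")

    if protocol == "grpc" and grpc_endpoint:
        return "grpc", grpc_endpoint
    # Every other case, including gRPC preferred without a gRPC endpoint,
    # falls back to HTTP + protobuf.
    return DEFAULT_PROTOCOL, http_endpoint
```

For example, with only `HTTP_OTEL_EXPORTER_OTLP_ENDPOINT` set, the helper returns `"http/protobuf"` and that URL; with the preferred protocol set to `"grpc"` and both endpoints present, it returns `"grpc"` and the gRPC URL.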

Exporting Observability Data

Customers MUST be able to export observability data from a device/cluster to collectors, or backends, onsite or in the cloud if they wish to make the information available.

Decision Needed: There is a dependency on the decisions about using OpenTelemetry instead of the management API approach. If OpenTelemetry is chosen then there would be some subset of data that MUST be exported to the workload orchestration service vendor.

Future Decision: For MVS1 we have decided the configuration is updated manually. We know this is not ideal because it is error prone and can result in changes being made that should not be made. The current thinking is that the device orchestration agent will be responsible for updating the configuration when the WOS vendor or customer needs to add exports but this is out of scope for MVS1.

OpenTelemetry allows using either a push or pull approach for getting data from a collector. If you are a solution vendor offering workload orchestration or observability services in the cloud, it is strongly recommended you NOT require a pull method for collecting observability data, because most customers will not allow their devices to be exposed this way due to security concerns.

Consuming Observability Data

Solution vendors, such as the workload orchestration service vendor, MAY choose to consume observability data exported from customer devices to provide valuable services to the customer.

Customers MAY choose to consume observability data exported from their devices to other OpenTelemetry collectors or backends within their environment that are not on the device.

Device owners are NOT expected to provide backends for consuming observability data on their devices.

Application Observability Default Telemetry

The following telemetry data is collected by using the default configurations for the receivers indicated above. You can find more information about each piece of telemetry from the receiver's documentation.

Action: This information was compiled based on the receiver's documentation and we still need to validate the default data emitted matches what is documented.

Metrics

The following table shows the metrics emitted by the indicated receivers when using the default configuration.

Metric Group	Metric	Target	Kubernetes Cluster Receiver	Kubelet Stats Receiver	Docker Stats Receiver	Podman Stats Receiver	Host Metrics Receiver
CPU	Limit	Container	X
CPU	Load Average (15m, 5m, 1m)	System					X
CPU	Time	Container, K8s Node, K8s Pod, System		X			X
CPU	Request	Container	X
CPU	Usage Kernel Mode	Container			X
CPU	Usage Per CPU	Container				X
CPU	Usage System	Container				X
CPU	Usage Total	Container			X	X
CPU	Usage User Mode	Container			X
CPU	Utilization	Container, K8s Node, K8s Pod		X	X	X
Disk	IO	Container, System				X	X
Disk	IO Read	Container				X
Disk	IO Write	Container				X
Disk	IO Time	System				X
Disk	IO Time (Weighted)	System				X
Disk	Operations	System				X
Disk	Operations Pending	System				X
Disk	Operation Time	System				X
Disk	Total Read/Writes	System				X
File System	Available	Container, K8s Node, K8s Pod		X
File System	Capacity	Container, K8s Node, K8s Pod		X
File System	Inodes	K8s Volume, System		X		X
File System	Inodes Free	Volume		X
File System	Inodes Used	Volume		X
File System	Usage	Container, K8s Node, K8s Pod, System		X			X
Memory	Available	Container, K8s Node, K8s Pod		X
Memory	File	Container			X
Memory	Limit	Container	X		X	X
Memory	Major Page Fault	Container, K8s Node, K8s Pod		X
Memory	Page Faults	Container, K8s Node, K8s Pod		X
Memory	Percent	Container			X	X
Memory	Request	Container	X
Memory	RSS	Container, K8s Node, K8s Pod		X
Memory	Total Cache	Container			X
Memory	Usage	Container, K8s Node, K8s Pod, System		X	X	X	X
Memory	Working Set	Container, K8s Node, K8s Pod		X
Network	Connections	System					X
Network	Errors	K8s Node, K8s Pod, System		X			X
Network	IO	K8s Node, K8s Pod, System		X			X
Network	IO Bytes Sent	Container			X	X
Network	IO Bytes Received	Container			X	X
Network	IO Packets	System					X
Network	IO Packets Dropped	System					X
Network	IO Packets Dropped (Incoming)	Container			X
Network	IO Packets Dropped (Outgoing)	Container			X
Paging	Faults	System					X
Paging	Operations	System					X
Paging	Usage	System					X
Process	CPU Time	System					X
Process	Disk IO	System					X
Process	Memory Usage	System				X
Process	Memory Virtual	System				X
Processes	Count	System					X
Processes	Created	System					X
Resource Quota	Hard Limit	Various	X
Resource Quota	Used	Various	X
State	Ready	Container	X
State	Restarts	Container	X
State	Active Jobs	Cron Job	X
State	Current Scheduled Nodes	Daemonset	X
State	Desired Scheduled Nodes	Daemonset	X
State	Misscheduled Nodes	Daemonset	X
State	Ready Nodes	Daemonset	X
State	Available	Deployment	X
State	Desired	Deployment	X
State	Current Replicas	HPA	X
State	Desired Replicas	HPA	X
State	Max Replicas	HPA	X
State	Min Replicas	HPA	X
State	Active Pods	Job	X
State	Desired Successful Pods	Job	X
State	Failed Pods	Job	X
State	Max Parallel Jobs	Job	X
State	Successful Pods	Job	X
State	Phase	Namespace	X
State	Phase	Pod	X
State	Available	Replicaset	X
State	Desired	Replicaset	X
State	Available	Replication Controller	X
State	Desired	Replication Controller	X
State	Current Pods	Stateful Set	X
State	Desired Pods	Stateful Set	X
State	Ready Pods	Stateful Set	X
State	Updated Pods	Stateful Set	X
Storage	Available	Volume		X
Storage	Capacity	Volume		X
Storage	Limit	Container	X
Storage (Ephemeral)	Limit	Container	X
Storage	Requests	Container	X
Storage (Ephemeral)	Request	Container	X

Logs

The following table shows the logs collected by the indicated receivers. The Kubernetes Event Receiver collects the event logs when using the default configuration. The Kubernetes Objects Receiver and File Log Receiver must be configured to collect the desired logs.

Source	Kubernetes Objects Receiver	Kubernetes Event Receiver	File Log Receiver
Container Logs			-
K8s Events	-	-
K8s Resources	X

Kubernetes Attributes Processor

The following shows the attributes added to each signal when using the Kubernetes Attributes Processor's default configuration.

k8s.namespace.name
k8s.pod.name
k8s.pod.uid
k8s.pod.start_time
k8s.deployment.name
k8s.node.name
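A device owner who replaces the Kubernetes Attributes Processor with something else may want to confirm that exported data still carries these attributes. The following is a minimal sketch of such a check; the function name and the sample values are hypothetical, and it assumes the attributes of a signal are available as a plain dictionary.

```python
from typing import List, Mapping

# Attribute names taken from the list above.
DEFAULT_K8S_ATTRIBUTES = (
    "k8s.namespace.name",
    "k8s.pod.name",
    "k8s.pod.uid",
    "k8s.pod.start_time",
    "k8s.deployment.name",
    "k8s.node.name",
)


def missing_k8s_attributes(attributes: Mapping[str, object]) -> List[str]:
    """Return the default attribute names that are absent from a signal's attributes."""
    return [name for name in DEFAULT_K8S_ATTRIBUTES if name not in attributes]


# Hypothetical attributes attached to one exported signal.
sample = {
    "k8s.namespace.name": "apps",
    "k8s.pod.name": "example-app-0",
    "k8s.pod.uid": "00000000-0000-0000-0000-000000000000",
    "k8s.node.name": "node-1",
}
print(missing_k8s_attributes(sample))  # ['k8s.pod.start_time', 'k8s.deployment.name']
```

A missing attribute is not always an error. For example, a pod that is not managed by a Deployment has no deployment name to report, so the result is best read as a list of items to review.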