Apache Integrates with SkyWalking for Log Processing

admin 9 2025-01-11 编辑

Apache  Integrates with SkyWalking for Log Processing

This paper mainly introduces two Apache integrated SkyWalking log plug-ins to provide a more convenient operation and environment for log processing in Apache .

In the field of observability, the three main directions of data collection and analysis, Metrics, Logger and Tracing, are usually used to achieve insight into the operational status of applications.

Apache has integrated Apache SkyWaling Tracing capabilities as early as version 1.4, with features such as error logging and access log collection added in subsequent versions. Now with Apache SkyWalking's support for Metrics, it enables Apache to implement a one-stop observable solution in integrated mode, covering both logging, metrics and call tracing.

Feature Development Background​

Those of you who are familiar with Apache should know that Apache produces two types of logs during operation, namely the access log and the error log.

Access logs record detailed information about each request and are logs generated within the scope of the request, so they can be directly associated with Tracing. Error logs, on the other hand, are Apache runtime output log messages, which are application-wide logs, but cannot be 100% associated with requests.

At present, Apache provides very rich log processing plug-ins, including TCP/HTTP/Kafka and other collection and reporting plug-ins, but they are weakly associated with Tracing. Take Apache SkyWalking as an example. We extract the SkyWalking Tracing Conetxt Header from the log records of Apache and export it to the file system, and then use the log processing framework (fluentbit) to convert the logs into a log format acceptable to SkyWalking. The Tracing Context is then parsed and extracted to obtain the Tracing ID to establish a connection with the Trace.

Obviously, the above way of handling the process is tedious and complicated, and requires additional conversion of log formats. For this reason, in PR#5500 we have implemented the Apache SkyWalking access log into the Apache plug-in ecosystem to make it easier for users to collect and process logs using Apache SkyWalking in Apache .

Introduction of the New Plugins​

SkyWalking Logger Plugin​

The SkyWalking Logger plugin parses the SkyWalking Tracing Context Header and prints the relevant Tracing Context information to the log, thus enabling the log to be associated with the call chain.

By using this plug-in, Apache can get the SkyWalking Tracing Context and associate it with Tracing even if the SkyWalking Tracing plug-in is not turned on, if Apache SkyWalking is already integrated downstream.

The above Content is the log content, where the Apache metadata configuration is used to collect request-related information. You can later modify the Log Format to customize the log content by Plugin Metadata, please refer to the official documentation.

How to Use​

When using this plugin, since the SkyWalking plugin is "not enabled" by default, you need to manually modify the plugins section in the conf/default-.yaml file to enable the plugin.

plugins:  ...  - error-log-logger  ...

Then you can use the SkyWalking Tracing plug-in to get the tracing data directly, so you can verify that the Logging plug-in-related features are enabled and working properly.

Step 1: Create a route​

Next, create a route and bind the SkyWalking Tracing plugin and the SkyWalking Logging plugin. More details of the plugin configuration can be found in the official Apache documentation.

curl -X PUT 'http://192.168.0.108:9080//admin/routes/1001' \-H 'X-API-KEY:  edd1c9f034335f136f87ad84b625c8f1' \-H 'Content-Type: application/json' \-d '{    "uri": "/get",    "plugins": {        "skywalking": {            "sample_ratio": 1        },        "skywalking-logger": {            "endpoint_addr": "http://127.0.0.1:12800"        }    },    "upstream": {        "type": "roundrobin",        "nodes": {            "httpbin.org:80": 1        }    }}'
Step 2: Log Processing​

On the Apache SkyWalking side, you can use LAL (Logger Analysis Language) scripts for log processing, such as Tag extraction, SkyWalking metadata correction, and so on.

The main purpose of Tag extraction here is to facilitate subsequent retrieval and to add dependencies to the Metrics statistics. The following code can be used to configure the SkyWalking LAL script to complete the Tag extraction. For more information on how to use the SkyWalking LAL script, please refer to the official Apache SkyWalking documentation.

# The default LAL script to save all logs, behaving like the versions before 8.5.0.rules:  - name: default    dsl: |      filter {        json {          abortOnFailure false        }        extractor {          tag routeId: parsed.route_id          tag upstream: parsed.upstream          tag clientIp: parsed.client_ip          tag latency: parsed.latency        }        sink {        }      }

After configuring the above LAL script in SkyWalking OAP Server the following log will be displayed.

Details of the expanded log are as follows.

As you can see from the above, displaying routeId, upstream and clientIp as key-value pairs is much easier than searching directly in the log body. This is because the Tag format not only supports log display format and search, but also generates information such as Metrics using MAL statistics.

SkyWalking Error Logger Plugin​

The error-log-logger plug-in now supports the SkyWalking log format, and you can now use the http-error-log plug-in to quickly connect Apache error logs to Apache SkyWalking. Currently, error logs do not have access to SkyWalking Tracing Context information, and therefore cannot be directly associated with SkyWalking Tracing.

The main reason for the error log to be integrated into SkyWalking is to centralize the Apache log data and to make it easier to view all observable data within SkyWalking.

How to Use​

Since the error-log-logger plugin is "not enabled" by default, you still need to enable the plugin in the way mentioned above.

plugins:  ...  - error-log-logger  ...
Step 1: Bind the route​

After enabling, you need to bind the plugin to routes or global rules. Here we take "bind routes" as an example.

curl -X PUT 'http://192.168.0.108:9080//admin/plugin_metadata/error-log-logger' \-H 'X-API-KEY:  edd1c9f034335f136f87ad84b625c8f1' \-H 'Content-Type: application/json' \-d '{    "inactive_timeout": 10,    "level": "ERROR",    "skywalking": {        "endpoint_addr": "http://127.0.0.1:12800/v3/logs"    }}'

Note that the endpoint_addr is the SkyWalking OAP Server address and needs to have the URI (i.e. /v3/logs).

Step 2: LAL Processing​

In much the same way as the Access Log processing, the logs are also processed by LAL when they reach SkyWalking OAP Server. Therefore, we can still use the SkyWalking LAL script to analyze and process the log messages.

It is important to note that the Error Log message body is in text format. If you are extracting tags, you will need to use regular expressions to do this. Unlike Access Log, which handles the message body in a slightly different way, Acces Log uses JSON format and can directly reference the fields of the JSON object using JSON parsing, but the rest of the process is largely the same.

Tags can also be used to optimize the display and retrieval for subsequent metrics calculations using SkyWalking MAL.

rules:  - name: -errlog    dsl: |      filter {        text {          regexp "(?<datetime>\\d{4}/\\d{2}/\\d{2} \\d{2}:\\d{2}:\\d{2}) \\[(?<level>\\w+)\\] \\d+\\#\\d+:( \\*\\d+ \\[(?<module>\\w+)\\] (?<position>.*\\.lua:\\d+): (?<function>\\w+\\(\\)):)* (?<msg>.+)"        }        extractor {          tag level: parsed.level          if (parsed?.module) {            tag module: parsed.module            tag position: parsed.position            tag function: parsed.function          }        }        sink {        }      }

After the LAL script used by SkyWalking OAP Server, some of the Tags will be extracted from the logs, as shown below.

Summary​

This article introduces two logging plug-ins for Apache that integrate with SkyWalking to provide a more convenient operation and environment for logging in Apache afterwards.

We hope that through this article, you will have a fuller understanding of the new features and be able to use Apache for centralized management of observable data more conveniently in the future.

Apache Integrates with SkyWalking for Log Processing

上一篇: Understanding the Significance of 3.4 as a Root in Mathematics
下一篇: Canary releases with Apache
相关文章