Writing a plugin for the Elastic APM Java agent

This blog describes how to write a plugin for R2DBC for Elastic APM Java agent.

Dec 7, 2023

Sven Rienstra

Cloud Developer

As we all know, observability is something we rely heavily on in our ever more complex landscape. In order to know if our applications are still doing what they should be doing, we can't live without it. But also, when something does go wrong, it can provide us with very valuable insight into what is going on inside our application.

One of the tools we can use to improve the observability of our applications is Application Performance Monitoring agents (or APM agents). The agents provide information on what is going on inside our applications. They generally do this by instrumenting our code, without necessarily needing any changes within our own code. Most providers of APM tooling have agents available for a wide range of programming languages, such as Java, .NET, Python, PHP, etc. Using the instrumentation, the agents can record events like HTTP requests, database queries, and messaging events.

To record events, for example, when an HTTP request starts and ends, the agent will need to hook into the web framework of your choice. The agent provider generally will provide a list of supported technologies and frameworks. In the case of the HTTP request example, the agent will dynamically add some code around the web framework to record when an HTTP request starts and ends.

So what do you actually get by using these agents? In the example of Elastic APM, you'll be able to see a timeline visualization like below.

We can see which HTTP requests are fired and what database calls are being made. This information can be extremely useful when trying to understand what's happening within your application, for example, when looking into a performance issue.

What if your framework is not supported?

But what to do when your framework of choice is not supported by the agent? By not doing anything, you might be missing out on valuable information. So, is there anything we can do? Luckily, there is! Most APM agents will offer some way of adding manual instrumentation to your code. This might be suitable if you want to add some instrumentation for a very specific use case. But what if, for example, your database framework is not supported? Adding manual instrumentation for each query is not an ideal solution.

Today, we'll be looking at an example where a Java application was using R2DBC (a database framework) and Elastic as the APM provider. The Elastic APM agent offers a plugin API, so we can write our own instrumentation without needing to instrument each individual query.

How does the APM agent work?

Before we look into how we can write a plugin for our example case, let's dive into how the Elastic APM agent works. The Elastic APM agent uses bytecode manipulation to instrument code. By using bytecode manipulation, it can modify Java classes at runtime, allowing the agent to change a class without recompiling it. What will typically happen is that the agent will add some code when an instrumented method is entered and exited.

A good example to demonstrate how this works in practice is the Servlet API. The Servlet API in Java is the main entry point for most HTTP servers. There are many implementations, but by instrumenting on the API level, it doesn't matter which implementation is used. The main entry point for the Servlet API is the service method1. We could write instrumentation that would indicate the start of an HTTP request the moment the service method is entered and indicate the end of the HTTP request when the method is exited.

Let's write our R2DBC plugin

We now have a basic idea of how the APM agent works, so we can have a look at how we could write a plugin for our problem at hand. The aim is to be able to record queries that have been executed, including SQL statements. We first need to identify what our entry point will be to instrument. The most obvious choice seems to be io.r2dbc.spi.Statement#execute[^2]. According to the javadoc, this method is responsible for Executes one or more SQL statements and returns the Results. There is one problem, however, the Statement API doesn't have any reference to the SQL statement. There are probably some ways around this, but to keep it simple, we'll instrument a specific implementation: io.r2dbc.postgresql.PostgresqlStatement. This implementation has an execute method which takes the SQL statement as a parameter.

Now that we know what to instrument, how do we actually record a query that has been executed? Elastic is using OpenTelemetry2 to record events. In the terminology, we call a request a 'trace', and within a trace, we can have multiple or nested 'spans'. A span describes a single unit of work, for example, a database query. OpenTelemetry also defines conventions on how to record data about the specific unit of work3.

Elastic offers a plugin API. To define the plugin, we need to extend ElasticApmInstrumentation. This offers a couple of overrides to define the plugin. First of all, let's define the matchers, describing what we want to instrument:

@Override
public ElementMatcher<? super TypeDescription> getTypeMatcher() {
    return named("io.r2dbc.postgresql.PostgresqlStatement");
}

@Override
public ElementMatcher<? super MethodDescription> getMethodMatcher() {
    return named("execute").and(takesArgument(0, named("java.lang.String")));
}

We can see we match on 2 things here, the class name of the type we want to match and the method we want to match on. Now we need to define what we need to do when that method is called:

@Override
public String getAdviceClassName() {
    return "nl.skyworkz.apm.agent.r2dbc.postgresql.R2dbcPostgresqlInstrumentation$HandleExecuteStatementAdvice";
}

public static class HandleExecuteStatementAdvice {

    private static final SignatureParser signatureParser = new SignatureParser();

    @Advice.OnMethodExit(inline = false)
    @Advice.AssignReturned.ToReturned(typing = Assigner.Typing.DYNAMIC)
    public static Flux<PostgresqlResult> onExitExecute(@Advice.Argument(0) String sql,
                                                       @Advice.Return Flux<io.r2dbc.postgresql.api.PostgresqlResult> result) {
        return Mono.defer(() -> Mono.just(createSpan(sql)))
                .flatMapMany(spanBuilder -> {
                    Span span = spanBuilder.startSpan();

                    return result
                            .doOnComplete(() -> {
                                span.setStatus(StatusCode.OK).end();
                            })
                            .doOnError(throwable -> {
                                span.recordException(throwable).end();
                            });
                });
    }

    /**
     * See https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/semantic_conventions/database.md
     */
    private static SpanBuilder createSpan(String sql) {
        StringBuilder

We wrap around the Reactor type here to start the span before we subscribe to the query result and once it completes we close the span. The span contains the statement (query) being executed. The query will now show up in our timeline (like in the screenshot earlier) and we'll be able to see a summary of the statement and how long it took to execute that statement.

The signature parser that is being used is a copy of the SignatureParser class from the JDBC plugin of the APM agent4. To package the plugin a few more steps are needed, you can find them on the Elastic website: https://www.elastic.co/guide/en/apm/agent/java/current/plugin-api.html

Footnotes

I tried coding with AI, and became its micro-manager instead

AI evangelists claim it can replace junior developers. I decided to put that to the test and tried Vibe Coding for a simple project. See how it went here.

May 14, 2025

I tried coding with AI, and became its micro-manager instead

AI evangelists claim it can replace junior developers. I decided to put that to the test and tried Vibe Coding for a simple project. See how it went here.

May 14, 2025

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

In this blogpost we will examine the key risks and considerations when using U.S. cloud providers and when a European alternative might be a better fit.

Apr 2, 2025

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

In this blogpost we will examine the key risks and considerations when using U.S. cloud providers and when a European alternative might be a better fit.

Apr 2, 2025

I made my multi-arch Docker image 10x faster

This blog post discusses a performance issue affecting multi-architecture Docker images on Apple Silicon, and how to fix it.

May 24, 2024

I made my multi-arch Docker image 10x faster

This blog post discusses a performance issue affecting multi-architecture Docker images on Apple Silicon, and how to fix it.

May 24, 2024

I tried coding with AI, and became its micro-manager instead

AI evangelists claim it can replace junior developers. I decided to put that to the test and tried Vibe Coding for a simple project. See how it went here.

May 14, 2025

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

In this blogpost we will examine the key risks and considerations when using U.S. cloud providers and when a European alternative might be a better fit.

Apr 2, 2025

I made my multi-arch Docker image 10x faster

This blog post discusses a performance issue affecting multi-architecture Docker images on Apple Silicon, and how to fix it.

May 24, 2024

Working your way through pesky issue(s) in Prometheus and Thanos

A war story describing how a restart wreaked havoc on an observability stack running on Prometheus and Thanos

Apr 16, 2024

Ready to Transform Your Cloud Strategy and Empower Your Team?

Let's connect and discuss how Skyworkz can help you achieve lasting cloud success.

Schedule a Free Consultation

Ready to Transform Your Cloud Strategy and Empower Your Team?

Let's connect and discuss how Skyworkz can help you achieve lasting cloud success.

Schedule a Free Consultation

Ready to Transform Your Cloud Strategy and Empower Your Team?

Let's connect and discuss how Skyworkz can help you achieve lasting cloud success.

Schedule a Free Consultation

Writing a plugin for the Elastic APM Java agent

What if your framework is not supported?

How does the APM agent work?

Let's write our R2DBC plugin

Footnotes

Read more

I tried coding with AI, and became its micro-manager instead

I tried coding with AI, and became its micro-manager instead

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

I made my multi-arch Docker image 10x faster

I made my multi-arch Docker image 10x faster

I tried coding with AI, and became its micro-manager instead

Why European Companies Should Rethink Their Dependence on U.S. Cloud Providers

I made my multi-arch Docker image 10x faster

Working your way through pesky issue(s) in Prometheus and Thanos

Ready to Transform Your Cloud Strategy and Empower Your Team?

Ready to Transform Your Cloud Strategy and Empower Your Team?

Ready to Transform Your Cloud Strategy and Empower Your Team?