How to Wait for Java API Request Completion
The modern software landscape is inextricably linked with Application Programming Interfaces (APIs). From fetching data for a web application to orchestrating microservices or interacting with sophisticated AI models, API calls are the lifeblood of interconnected systems. For Java developers, efficiently handling these API requests, particularly waiting for their completion, is a fundamental challenge that dictates an application's responsiveness, scalability, and resource utilization. A naive, blocking approach can quickly lead to unresponsive user interfaces, thread starvation, and abysmal performance, transforming a potentially dynamic application into a sluggish bottleneck. This comprehensive guide delves into various strategies for managing Java API request completion, moving beyond simple synchronous calls to embrace the power of asynchronous programming, concurrency utilities, and reactive paradigms. We will explore each method's intricacies, provide practical code examples, discuss their advantages and disadvantages, and offer insights into choosing the most appropriate strategy for different scenarios. Our journey will equip you with the knowledge to build highly performant, resilient, and scalable Java applications that seamlessly interact with external services.
Understanding API Interactions in Java: The Foundation
Before diving into waiting mechanisms, it's crucial to grasp the nature of API interactions in Java. An API request fundamentally involves your application initiating communication with an external service, often over a network. This could be an HTTP API call to a REST endpoint, a gRPC request to a microservice, or a database query. Regardless of the protocol, these operations share a common characteristic: they are I/O-bound. This means a significant portion of the operation's duration is spent waiting for data to be sent or received over a network, rather than performing intensive CPU computations.
The Blocking Nature of I/O
Traditionally, many I/O operations in Java are blocking. When your code makes a blocking API call, the thread executing that code pauses its execution and waits until the API response is fully received or an error occurs. While conceptually straightforward, this blocking behavior has profound implications:
- Resource Inefficiency: A thread that is blocked is consuming system resources (memory, CPU context) but is not performing any useful computation. If many threads are blocked simultaneously waiting for API responses, system resources can be quickly exhausted, leading to performance degradation or even application crashes.
- Lack of Responsiveness: In applications with a user interface, a blocking API call on the main thread will cause the UI to freeze, leading to a poor user experience. In server-side applications, blocking calls can prevent other requests from being processed, severely limiting throughput and scalability.
- Increased Latency: If one API call is slow, it directly impacts the overall response time of your application, as subsequent operations on that thread cannot proceed until the slow call completes.
Understanding these inherent challenges sets the stage for exploring non-blocking and asynchronous approaches, which are designed to mitigate these issues by allowing threads to perform other useful work while waiting for I/O operations to complete. The goal is always to maximize the utility of available threads and ensure a responsive system, especially when dealing with multiple or potentially slow API invocations.
The Synchronous Paradigm: Simplicity and its Limitations
The most straightforward way to make an API request in Java is synchronously. In this model, the thread that initiates the API call will halt its execution and wait for the response to arrive before proceeding to the next line of code.
How Synchronous Calls Work
Consider a basic HTTP API call using Java's built-in HttpClient (introduced in Java 11):
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

public class SynchronousApiClient {

    public static void main(String[] args) {
        HttpClient client = HttpClient.newBuilder()
                .version(HttpClient.Version.HTTP_2)
                .followRedirects(HttpClient.Redirect.NORMAL)
                .connectTimeout(Duration.ofSeconds(10))
                .build();

        String apiUrl = "https://jsonplaceholder.typicode.com/todos/1"; // Example API

        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create(apiUrl))
                .timeout(Duration.ofSeconds(20))
                .header("Accept", "application/json")
                .GET()
                .build();

        long startTime = System.currentTimeMillis();
        System.out.println("Initiating synchronous API request...");

        try {
            // The code execution blocks here until the API call completes
            // or a timeout/error occurs.
            HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());

            if (response.statusCode() == 200) {
                System.out.println("API Response received (Status: " + response.statusCode() + "):");
                System.out.println(response.body());
            } else {
                System.err.println("API Request failed with status code: " + response.statusCode());
            }
        } catch (IOException | InterruptedException e) {
            System.err.println("Error during API request: " + e.getMessage());
            if (e instanceof InterruptedException) {
                Thread.currentThread().interrupt(); // Restore interrupt status
            }
        }

        long endTime = System.currentTimeMillis();
        System.out.println("Synchronous API request completed in " + (endTime - startTime) + " ms.");
        System.out.println("Proceeding with other tasks after API completion...");
        // Any code here will only execute after the API call has finished.
    }
}
In this example, the client.send() method is a blocking call. The main thread pauses its execution until it receives the HttpResponse object. While simple to understand and implement for a single, isolated API call, this approach quickly reveals its shortcomings in more complex or high-throughput scenarios.
Disadvantages of Synchronous API Calls
The simplicity of synchronous API calls comes at a significant cost, making them unsuitable for many modern applications:
- UI Freezing (Client-side Applications): In desktop or mobile applications, performing a synchronous API call on the Event Dispatch Thread (EDT) or UI thread will cause the entire application to become unresponsive until the API response is received. Users will perceive the application as frozen, leading to a frustrating experience.
- Poor Scalability (Server-side Applications): In a web server or microservice that handles many concurrent API requests, if each incoming request uses a thread that then makes a blocking external API call, that thread becomes idle while waiting. If the external API is slow, many server threads can become blocked simultaneously. This leads to:
- Thread Starvation: All available threads in the server's thread pool might get consumed, preventing new incoming requests from being processed.
- High Resource Consumption: Maintaining a large number of blocked threads still consumes memory and CPU context, wasting valuable server resources.
- Reduced Throughput: The rate at which the server can process requests plummets because threads are tied up waiting instead of doing useful work.
- Inefficient Resource Utilization: As mentioned, a blocked thread does nothing but wait. This is an inefficient use of CPU cycles and memory, especially when the waiting time is substantial (e.g., waiting for a slow network API).
- Difficulty with Concurrency: If your application needs to make multiple independent API calls simultaneously, a synchronous approach would force them to execute sequentially, drastically increasing the total execution time. To achieve concurrency synchronously, you'd have to manually manage new threads, which then leads to the complexities of thread management and synchronization.
Given these significant drawbacks, especially in the context of responsive UIs, scalable backend services, and efficient resource management, it becomes clear that relying solely on synchronous API calls is often an anti-pattern. The next logical step is to embrace asynchronous programming.
Embracing Asynchronicity: The Foundation for Responsiveness
Asynchronous programming is a paradigm shift designed to overcome the limitations of synchronous API calls. Instead of blocking the current thread while waiting for an API response, an asynchronous operation initiates the API request and immediately returns control to the calling thread. The actual work of waiting for the response and processing it happens in the background, often on a different thread or using non-blocking I/O mechanisms.
What is Asynchronous Programming?
At its core, asynchronous programming means "do not wait." When you make an asynchronous API call:
- Your code initiates the API request.
- The method returns almost immediately, typically providing a "handle" or "promise" for the future result.
- Your thread can then continue with other tasks, perform other computations, or initiate other API calls.
- When the API response eventually arrives, a predefined callback or continuation mechanism is triggered to process the result.
This non-blocking nature is fundamental to building high-performance and responsive applications.
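The four-step flow above maps directly onto java.net.http.HttpClient's sendAsync method, which returns a CompletableFuture immediately instead of blocking. A minimal sketch (the class and method names are illustrative; the URL is the same placeholder service used elsewhere in this guide):

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.concurrent.CompletableFuture;

public class AsyncFlowSketch {

    private static final HttpClient CLIENT = HttpClient.newHttpClient();

    // Step 1: build the request.
    static HttpRequest buildRequest(String url) {
        return HttpRequest.newBuilder()
                .uri(URI.create(url))
                .timeout(Duration.ofSeconds(10))
                .GET()
                .build();
    }

    public static void main(String[] args) {
        // Step 2: sendAsync returns immediately with a CompletableFuture "handle".
        CompletableFuture<Integer> statusFuture =
                CLIENT.sendAsync(buildRequest("https://jsonplaceholder.typicode.com/todos/1"),
                                 HttpResponse.BodyHandlers.ofString())
                      .thenApply(HttpResponse::statusCode); // Step 4: continuation runs when the response arrives

        // Step 3: this thread is free to do other work while the call is in flight.
        System.out.println("Request in flight; main thread is not blocked.");

        // Block only at the very end of the demo so the JVM doesn't exit early.
        statusFuture.thenAccept(code -> System.out.println("Completed with status " + code)).join();
    }
}
```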
Benefits of Asynchronous Operations
The adoption of asynchronous API request completion brings a multitude of advantages:
- Enhanced Responsiveness: In client-side applications, the UI remains fluid and interactive because the main thread is never blocked. In server-side applications, incoming requests can always be picked up, preventing thread starvation and ensuring continuous service availability.
- Improved Scalability: By not blocking threads, an application can handle a much larger number of concurrent API calls and client requests with fewer threads. Threads are quickly freed up to process other tasks rather than idling, leading to better utilization of server resources. This directly translates to higher throughput and better performance under heavy load.
- Efficient Resource Utilization: Threads are kept busy performing actual work instead of waiting. This minimizes the overhead associated with managing a large number of blocked threads and makes more efficient use of CPU cores.
- Parallelism and Concurrency: Asynchronous approaches naturally lend themselves to executing multiple API calls in parallel. You can initiate several independent API requests concurrently and then wait for all of them to complete, significantly reducing the overall execution time compared to sequential synchronous calls.
- Better User Experience: For end-users, this means faster loading times, more interactive interfaces, and a generally smoother experience, even when interacting with backend services that might have varying response times.
Core Concepts in Asynchronous Java
Java offers several evolving mechanisms to implement asynchronous API call completion:
- Callbacks: A function or method passed as an argument to another function, which is then invoked when the asynchronous operation completes. While simple, managing nested callbacks (callback hell) can become complex.
- Futures/Promises: An object representing the result of an asynchronous operation that may not have completed yet. You can check if the operation is done, cancel it, and retrieve its result once available. Java's Future interface is a prime example; CompletableFuture extends this concept with powerful chaining and composition capabilities.
- Reactive Streams: A specification for asynchronous stream processing with non-blocking backpressure. Libraries like Project Reactor and RxJava implement this, allowing you to model API calls as data streams that can be composed, transformed, and reacted to in a highly declarative manner.
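To make the callback concept concrete, here is a minimal hand-rolled sketch (class and method names are illustrative, and the "API call" is simulated with a string so the example stays self-contained). Note how the caller hands in "what to do with the result" rather than waiting for it:

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Consumer;

public class CallbackSketch {

    // Callback style: pass success and error handlers into the async method.
    static void fetchAsync(String fakeUrl, ExecutorService pool,
                           Consumer<String> onSuccess, Consumer<Exception> onError) {
        pool.execute(() -> {
            try {
                String response = "response-from:" + fakeUrl; // stand-in for a real API call
                onSuccess.accept(response);
            } catch (Exception e) {
                onError.accept(e);
            }
        });
    }

    public static void main(String[] args) throws InterruptedException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        CountDownLatch done = new CountDownLatch(1);

        fetchAsync("https://api.example.com/todo/1", pool,
                result -> { System.out.println("Got: " + result); done.countDown(); },
                error  -> { System.err.println("Failed: " + error); done.countDown(); });

        System.out.println("Main thread free while the callback waits for the result.");
        done.await();
        pool.shutdown();
    }
}
```

Nesting a second fetchAsync inside the first callback (and a third inside that) is exactly how "callback hell" arises, which is what Futures and CompletableFuture are designed to avoid.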
In the following sections, we will delve into specific Java constructs that enable these asynchronous patterns, illustrating how to effectively wait for API request completion without blocking the entire application.
Method 1: Basic Threading for Asynchronous API Calls
The most fundamental way to introduce asynchronicity in Java is by explicitly creating and managing threads. Instead of making a blocking API call on the main thread, you can offload this task to a new thread, allowing the main thread to continue its execution.
Using java.lang.Thread
The java.lang.Thread class allows you to spawn a new thread of execution. When you need to make an API call without blocking the current thread, you can wrap the API logic in a Runnable and execute it on a new Thread.
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

public class BasicThreadingApiClient {

    public static void main(String[] args) throws InterruptedException {
        System.out.println("Main thread started. ID: " + Thread.currentThread().getId());
        String apiUrl = "https://jsonplaceholder.typicode.com/posts/1"; // Example API

        // Create a Runnable to encapsulate the API call logic
        Runnable apiCaller = () -> {
            System.out.println("API thread started. ID: " + Thread.currentThread().getId());
            HttpClient client = HttpClient.newBuilder()
                    .version(HttpClient.Version.HTTP_2)
                    .connectTimeout(Duration.ofSeconds(5))
                    .build();
            HttpRequest request = HttpRequest.newBuilder()
                    .uri(URI.create(apiUrl))
                    .timeout(Duration.ofSeconds(10))
                    .GET()
                    .build();
            try {
                HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
                if (response.statusCode() == 200) {
                    System.out.println("API Thread: Received response (Status: " + response.statusCode() + ")");
                    // Process response.body()
                    // System.out.println(response.body().substring(0, 100) + "..."); // Print partial body
                } else {
                    System.err.println("API Thread: Request failed with status code: " + response.statusCode());
                }
            } catch (IOException | InterruptedException e) {
                System.err.println("API Thread: Error during API request: " + e.getMessage());
                if (e instanceof InterruptedException) {
                    Thread.currentThread().interrupt(); // Restore interrupt status
                }
            } finally {
                System.out.println("API thread finished. ID: " + Thread.currentThread().getId());
            }
        };

        // Create and start a new Thread
        Thread apiThread = new Thread(apiCaller, "API-Worker-Thread");
        apiThread.start(); // This method returns immediately.

        System.out.println("Main thread continues executing other tasks while API call is in progress...");
        // Simulate some other work in the main thread
        Thread.sleep(100); // Small delay to show main thread is active
        System.out.println("Main thread performed some other operations.");

        // Waiting for the API thread to complete using join()
        System.out.println("Main thread is now waiting for the API thread to complete...");
        apiThread.join(15000); // Wait for API thread to finish, with a timeout of 15 seconds.
        // If the API thread doesn't finish within 15s, the main thread proceeds.

        if (apiThread.isAlive()) {
            System.out.println("API thread did not complete within the timeout. Main thread proceeding.");
            apiThread.interrupt(); // Optionally interrupt the API thread if it's still running
        } else {
            System.out.println("API thread has completed its execution.");
        }
        System.out.println("Main thread finished. ID: " + Thread.currentThread().getId());
    }
}
In this setup, apiThread.start() immediately returns, allowing the main thread to continue executing System.out.println("Main thread continues...") and other logic. The apiCaller Runnable runs concurrently in its own thread.
Waiting for a Thread to Complete: thread.join()
The thread.join() method is used when the current thread needs to wait for the specified thread to die. If thread.join() is called without arguments, the current thread will block indefinitely until the target thread completes. A timeout version, thread.join(long millis), allows the current thread to wait for a specified maximum amount of time. If the target thread doesn't complete within that time, the current thread unblocks and continues.
While thread.join() provides a way to wait, it inherently reintroduces a blocking point in the calling thread. The key difference is that now you choose when and where to block, and you're blocking for the completion of a background task, not the I/O itself.
Limitations of Basic Threading
While simple Thread creation offers immediate asynchronous benefits, it comes with significant limitations:
- Thread Management Overhead: Creating a new Thread for every API call is resource-intensive. Thread creation and destruction involve system calls, which are costly. For applications making many frequent API calls, this overhead can degrade performance.
- Resource Exhaustion: An unmanaged proliferation of threads can quickly exhaust system memory and CPU, leading to OutOfMemoryError or severe performance degradation. The operating system has limits on the number of threads it can manage.
- Difficulty in Error Propagation: If an exception occurs in the new thread, it's not automatically propagated back to the calling thread. You need explicit mechanisms (e.g., shared variables, Thread.UncaughtExceptionHandler) to handle errors.
- No Return Value: Runnable instances do not directly return a value. To get a result from the API call back to the main thread, you typically need shared mutable state, which introduces synchronization challenges and potential race conditions.
- Limited Control: Basic Thread objects offer minimal control over thread lifecycle, scheduling, or pooling.
Due to these drawbacks, directly managing java.lang.Thread instances for every asynchronous API request is generally discouraged for anything beyond very simple, isolated tasks. A more robust and managed approach is typically preferred, leading us to the Executor Framework.
Method 2: Leveraging Java's Executor Framework
To overcome the limitations of manually managing java.lang.Thread objects, Java introduced the Executor Framework in java.util.concurrent. This framework separates the concerns of task submission from task execution, providing a managed environment for running asynchronous tasks. It primarily revolves around ExecutorService and Executors.
Introduction to ExecutorService and Executors
- Executor: A simple interface with a single method, execute(Runnable command), which executes the given command at some time in the future.
- ExecutorService: An extension of Executor that provides methods for managing the lifecycle of tasks and the executor itself (e.g., submit(), invokeAll(), shutdown()). It manages a pool of worker threads, reusing them for multiple tasks, thereby reducing the overhead of thread creation and destruction.
- Executors: A utility class that provides static factory methods for creating common ExecutorService configurations.
Types of Thread Pools (and when to use them for API calls)
Executors offers several pre-configured ExecutorService types:
- newFixedThreadPool(int nThreads): Creates a thread pool with a fixed number of threads. If all threads are busy, new tasks wait in a queue. This is excellent for API calls where you want to cap the number of concurrent external requests to prevent overwhelming the external service or your own application's resources. It's often suitable for I/O-bound tasks.
- newCachedThreadPool(): Creates a thread pool that creates new threads as needed but reuses previously constructed threads when they are available. If threads are idle for a certain period (60 seconds by default), they are terminated. This can be useful for applications with many short-lived, bursty API calls. However, if the API calls are long-running, it can potentially create an uncontrolled number of threads, leading to resource exhaustion.
- newSingleThreadExecutor(): Creates an ExecutorService that uses a single worker thread. Tasks are executed sequentially in the order they are submitted. Useful for ensuring sequential execution of API calls or events.
- newScheduledThreadPool(int corePoolSize): Creates a thread pool that can schedule commands to run after a given delay, or to execute periodically. Useful for polling APIs or background refresh tasks.
For most API interaction scenarios, newFixedThreadPool is often the preferred choice, allowing controlled concurrency.
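As a sketch of that recommendation, a fixed pool dedicated to outbound API calls might be set up like this (the pool size, thread-name prefix, and class name are illustrative choices, not requirements; named threads make the pool easy to spot in thread dumps):

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

public class ApiPoolFactory {

    // A ThreadFactory that gives worker threads recognizable names.
    static ThreadFactory namedFactory(String prefix) {
        AtomicInteger counter = new AtomicInteger();
        return r -> {
            Thread t = new Thread(r, prefix + "-" + counter.incrementAndGet());
            t.setDaemon(true); // don't keep the JVM alive just for idle workers
            return t;
        };
    }

    // A fixed pool caps concurrent outbound API calls (here: at most 5 in flight).
    static ExecutorService apiPool() {
        return Executors.newFixedThreadPool(5, namedFactory("api-worker"));
    }

    public static void main(String[] args) {
        ExecutorService pool = apiPool();
        pool.execute(() -> System.out.println("Running on " + Thread.currentThread().getName()));
        pool.shutdown();
    }
}
```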
submit() and execute()
- execute(Runnable command): Used for Runnable tasks that don't return a result.
- submit(Callable<T> task): Used for Callable tasks (which can return a result of type T and throw checked exceptions) or Runnable tasks. It returns a Future<T> object, which represents the pending result of the task.
Retrieving Results with Future<T> and future.get()
The Future<T> interface is a crucial component for waiting for API call completion with the Executor Framework. It represents the result of an asynchronous computation.
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.concurrent.*;

public class ExecutorServiceApiClient {

    public static void main(String[] args) {
        // Create a fixed-size thread pool for API calls,
        // limiting to 5 concurrent API calls
        ExecutorService executor = Executors.newFixedThreadPool(5);
        System.out.println("Main thread started. ExecutorService initialized.");

        String apiUrl1 = "https://jsonplaceholder.typicode.com/todos/1"; // Example API 1
        String apiUrl2 = "https://jsonplaceholder.typicode.com/todos/2"; // Example API 2 (might be slower)
        String apiUrl3 = "https://jsonplaceholder.typicode.com/posts/1"; // Example API 3

        Callable<String> apiTask1 = () -> callApi(apiUrl1, "API Task 1");
        Callable<String> apiTask2 = () -> callApi(apiUrl2, "API Task 2");
        Callable<String> apiTask3 = () -> callApi(apiUrl3, "API Task 3");

        long startTime = System.currentTimeMillis();

        // Submit tasks to the executor and get Future objects
        Future<String> future1 = executor.submit(apiTask1);
        Future<String> future2 = executor.submit(apiTask2);
        Future<String> future3 = executor.submit(apiTask3);

        System.out.println("Main thread submitted API tasks. Continuing with other work...");

        // Simulate other work in the main thread
        try {
            Thread.sleep(50);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        System.out.println("Main thread performed some other immediate tasks.");

        // Now, wait for the API calls to complete and retrieve results.
        // Using future.get() is blocking for the current thread.
        try {
            System.out.println("\nMain thread waiting for API results (Task 1)...");
            String result1 = future1.get(10, TimeUnit.SECONDS); // Wait for up to 10 seconds
            System.out.println("Result from Task 1: " + result1);

            System.out.println("\nMain thread waiting for API results (Task 2)...");
            String result2 = future2.get(10, TimeUnit.SECONDS);
            System.out.println("Result from Task 2: " + result2);

            System.out.println("\nMain thread waiting for API results (Task 3)...");
            String result3 = future3.get(10, TimeUnit.SECONDS);
            System.out.println("Result from Task 3: " + result3);
        } catch (InterruptedException e) {
            System.err.println("Main thread interrupted while waiting for API result: " + e.getMessage());
            Thread.currentThread().interrupt();
        } catch (ExecutionException e) {
            System.err.println("Exception occurred during API execution: " + e.getCause().getMessage());
        } catch (TimeoutException e) {
            System.err.println("API call timed out: " + e.getMessage());
            future1.cancel(true); // Attempt to cancel the underlying tasks
            future2.cancel(true);
            future3.cancel(true);
        } finally {
            // It is crucial to shut down the executor service when done to release resources
            executor.shutdown();
            try {
                if (!executor.awaitTermination(5, TimeUnit.SECONDS)) {
                    System.err.println("Executor did not terminate in time. Forcing shutdown.");
                    executor.shutdownNow(); // Attempt to stop all actively executing tasks
                }
            } catch (InterruptedException e) {
                System.err.println("Shutdown interrupted: " + e.getMessage());
                executor.shutdownNow();
                Thread.currentThread().interrupt();
            }
        }

        long endTime = System.currentTimeMillis();
        System.out.println("\nAll API requests processed in " + (endTime - startTime) + " ms.");
        System.out.println("Main thread finished.");
    }

    private static String callApi(String apiUrl, String taskName) throws IOException, InterruptedException {
        System.out.println(taskName + ": Starting API call to " + apiUrl + " on thread " + Thread.currentThread().getName());
        HttpClient client = HttpClient.newBuilder()
                .version(HttpClient.Version.HTTP_2)
                .connectTimeout(Duration.ofSeconds(5))
                .build();
        HttpRequest request = HttpRequest.newBuilder()
                .uri(URI.create(apiUrl))
                .timeout(Duration.ofSeconds(10))
                .GET()
                .build();
        HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
        if (response.statusCode() == 200) {
            System.out.println(taskName + ": Received successful response.");
            // Return a snippet of the body to avoid printing very long responses
            return "Status: " + response.statusCode() + ", Body Snippet: "
                    + response.body().substring(0, Math.min(response.body().length(), 100)) + "...";
        } else {
            throw new IOException(taskName + ": API request failed with status code: " + response.statusCode());
        }
    }
}
The future.get() method is blocking. When you call future.get(), the current thread (in this case, the main thread) will pause its execution until the corresponding task submitted to the ExecutorService completes and its result is available. The version future.get(long timeout, TimeUnit unit) allows you to specify a maximum waiting time, preventing indefinite blocking.
Handling TimeoutException and InterruptedException
- TimeoutException: If the get() method with a timeout is used, and the task does not complete within the specified time, a TimeoutException is thrown. This allows your application to gracefully handle slow or unresponsive APIs without blocking indefinitely.
- InterruptedException: If the current thread (the one calling get()) is interrupted while waiting for the result, an InterruptedException is thrown.
- ExecutionException: If the task executed by the ExecutorService throws an exception, that exception is wrapped in an ExecutionException and thrown by future.get(). You can retrieve the original cause using e.getCause().
Managing Shutdown of ExecutorService
It's crucial to properly shut down an ExecutorService when it's no longer needed to release system resources.
- shutdown(): Initiates an orderly shutdown. Previously submitted tasks are executed, but no new tasks will be accepted.
- shutdownNow(): Attempts to stop all actively executing tasks, halts the processing of waiting tasks, and returns a list of the tasks that were awaiting execution.
- awaitTermination(long timeout, TimeUnit unit): Blocks until all tasks have completed execution after a shutdown request, or the timeout occurs, or the current thread is interrupted, whichever happens first.
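These three methods combine into a standard two-phase shutdown idiom, close in spirit to the pattern shown in the ExecutorService Javadoc (the timeouts and class name here are illustrative):

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class ShutdownIdiom {

    // Phase 1: stop accepting work and wait. Phase 2: force-interrupt if needed.
    static void shutdownAndAwait(ExecutorService pool) {
        pool.shutdown(); // no new tasks accepted; queued tasks still run
        try {
            if (!pool.awaitTermination(5, TimeUnit.SECONDS)) {
                pool.shutdownNow(); // interrupt actively running tasks
                if (!pool.awaitTermination(5, TimeUnit.SECONDS)) {
                    System.err.println("Pool did not terminate cleanly.");
                }
            }
        } catch (InterruptedException e) {
            pool.shutdownNow();
            Thread.currentThread().interrupt(); // preserve interrupt status
        }
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(2);
        pool.execute(() -> System.out.println("Task ran."));
        shutdownAndAwait(pool);
        System.out.println("Pool terminated: " + pool.isTerminated());
    }
}
```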
The Executor Framework provides a much more robust and manageable way to handle asynchronous API calls compared to raw threads. It abstracts away thread creation and management, allowing developers to focus on task logic. However, Future.get() is still a blocking operation. While better than blocking the main application thread for the entire duration of an API call, it still creates a blocking point when you need the result. For complex asynchronous workflows, nested future.get() calls can lead to unreadable and error-prone code. This limitation paved the way for more advanced non-blocking composition patterns, such as CompletableFuture.
Method 3: The Power of CompletableFuture for Fluent Asynchronicity
Java 8 introduced CompletableFuture, a significant enhancement to the Future interface. It provides a rich set of methods for non-blocking composition, chaining, and error handling, making it the preferred choice for building complex asynchronous workflows involving API calls. CompletableFuture implements CompletionStage and Future, allowing for a more declarative and fluent style of asynchronous programming.
Why CompletableFuture?
Traditional Future objects are passive; you can only check if a task is done, cancel it, or block until it's finished using get(). CompletableFuture overcomes this by allowing you to:
- Chain operations: Define what should happen after a future completes, without blocking.
- Combine futures: Wait for multiple futures to complete and then process their results.
- Handle errors declaratively: Attach error handling logic directly to the future.
- Complete manually: Explicitly complete a future with a value or an exception.
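The last point, manual completion, can be sketched in a few lines (class name is illustrative). This is the mechanism adapters use to bridge callback-based APIs into the CompletableFuture world: some other component calls complete() or completeExceptionally() when the result arrives.

```java
import java.util.concurrent.CompletableFuture;

public class ManualCompletion {

    public static void main(String[] args) {
        // An uncompleted future, handed out to callers while the work is pending.
        CompletableFuture<String> future = new CompletableFuture<>();

        // Some other thread (e.g. a network event handler) completes it later:
        new Thread(() -> future.complete("payload")).start();

        // join() waits for the manual completion.
        System.out.println("Received: " + future.join());

        // completeExceptionally instead fails the future and all its dependents:
        CompletableFuture<String> failed = new CompletableFuture<>();
        failed.completeExceptionally(new RuntimeException("upstream error"));
        System.out.println("Failed exceptionally: " + failed.isCompletedExceptionally());
    }
}
```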
Creating CompletableFuture Instances
You can create CompletableFuture instances in several ways:
- CompletableFuture.supplyAsync(Supplier<U> supplier): Runs the supplier task asynchronously and completes the future with its result. Useful for tasks that return a value.
- CompletableFuture.runAsync(Runnable runnable): Runs the runnable task asynchronously and completes the future when it finishes (returns CompletableFuture<Void>). Useful for tasks that don't return a value.
- new CompletableFuture<T>(): Creates an uncompleted CompletableFuture that can be manually completed later using complete(T value) or completeExceptionally(Throwable ex).
- CompletableFuture.completedFuture(U value): Returns a CompletableFuture that is already completed with the given value.
By default, supplyAsync and runAsync use the ForkJoinPool.commonPool(). You can also provide a custom Executor for more fine-grained control over the thread pool, which is recommended for I/O-bound tasks like API calls.
Chaining Operations: The Essence of Non-Blocking Composition
The real power of CompletableFuture lies in its ability to chain dependent asynchronous operations without blocking.
- thenApply(Function<T, U> fn): Processes the result of the previous stage asynchronously and returns a new CompletableFuture with the transformed result.
- thenAccept(Consumer<T> action): Consumes the result of the previous stage but doesn't return a value (returns CompletableFuture<Void>).
- thenRun(Runnable action): Executes a Runnable after the previous stage completes, ignoring its result.
- thenCompose(Function<T, CompletionStage<U>> fn): A flatMap-like operation. When the previous stage completes, fn is applied to its result and must return a CompletionStage. This is used to flatten nested CompletableFutures (e.g., when the result of one API call is used to make another API call).
- thenCombine(CompletionStage<? extends U> other, BiFunction<? super T, ? super U, ? extends V> fn): Combines the results of two independent CompletableFutures once both complete.
- whenComplete(BiConsumer<? super T, ? super Throwable> action) / exceptionally(Function<Throwable, ? extends T> fn) / handle(BiFunction<? super T, Throwable, ? extends U> fn): For error handling and side effects.
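For instance, two independent API calls can run concurrently and be merged with thenCombine. A minimal self-contained sketch (the "API calls" are simulated with supplyAsync so the example runs without a network; names are illustrative):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class CombineSketch {

    // A simulated API call: returns its result asynchronously on the given pool.
    static CompletableFuture<String> fetch(String what, ExecutorService pool) {
        return CompletableFuture.supplyAsync(() -> "result-of-" + what, pool);
    }

    public static void main(String[] args) {
        ExecutorService pool = Executors.newFixedThreadPool(2);

        // Both calls are in flight at the same time.
        CompletableFuture<String> userCall = fetch("user", pool);
        CompletableFuture<String> ordersCall = fetch("orders", pool);

        // thenCombine waits for BOTH futures, then merges their results.
        String merged = userCall
                .thenCombine(ordersCall, (user, orders) -> user + " + " + orders)
                .join();

        System.out.println(merged); // prints: result-of-user + result-of-orders
        pool.shutdown();
    }
}
```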
Code Example: CompletableFuture for Chained API Calls
Let's illustrate with an example where the result of one API call informs the next:
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.concurrent.*;

public class CompletableFutureApiClient {

    private static final HttpClient client = HttpClient.newBuilder()
            .version(HttpClient.Version.HTTP_2)
            .connectTimeout(Duration.ofSeconds(5))
            .build();

    // Use a custom executor for API calls to avoid commonPool starvation if tasks block
    private static final ExecutorService apiExecutor = Executors.newFixedThreadPool(10,
            r -> {
                Thread t = new Thread(r);
                t.setName("API-CF-Worker-" + t.getId());
                t.setDaemon(true); // Allow JVM to exit if only daemon threads remain
                return t;
            });

    public static void main(String[] args) {
        System.out.println("Main thread started. ID: " + Thread.currentThread().getId());
        String initialApiUrl = "https://jsonplaceholder.typicode.com/todos/1"; // Get a todo
        String usersApiUrlBase = "https://jsonplaceholder.typicode.com/users/"; // Get user details

        long startTime = System.currentTimeMillis();

        CompletableFuture<String> todoFuture = CompletableFuture.supplyAsync(() -> {
            System.out.println("Fetching todo on thread: " + Thread.currentThread().getName());
            try {
                return callApi(initialApiUrl, "Todo API");
            } catch (IOException | InterruptedException e) {
                throw new CompletionException("Failed to fetch todo", e);
            }
        }, apiExecutor); // Explicitly use our API executor

        CompletableFuture<String> userFuture = todoFuture
                .thenApply(todoJson -> {
                    System.out.println("Processing todo result to extract userId on thread: " + Thread.currentThread().getName());
                    // Simple parsing: extract "userId": X from JSON
                    int userIdIndex = todoJson.indexOf("\"userId\":");
                    if (userIdIndex != -1) {
                        int start = userIdIndex + "\"userId\":".length();
                        int end = todoJson.indexOf(",", start);
                        if (end == -1) end = todoJson.indexOf("}", start);
                        if (start < end) {
                            String userIdStr = todoJson.substring(start, end).trim();
                            System.out.println("Extracted userId: " + userIdStr);
                            return userIdStr;
                        }
                    }
                    throw new IllegalArgumentException("Could not extract userId from todo response: " + todoJson);
                })
                .thenCompose(userId -> CompletableFuture.supplyAsync(() -> {
                    System.out.println("Fetching user details for userId " + userId + " on thread: " + Thread.currentThread().getName());
                    try {
                        return callApi(usersApiUrlBase + userId, "User API for " + userId);
                    } catch (IOException | InterruptedException e) {
                        throw new CompletionException("Failed to fetch user for userId " + userId, e);
                    }
                }, apiExecutor)); // Again, use our API executor for the subsequent call

        // Combine the results or handle errors
        userFuture
                .thenAccept(userJson -> {
                    System.out.println("\nFinal Result (user details):");
                    System.out.println(userJson);
                })
                .exceptionally(ex -> {
                    System.err.println("An error occurred in the chain: " + ex.getMessage());
                    return null; // Return null to complete the exceptional stage
                })
                .whenComplete((result, throwable) -> {
                    System.out.println("All asynchronous operations completed.");
                    long endTime = System.currentTimeMillis();
                    System.out.println("Total time for CompletableFuture chain: " + (endTime - startTime) + " ms.");
                    // Ensure the executor is shut down
                    apiExecutor.shutdown();
                });
System.out.println("Main thread continues its execution without blocking immediately...");
// This line executes before the API calls complete
try {
// Keep main thread alive long enough for async tasks to complete
// In a real application, this might be handled by a web server or other framework
Thread.sleep(5000);
} catch (InterruptedException e) {
Thread.currentThread().interrupt();
}
System.out.println("Main thread finishing (may still wait for API executor shutdown).");
try {
if (!apiExecutor.awaitTermination(1, TimeUnit.MINUTES)) { // Give enough time for tasks to finish
System.err.println("API Executor did not terminate in time. Forcing shutdown.");
apiExecutor.shutdownNow();
}
} catch (InterruptedException e) {
System.err.println("API Executor shutdown interrupted: " + e.getMessage());
apiExecutor.shutdownNow();
Thread.currentThread().interrupt();
}
}
private static String callApi(String apiUrl, String taskName) throws IOException, InterruptedException {
// Simulate potential network delay
// Thread.sleep((long)(Math.random() * 500) + 100);
HttpRequest request = HttpRequest.newBuilder()
.uri(URI.create(apiUrl))
.timeout(Duration.ofSeconds(10))
.GET()
.build();
HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
if (response.statusCode() == 200) {
// System.out.println(taskName + ": Received successful response on thread: " + Thread.currentThread().getName());
return response.body();
} else {
throw new IOException(taskName + ": API request failed with status code: " + response.statusCode() + " Body: " + response.body());
}
}
}
This example demonstrates how CompletableFuture allows you to define a sequence of dependent operations (fetch todo -> extract userId -> fetch user) in a non-blocking, declarative way. Each thenApply or thenCompose stage is executed when the previous stage completes, potentially on a different thread from the apiExecutor pool.
Waiting for Multiple Futures: allOf, anyOf
- CompletableFuture.allOf(CompletableFuture<?>... cfs): Returns a new CompletableFuture<Void> that completes when all of the given CompletableFutures complete. Useful when you need to perform multiple independent API calls in parallel and wait for all of them to finish before proceeding. You then typically call join() on each individual CompletableFuture to retrieve its result.
- CompletableFuture.anyOf(CompletableFuture<?>... cfs): Returns a new CompletableFuture<Object> that completes when any of the given CompletableFutures completes, with the result of that CompletableFuture. Useful when you need the fastest response among several APIs or fallbacks.
// Example for allOf. callApi declares checked exceptions, so we wrap it in a
// small helper that rethrows them as an unchecked CompletionException:
private static String callApiUnchecked(String url, String name) {
    try {
        return callApi(url, name);
    } catch (IOException | InterruptedException e) {
        throw new CompletionException(e);
    }
}

CompletableFuture<String> apiCallA = CompletableFuture.supplyAsync(() -> callApiUnchecked("urlA", "A"), apiExecutor);
CompletableFuture<String> apiCallB = CompletableFuture.supplyAsync(() -> callApiUnchecked("urlB", "B"), apiExecutor);
CompletableFuture<String> apiCallC = CompletableFuture.supplyAsync(() -> callApiUnchecked("urlC", "C"), apiExecutor);
CompletableFuture<Void> allFutures = CompletableFuture.allOf(apiCallA, apiCallB, apiCallC);
allFutures.thenRun(() -> {
try {
System.out.println("Result A: " + apiCallA.join()); // .join() is a blocking get() that throws unchecked exceptions
System.out.println("Result B: " + apiCallB.join());
System.out.println("Result C: " + apiCallC.join());
} catch (CompletionException e) {
System.err.println("One of the API calls failed: " + e.getCause().getMessage());
}
}).exceptionally(ex -> {
System.err.println("An error occurred in allOf aggregation: " + ex.getMessage());
return null;
});
// Example for anyOf
CompletableFuture<String> fastApi = CompletableFuture.supplyAsync(() -> "fast"); // fast API call
CompletableFuture<String> slowApi = CompletableFuture.supplyAsync(() -> {
    try { Thread.sleep(1000); } catch (InterruptedException e) { Thread.currentThread().interrupt(); } // slow API call
    return "slow";
});
CompletableFuture<Object> anyOne = CompletableFuture.anyOf(fastApi, slowApi);
anyOne.thenAccept(result -> System.out.println("First API to complete: " + result));
Advantages over Traditional Future
- Non-Blocking Composition: The most significant advantage. You can build complex pipelines of asynchronous tasks without blocking threads, leading to higher scalability and responsiveness.
- Declarative Error Handling: Error handling can be attached directly to the CompletableFuture chain, making it more robust and readable than nested try-catch blocks.
- Manual Completion: Flexibility to complete futures manually, integrating with callback-based APIs or custom event systems.
- Readability: For complex async flows, CompletableFuture often leads to more readable and maintainable code than nested callbacks or explicit ExecutorService management with Future.get().
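The manual-completion point deserves a short sketch: any callback-style API can be adapted to a CompletableFuture by calling complete() or completeExceptionally() from inside the callback. The Callback interface and legacyCall method below are hypothetical stand-ins for such an API:

```java
import java.util.concurrent.CompletableFuture;

public class ManualCompletionDemo {
    // A hypothetical callback-style API we want to adapt
    interface Callback { void onResult(String result); void onFailure(Exception e); }

    static void legacyCall(Callback cb) {
        new Thread(() -> cb.onResult("legacy-data")).start(); // simulated async callback
    }

    // Adapt the callback API to a CompletableFuture via manual completion
    static CompletableFuture<String> asFuture() {
        CompletableFuture<String> future = new CompletableFuture<>();
        legacyCall(new Callback() {
            public void onResult(String result) { future.complete(result); }
            public void onFailure(Exception e) { future.completeExceptionally(e); }
        });
        return future;
    }

    public static void main(String[] args) {
        System.out.println(asFuture().join());
    }
}
```

Once wrapped like this, the legacy API participates in the full operator chain: thenApply, thenCompose, exceptionally, and so on.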
CompletableFuture represents a robust and modern approach to handling asynchronous API request completion in Java, especially when dealing with interdependent API calls or requiring complex coordination.
Method 4: Concurrency Utilities for Coordination
While CompletableFuture excels at orchestrating dependent tasks, Java's java.util.concurrent package also provides powerful low-level synchronization aids that are invaluable for coordinating multiple threads and waiting for specific events or conditions, particularly when multiple API calls need to finish before a collective action can be taken.
CountDownLatch: Waiting for N Operations to Complete
A CountDownLatch is a synchronization aid that allows one or more threads to wait until a set of operations being performed in other threads completes. It's initialized with a given count. Any thread that calls await() will block until the count reaches zero. Other threads, typically worker threads, decrement the count by calling countDown() when they finish a task.
This is perfectly suited for scenarios where you launch several independent API calls and need to wait for all of them to complete before processing their collective results or moving to the next stage of your application.
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.*;
public class CountDownLatchApiClient {
private static final HttpClient client = HttpClient.newBuilder()
.version(HttpClient.Version.HTTP_2)
.connectTimeout(Duration.ofSeconds(5))
.build();
private static String callApi(String apiUrl, String taskName) throws IOException, InterruptedException {
System.out.println(taskName + ": Starting API call to " + apiUrl + " on thread " + Thread.currentThread().getName());
HttpRequest request = HttpRequest.newBuilder()
.uri(URI.create(apiUrl))
.timeout(Duration.ofSeconds(10))
.GET()
.build();
HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
if (response.statusCode() == 200) {
return "Status: " + response.statusCode() + ", Body Snippet: " + response.body().substring(0, Math.min(response.body().length(), 50)) + "...";
} else {
throw new IOException(taskName + ": API request failed with status code: " + response.statusCode());
}
}
public static void main(String[] args) throws InterruptedException {
System.out.println("Main thread started. ID: " + Thread.currentThread().getId());
int numberOfApiCalls = 5;
// CountDownLatch initialized with the number of API calls we expect
CountDownLatch latch = new CountDownLatch(numberOfApiCalls);
// A list to store results from API calls, thread-safe
List<String> apiResults = Collections.synchronizedList(new ArrayList<>());
ExecutorService executor = Executors.newFixedThreadPool(numberOfApiCalls); // Pool size equals concurrent calls
System.out.println("ExecutorService for API calls initialized.");
String[] apiUrls = {
"https://jsonplaceholder.typicode.com/todos/1",
"https://jsonplaceholder.typicode.com/posts/2",
"https://jsonplaceholder.typicode.com/comments/3",
"https://jsonplaceholder.typicode.com/albums/4",
"https://jsonplaceholder.typicode.com/photos/5"
};
long startTime = System.currentTimeMillis();
System.out.println("Launching " + numberOfApiCalls + " parallel API calls...");
for (int i = 0; i < numberOfApiCalls; i++) {
final int taskId = i + 1;
final String currentApiUrl = apiUrls[i];
executor.execute(() -> {
String taskName = "API Call " + taskId;
try {
String result = callApi(currentApiUrl, taskName);
apiResults.add(result);
System.out.println(taskName + " completed successfully.");
} catch (IOException | InterruptedException e) {
apiResults.add(taskName + " failed: " + e.getMessage());
System.err.println(taskName + " failed: " + e.getMessage());
if (e instanceof InterruptedException) {
Thread.currentThread().interrupt();
}
} finally {
latch.countDown(); // Decrement the latch count when this API call finishes
System.out.println(taskName + ": CountDown. Latch count: " + latch.getCount());
}
});
}
System.out.println("\nMain thread waiting for all API calls to complete using CountDownLatch.await()...");
try {
// Main thread blocks here until latch count reaches zero or timeout
boolean allCompleted = latch.await(20, TimeUnit.SECONDS); // Wait with a timeout
if (allCompleted) {
System.out.println("\nAll API calls completed!");
System.out.println("Collected Results:");
apiResults.forEach(System.out::println);
} else {
System.err.println("\nTimeout occurred. Not all API calls completed in time.");
System.err.println("Partial Results:");
apiResults.forEach(System.out::println);
}
} catch (InterruptedException e) {
System.err.println("Main thread interrupted while waiting for API calls: " + e.getMessage());
Thread.currentThread().interrupt();
} finally {
executor.shutdown();
if (!executor.awaitTermination(5, TimeUnit.SECONDS)) {
System.err.println("Executor did not terminate gracefully. Forcing shutdown.");
executor.shutdownNow();
}
}
long endTime = System.currentTimeMillis();
System.out.println("\nTotal time for all parallel API requests: " + (endTime - startTime) + " ms.");
System.out.println("Main thread finished.");
}
}
CountDownLatch is effective when you need a one-time gate: once the count reaches zero, it cannot be reset. The await() call effectively acts as a blocking wait point for the main thread, but it waits for multiple background tasks, not just one.
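The one-shot nature of the gate is easy to verify: once the count reaches zero, every later await() returns immediately, and there is no way to re-arm the latch. A tiny sketch:

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class LatchGateDemo {
    // Demonstrates the one-shot nature of CountDownLatch: once the count
    // hits zero, every subsequent await() returns immediately.
    static boolean gateStaysOpen() throws InterruptedException {
        CountDownLatch latch = new CountDownLatch(2);
        latch.countDown();
        latch.countDown();                                      // count is now zero; the gate is open for good
        boolean first = latch.await(1, TimeUnit.MILLISECONDS);  // returns true at once
        boolean second = latch.await(1, TimeUnit.MILLISECONDS); // still true; the latch cannot be re-armed
        return first && second;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(gateStaysOpen());
    }
}
```

If you need a reusable synchronization point, reach for CyclicBarrier, covered next.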
CyclicBarrier: Synchronizing Multiple Threads at a Common Barrier Point
A CyclicBarrier allows a set of threads to all wait for each other to reach a common barrier point. Once all threads have reached the barrier, they are all released simultaneously. The "cyclic" part means that the barrier can be reused once the waiting threads are released. This is useful for coordinating a fixed number of threads that perform some work, synchronize, perform more work, synchronize again, and so on.
While less common for simple "wait for API completion" scenarios than CountDownLatch, CyclicBarrier could be useful in more complex integration tests or batch processing where multiple services (simulated by threads making API calls) must all reach a certain state before a collective next step is taken.
// Example usage (conceptual for API calls)
// Not detailed code, as primary use case for waiting for completion is CountDownLatch/CompletableFuture
/*
public class CyclicBarrierApiCoordination {
public static void main(String[] args) {
int numParticipants = 3;
CyclicBarrier barrier = new CyclicBarrier(numParticipants, () -> {
System.out.println("All participants reached the barrier. Proceeding with aggregation.");
// This runnable is executed once all threads arrive
});
for (int i = 0; i < numParticipants; i++) {
new Thread(() -> {
try {
// Simulate an API call
System.out.println(Thread.currentThread().getName() + " making API call 1...");
Thread.sleep((long)(Math.random() * 1000 + 500)); // Simulate API latency
System.out.println(Thread.currentThread().getName() + " finished API call 1. Waiting at barrier.");
barrier.await(); // Wait for others
// Simulate another API call after all completed the first
System.out.println(Thread.currentThread().getName() + " making API call 2...");
Thread.sleep((long)(Math.random() * 800 + 300));
System.out.println(Thread.currentThread().getName() + " finished API call 2.");
} catch (InterruptedException | BrokenBarrierException e) {
Thread.currentThread().interrupt();
e.printStackTrace();
}
}).start();
}
}
}
*/
Semaphore: Controlling Access to a Limited Number of Resources
A Semaphore controls access to a shared resource by maintaining a set of permits. If you need to limit the number of threads that can concurrently make API calls (e.g., to respect a third-party API rate limit, or to prevent overwhelming an internal service), a Semaphore is an excellent tool.
Each thread must acquire() a permit before accessing the resource (making an API call). If no permits are available, the thread blocks until a permit is released by another thread using release().
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.concurrent.*;
public class SemaphoreApiClient {
private static final HttpClient client = HttpClient.newBuilder()
.version(HttpClient.Version.HTTP_2)
.connectTimeout(Duration.ofSeconds(5))
.build();
private static String callApi(String apiUrl, String taskName) throws IOException, InterruptedException {
// Simulate some API processing time
// Thread.sleep((long)(Math.random() * 500) + 100);
HttpRequest request = HttpRequest.newBuilder()
.uri(URI.create(apiUrl))
.timeout(Duration.ofSeconds(10))
.GET()
.build();
HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString());
if (response.statusCode() == 200) {
return "Status: " + response.statusCode() + ", Body Snippet: " + response.body().substring(0, Math.min(response.body().length(), 50)) + "...";
} else {
throw new IOException(taskName + ": API request failed with status code: " + response.statusCode());
}
}
public static void main(String[] args) {
System.out.println("Main thread started.");
int totalApiRequests = 10;
int maxConcurrentRequests = 3; // Limit to 3 concurrent API calls
// Initialize Semaphore with maxConcurrentRequests permits
Semaphore semaphore = new Semaphore(maxConcurrentRequests);
ExecutorService executor = Executors.newCachedThreadPool(); // Use cached pool as tasks might vary
System.out.println("Launching " + totalApiRequests + " API requests with a concurrency limit of " + maxConcurrentRequests + ".");
long startTime = System.currentTimeMillis();
for (int i = 0; i < totalApiRequests; i++) {
final int taskId = i + 1;
final String apiUrl = "https://jsonplaceholder.typicode.com/posts/" + taskId; // Example API
executor.submit(() -> {
String taskName = "Request " + taskId;
try {
System.out.println(taskName + ": Attempting to acquire permit...");
semaphore.acquire(); // Acquire a permit. Blocks if no permits are available.
System.out.println(taskName + ": Permit acquired. Making API call. Available permits: " + semaphore.availablePermits());
String result = callApi(apiUrl, taskName);
System.out.println(taskName + ": API Call successful. " + result);
} catch (InterruptedException e) {
System.err.println(taskName + ": Interrupted while acquiring permit or calling API: " + e.getMessage());
Thread.currentThread().interrupt();
} catch (IOException e) {
System.err.println(taskName + ": API Call failed: " + e.getMessage());
} finally {
semaphore.release(); // Release the permit, allowing another thread to acquire one
System.out.println(taskName + ": Permit released. Available permits: " + semaphore.availablePermits());
}
});
}
// We need a way to wait for all submitted tasks to complete.
// For demonstration, we'll simply shut down the executor and wait for termination.
executor.shutdown();
try {
if (!executor.awaitTermination(30, TimeUnit.SECONDS)) {
System.err.println("Executor did not terminate in time. Forcing shutdown.");
executor.shutdownNow();
}
} catch (InterruptedException e) {
System.err.println("Main thread interrupted during executor shutdown: " + e.getMessage());
executor.shutdownNow();
Thread.currentThread().interrupt();
}
long endTime = System.currentTimeMillis();
System.out.println("\nAll API requests (or at least all submitted tasks) processed in " + (endTime - startTime) + " ms.");
System.out.println("Main thread finished.");
}
}
Semaphore provides fine-grained control over resource access, an essential capability when dealing with APIs that impose usage limits. While it doesn't directly "wait for completion" in the same way CountDownLatch or CompletableFuture.allOf() does, it's a vital tool for managing the initiation of concurrent API calls. To combine waiting for completion with rate limiting, you might use a Semaphore in conjunction with CountDownLatch or a list of CompletableFutures to track each call.
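One way to combine the two concerns can be sketched with simulated calls (Thread.sleep stands in for the network I/O): each task acquires a permit before "calling" the API, while a list of CompletableFutures plus allOf() provides the wait-for-completion semantics:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.*;

public class RateLimitedCompletionDemo {
    // Runs totalTasks simulated API calls, at most maxConcurrent at a time,
    // and waits for all of them before returning the collected results.
    static List<String> runAll(int totalTasks, int maxConcurrent) {
        Semaphore semaphore = new Semaphore(maxConcurrent);
        ExecutorService pool = Executors.newCachedThreadPool();
        List<CompletableFuture<String>> futures = new ArrayList<>();
        for (int i = 1; i <= totalTasks; i++) {
            final int id = i;
            futures.add(CompletableFuture.supplyAsync(() -> {
                try {
                    semaphore.acquire();   // respect the concurrency limit
                    Thread.sleep(50);      // stands in for the API call
                    return "result-" + id;
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new CompletionException(e);
                } finally {
                    semaphore.release();
                }
            }, pool));
        }
        // allOf() gives us completion of every rate-limited call
        CompletableFuture.allOf(futures.toArray(new CompletableFuture[0])).join();
        pool.shutdown();
        List<String> results = new ArrayList<>();
        for (CompletableFuture<String> f : futures) results.add(f.join());
        return results;
    }

    public static void main(String[] args) {
        System.out.println(runAll(6, 2));
    }
}
```

The Semaphore throttles how many calls are in flight; the futures list tracks every call individually, so allOf() can wait for the whole batch.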
Method 5: Reactive Programming with Project Reactor/RxJava
Reactive programming represents a more advanced and powerful paradigm for handling asynchronous data streams and events, making it exceptionally well-suited for API interactions, especially in microservices architectures or event-driven systems. It's built upon the Reactive Streams specification, which provides a standard for asynchronous stream processing with non-blocking backpressure.
Introduction to Reactive Streams Specification
The Reactive Streams specification defines four interfaces:
- Publisher: A producer of data that can be subscribed to by Subscribers.
- Subscriber: A consumer of data that receives events (onSubscribe, onNext, onError, onComplete).
- Subscription: Represents the connection between a Publisher and a Subscriber, allowing for backpressure (the Subscriber can signal how much data it can handle).
- Processor: A stage that is both a Subscriber and a Publisher.
Libraries like Project Reactor (often used with Spring WebFlux) and RxJava implement this specification, providing rich APIs for building reactive pipelines.
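Since Java 9, the JDK itself ships these same four interfaces as java.util.concurrent.Flow, with which Reactor and RxJava interoperate. A minimal sketch of the Publisher/Subscriber/Subscription handshake, including request-based backpressure, using the built-in SubmissionPublisher:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.Flow;
import java.util.concurrent.SubmissionPublisher;
import java.util.concurrent.TimeUnit;

public class FlowDemo {
    // Publishes the given items through the JDK Flow API and collects
    // everything a backpressure-aware Subscriber receives.
    static List<String> publishAndCollect(List<String> items) throws InterruptedException {
        List<String> received = new ArrayList<>();
        CountDownLatch done = new CountDownLatch(1);
        try (SubmissionPublisher<String> publisher = new SubmissionPublisher<>()) {
            publisher.subscribe(new Flow.Subscriber<String>() {
                private Flow.Subscription subscription;
                public void onSubscribe(Flow.Subscription s) { subscription = s; s.request(1); }
                public void onNext(String item) { received.add(item); subscription.request(1); } // one at a time
                public void onError(Throwable t) { done.countDown(); }
                public void onComplete() { done.countDown(); }
            });
            items.forEach(publisher::submit);
        } // close() signals onComplete once buffered items are delivered
        done.await(5, TimeUnit.SECONDS);
        return received;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(publishAndCollect(List.of("a", "b", "c")));
    }
}
```

The request(1) calls are the backpressure mechanism in action: the Subscriber pulls items one by one instead of letting the Publisher push at full speed.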
Flux and Mono (Project Reactor) / Observable and Single (RxJava)
- Project Reactor:
  - Flux<T>: Represents a stream of 0 to N elements (asynchronous sequence). Ideal for API calls that might return multiple results or a continuous stream of events.
  - Mono<T>: Represents a stream of 0 or 1 element (asynchronous scalar). Perfect for standard API calls that return a single response.
- RxJava:
  - Observable<T>: Similar to Flux, for 0 to N elements.
  - Single<T>: Similar to Mono, for exactly 1 element.
  - Maybe<T>: For 0 or 1 element.
  - Completable: For an action that completes without emitting any items, only onComplete or onError.
These reactive types allow you to define a sequence of operations (transformations, filters, error handling, retries) that will be applied to the data emitted by the publisher, all in a non-blocking fashion.
Chaining Operators for Data Transformation, Error Handling, and Concurrency Management
Reactive libraries provide a vast array of operators that can be chained to build complex data processing pipelines:
- Mapping/Transformation: map(), flatMap(), concatMap()
- Filtering: filter()
- Error Handling: onErrorResume(), onErrorReturn(), retry()
- Concurrency: subscribeOn(), publishOn()
- Aggregation: collectList(), zip()
Integrating with Non-Reactive APIs and Waiting for Completion
While reactive APIs are designed to be non-blocking, there are times you need to integrate with blocking APIs or explicitly wait for a reactive stream to complete.
- subscribe(): The most common way to initiate the reactive pipeline. The subscribe() method attaches a Subscriber and begins the flow of data. It is inherently non-blocking.
- block() (Mono/Flux): A blocking operator that subscribes to the Mono/Flux and waits for the sequence to complete, returning the last element produced (for Mono) or throwing an exception. Use block() sparingly, typically only at the edge of your reactive application (e.g., in main methods for testing, or when bridging to legacy blocking code). Excessive use negates the benefits of reactive programming.
- toFuture() (Mono/Flux): Converts a Mono/Flux to a CompletableFuture, allowing you to bridge reactive streams with CompletableFuture-based asynchronous logic.
Code Example with WebClient (Spring WebFlux)
Spring WebFlux, built on Project Reactor, provides WebClient for making non-blocking HTTP API calls. This is a common and powerful pattern in modern Java microservices.
import org.springframework.web.reactive.function.client.WebClient;
import reactor.core.publisher.Flux;
import reactor.core.publisher.Mono;
import reactor.core.scheduler.Schedulers;
import java.time.Duration;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
public class ReactiveApiClient {
private final WebClient webClient;
private final AtomicInteger successCount = new AtomicInteger(0);
private final AtomicInteger errorCount = new AtomicInteger(0);
public ReactiveApiClient(String baseUrl) {
this.webClient = WebClient.builder()
.baseUrl(baseUrl)
.defaultHeader("Accept", "application/json")
.clientConnector(new org.springframework.http.client.reactive.ReactorClientHttpConnector(
        reactor.netty.http.client.HttpClient.create()
                .responseTimeout(Duration.ofSeconds(10))
                .compress(true) // Enable compression
                .option(io.netty.channel.ChannelOption.CONNECT_TIMEOUT_MILLIS, 5000)
                .wiretap(true) // For debugging network traffic
))
.build();
}
// Fetches a single todo item
public Mono<String> fetchTodo(int id) {
System.out.println("Initiating fetch for todo " + id + " on thread: " + Thread.currentThread().getName());
return webClient.get()
.uri("/todos/{id}", id)
.retrieve()
.bodyToMono(String.class)
.doOnSuccess(s -> {
successCount.incrementAndGet();
System.out.println("Successfully fetched todo " + id + " on thread: " + Thread.currentThread().getName());
})
.doOnError(e -> {
errorCount.incrementAndGet();
System.err.println("Error fetching todo " + id + ": " + e.getMessage() + " on thread: " + Thread.currentThread().getName());
})
.timeout(Duration.ofSeconds(5)) // Overall operation timeout
.retryWhen(reactor.util.retry.Retry.backoff(3, Duration.ofSeconds(1)).maxBackoff(Duration.ofSeconds(5))) // Retry with exponential backoff
.onErrorReturn("{\"id\":" + id + ", \"title\":\"Fallback for " + id + "\", \"completed\":false}") // Fallback on persistent error
.subscribeOn(Schedulers.boundedElastic()); // Run I/O intensive parts on boundedElastic pool
}
// Fetches multiple todo items concurrently
public Flux<String> fetchMultipleTodos(int... ids) {
return Flux.fromArray(ids)
.flatMap(this::fetchTodo, 5); // Concurrently fetch 5 items at a time
}
public static void main(String[] args) throws InterruptedException {
System.out.println("Main thread started. ID: " + Thread.currentThread().getId());
ReactiveApiClient client = new ReactiveApiClient("https://jsonplaceholder.typicode.com");
// Example 1: Fetch a single todo and block for its result (for main method demonstration)
System.out.println("\n--- Fetching a single todo (blocking for demo) ---");
try {
String singleTodo = client.fetchTodo(1).block(Duration.ofSeconds(10)); // Using block() for simple main method demo
System.out.println("Blocking Result (Todo 1): " + singleTodo.substring(0, Math.min(singleTodo.length(), 100)) + "...");
} catch (Exception e) {
System.err.println("Blocking call for single todo failed: " + e.getMessage());
}
// Example 2: Fetch multiple todos concurrently and collect results non-blockingly
System.out.println("\n--- Fetching multiple todos concurrently (non-blocking) ---");
int[] todoIds = {2, 3, 4, 5, 6, 7, 8, 9, 10, 11};
CountDownLatch latch = new CountDownLatch(todoIds.length);
long startTime = System.currentTimeMillis();
client.fetchMultipleTodos(todoIds)
.collectList() // Collect all results into a single list
.subscribe(
results -> {
System.out.println("\n--- All multiple todo fetches completed! ---");
results.forEach(s -> System.out.println(" Result: " + s.substring(0, Math.min(s.length(), 100)) + "..."));
System.out.println("Total success: " + client.successCount.get());
System.out.println("Total errors: " + client.errorCount.get());
latch.countDown(); // Signal completion of the list collection
},
error -> {
System.err.println("Error fetching multiple todos: " + error.getMessage());
latch.countDown(); // Signal completion even on error
},
() -> {
// Completion callback: collectList emits its single List via the success
// consumer above, so the results have already been handled by this point.
System.out.println("Reactive stream completed.");
}
);
System.out.println("Main thread continues. Waiting for multiple todo fetches via latch...");
latch.await(30, TimeUnit.SECONDS); // Wait for the reactive stream to complete
long endTime = System.currentTimeMillis();
System.out.println("Total time for multiple reactive API requests: " + (endTime - startTime) + " ms.");
System.out.println("Main thread finished.");
// Shut down Reactor's shared schedulers (e.g., boundedElastic) to release their threads
Schedulers.shutdownNow();
}
}
This reactive approach for API interaction provides:
- Asynchronous by Design: All operations are inherently non-blocking.
- Declarative Composition: Complex workflows (fetching, mapping, retrying, error handling) are expressed fluently using operators.
- Backpressure: Consumers can signal how much data they can process, preventing producers from overwhelming them.
- Concurrency Management: subscribeOn() and publishOn() allow fine-grained control over thread execution, offloading I/O operations to appropriate schedulers.
- Resilience: Built-in operators for retries, fallbacks, and timeouts make API interactions highly resilient.
Benefits and Complexities of Reactive Approach
Benefits:
- Scalability: Achieves high concurrency with a minimal number of threads, making it extremely scalable for I/O-bound workloads.
- Responsiveness: Ensures non-blocking behavior throughout the application, leading to highly responsive systems.
- Resilience: Powerful operators for building robust error handling, retry, and timeout strategies directly into the data flow.
- Code Clarity for Complex Streams: For complex event-driven architectures or continuous data streams, reactive pipelines can be more concise and easier to reason about than nested callbacks or intricate CompletableFuture chains.
Complexities:
- Steep Learning Curve: The reactive paradigm requires a different way of thinking about program flow and error handling.
- Debugging Challenges: Stack traces in reactive applications can be deep and difficult to interpret due to the asynchronous and chained nature of operations.
- Overhead for Simple Tasks: For very simple, isolated API calls, the overhead of setting up a reactive pipeline might be more than the benefit.
- Integration with Blocking Code: Bridging reactive code with traditional blocking libraries or APIs (e.g., using block() or toFuture()) needs careful consideration to avoid reintroducing blocking bottlenecks.
Reactive programming, particularly with Project Reactor, is increasingly becoming the standard for modern Java microservices and high-performance backends due to its unparalleled scalability and robustness in handling API interactions.
Advanced Considerations for Robust API Interaction
Beyond choosing a core asynchronous strategy, building truly robust applications that interact with APIs requires attention to several critical aspects. These considerations ensure reliability, efficiency, and manageability in real-world scenarios.
Timeouts
Implementing robust timeout mechanisms is paramount to prevent threads from hanging indefinitely on unresponsive APIs. Timeouts should be applied at multiple layers:
- HTTP Client Level: Most HTTP clients (like Java's HttpClient, Apache HttpClient, OkHttp, or Spring's WebClient) allow you to configure connection timeouts (time to establish a connection) and read/request timeouts (time to receive data after the connection).
  - Example (HttpClient): HttpRequest.newBuilder().timeout(Duration.ofSeconds(20))
  - Example (WebClient with Reactor Netty): responseTimeout(Duration.ofSeconds(10))
- Future.get() / CompletableFuture.get() Level: When blocking for a result, always use the timed versions to limit the waiting period.
  - Example: future.get(10, TimeUnit.SECONDS)
- CompletableFuture-Specific Timeouts: CompletableFuture offers orTimeout(long timeout, TimeUnit unit) and completeOnTimeout(T value, long timeout, TimeUnit unit). orTimeout() completes the CompletableFuture exceptionally with a TimeoutException if it is not completed within the given time; completeOnTimeout() completes it with the given value instead.
- Reactive Stream Level: Reactive operators like timeout(Duration) can terminate a stream with an error if no element is emitted within a specified duration.
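The two CompletableFuture-specific timeout operators behave quite differently, which a short sketch makes concrete (the two-second sleep stands in for an unresponsive API):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.CompletionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class CfTimeoutDemo {
    // Simulates a call that takes longer than the caller is willing to wait
    static CompletableFuture<String> slowCall() {
        return CompletableFuture.supplyAsync(() -> {
            try { Thread.sleep(2_000); } catch (InterruptedException e) { Thread.currentThread().interrupt(); }
            return "real-response";
        });
    }

    // completeOnTimeout: substitute a fallback value when the deadline passes
    static String withFallback() {
        return slowCall().completeOnTimeout("fallback", 100, TimeUnit.MILLISECONDS).join();
    }

    // orTimeout: fail exceptionally with TimeoutException when the deadline passes
    static boolean timesOut() {
        try {
            slowCall().orTimeout(100, TimeUnit.MILLISECONDS).join();
            return false;
        } catch (CompletionException e) {
            return e.getCause() instanceof TimeoutException;
        }
    }

    public static void main(String[] args) {
        System.out.println(withFallback());
        System.out.println(timesOut());
    }
}
```

Use completeOnTimeout() when a sensible default exists, and orTimeout() when a late response should be treated as an error.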
Retry Mechanisms
Transient API failures (network glitches, temporary service overload) are common. Implementing retry mechanisms can significantly improve the resilience of your application.
- Simple Retry: A fixed number of retries after a short delay.
- Exponential Backoff: Increasing the delay between retries exponentially. This helps prevent overwhelming an overloaded service.
- Jitter: Adding a random component to the backoff delay to prevent all clients from retrying at precisely the same time, leading to thundering herd problems.
- Circuit Breakers: Libraries like Resilience4j or Hystrix (legacy) implement the Circuit Breaker pattern. If an API service fails repeatedly, the circuit breaker "opens," preventing further calls to that service for a period, and quickly failing those requests. This protects the failing service from further load and prevents your application from waiting indefinitely. After a set time, the circuit breaker moves to a "half-open" state, allowing a few test calls to determine if the service has recovered.
- Reactive retry() / retryWhen(): Reactive libraries offer powerful retry() and retryWhen() operators for declarative retry policies, including backoff and jitter.
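The simple-retry, exponential-backoff, and jitter ideas can be combined in a few lines of plain Java. This is an illustrative sketch — the method name callWithRetry and its parameters are invented for the example — not a substitute for a library such as Resilience4j:

```java
import java.util.concurrent.ThreadLocalRandom;
import java.util.function.Supplier;

public class RetryDemo {
    // Retries the supplier with exponential backoff plus jitter.
    static <T> T callWithRetry(Supplier<T> call, int maxAttempts, long baseDelayMillis) {
        if (maxAttempts < 1) throw new IllegalArgumentException("maxAttempts must be positive");
        RuntimeException last = null;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            try {
                return call.get();
            } catch (RuntimeException e) {
                last = e;
                // Exponential backoff: base * 2^attempt, plus up to ~50% random jitter
                // so that many clients do not all retry at the same instant.
                long backoff = baseDelayMillis * (1L << attempt);
                long jitter = ThreadLocalRandom.current().nextLong(backoff / 2 + 1);
                try {
                    Thread.sleep(backoff + jitter);
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("interrupted while backing off", ie);
                }
            }
        }
        throw last;
    }

    public static void main(String[] args) {
        int[] calls = {0};
        // Simulated flaky API: fails twice with a transient error, then succeeds.
        String result = callWithRetry(() -> {
            if (++calls[0] < 3) throw new RuntimeException("transient failure");
            return "ok";
        }, 5, 100);
        System.out.println(result + " after " + calls[0] + " attempts"); // ok after 3 attempts
    }
}
```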
Error Handling Strategies
Robust API interaction requires comprehensive error handling:
- Custom Exceptions: Define specific exceptions for different types of API errors (e.g., ApiTimeoutException, ServiceUnavailableException, InvalidApiResponseException).
- Fallback Mechanisms: When an API call fails, consider if a default value, a cached response, or a less critical alternative API can be used as a fallback. CompletableFuture offers exceptionally(Function<Throwable, ? extends T> fn); reactive streams offer onErrorResume(Function<Throwable, Mono<T>> fallback) or onErrorReturn(T value).
- Centralized Error Handling: Implement global exception handlers (e.g., Spring's @ControllerAdvice) to catch and process API-related exceptions, returning consistent error responses to clients.
- Logging: Always log API call failures with sufficient detail (request, response status, error message) to aid debugging and monitoring.
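As a minimal illustration of the fallback idea, exceptionally() can substitute a cached or default value when the upstream call fails. The failure here is simulated, and the method and value names are invented for the example:

```java
import java.util.concurrent.CompletableFuture;

public class FallbackDemo {
    // Simulated failing API call; exceptionally() swaps in a cached/default value.
    static String fetchWithFallback() {
        return CompletableFuture.<String>supplyAsync(() -> {
                    throw new IllegalStateException("503 Service Unavailable");
                })
                .exceptionally(ex -> {
                    // In a real application, log ex here before choosing the fallback.
                    return "cached-default";
                })
                .join();
    }

    public static void main(String[] args) {
        System.out.println(fetchWithFallback()); // cached-default
    }
}
```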
Connection Pooling
For HTTP-based APIs, setting up an efficient connection pool for your HttpClient is crucial. Repeatedly establishing new TCP connections is expensive. A connection pool reuses existing connections, drastically reducing latency and resource consumption.
- Java 11 HttpClient: It automatically manages a connection pool. You can configure it via system properties or client builders.
- Apache HttpClient / OkHttp: These libraries provide robust connection pooling out of the box, with configurable parameters like max connections, max connections per route, and idle connection eviction.

Always create a single HttpClient instance (or WebClient instance) and reuse it throughout your application.
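A minimal sketch of the "single shared client" advice for the Java 11 HttpClient — the holder class and timeout value are illustrative, and the JDK client's pool-size knobs are exposed via system properties (e.g., jdk.httpclient.connectionPoolSize) rather than builder methods:

```java
import java.net.http.HttpClient;
import java.time.Duration;

public class ClientHolder {
    // One shared client for the whole application. It manages its own
    // connection pool internally, so every request through it can reuse
    // established TCP connections instead of opening new ones.
    static final HttpClient CLIENT = HttpClient.newBuilder()
            .connectTimeout(Duration.ofSeconds(5))
            .build();
}
```

Every piece of code that issues a request should reference ClientHolder.CLIENT (or have the client injected) rather than calling HttpClient.newBuilder() again.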
Logging and Monitoring
Effective logging and monitoring are non-negotiable for production API integrations.
- Detailed Logging: Log the start and end of API calls, their duration, success/failure status, and key parameters. Use correlation IDs to link related logs across requests.
- Metrics: Collect metrics on API call success rates, latency, throughput, and error rates using tools like Micrometer, Prometheus, or Grafana. These metrics provide critical insights into the health and performance of your API integrations.
- Distributed Tracing: For microservices architectures, distributed tracing (e.g., with OpenTelemetry, Jaeger, Zipkin) is essential to visualize the flow of requests across multiple services and identify bottlenecks in an API call chain.
Idempotency
When designing or interacting with APIs, especially those that involve state changes, consider idempotency. An idempotent operation can be safely retried multiple times without producing different results beyond the initial call. For example, deleting a resource is usually idempotent: deleting it once or five times has the same final effect. Non-idempotent operations (like creating a new order without a unique transaction ID) should be handled carefully with retry logic.
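The idempotency-key pattern can be illustrated with a toy in-memory store standing in for the server side; every name here is hypothetical:

```java
import java.util.Map;
import java.util.UUID;
import java.util.concurrent.ConcurrentHashMap;

public class IdempotencyDemo {
    // Toy server-side store keyed by idempotency key: a retried request with
    // the same key returns the original result instead of creating a duplicate.
    static final Map<String, String> processed = new ConcurrentHashMap<>();

    static String createOrder(String idempotencyKey, String payload) {
        return processed.computeIfAbsent(idempotencyKey, k -> "order-for-" + payload);
    }

    public static void main(String[] args) {
        String key = UUID.randomUUID().toString(); // client generates one key per logical request
        String first = createOrder(key, "book");
        String retry = createOrder(key, "book");   // a retry after a network glitch
        System.out.println(first.equals(retry));   // true: no duplicate order created
    }
}
```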
API Management & Gateways
For organizations managing a multitude of internal and external APIs, especially those integrating AI models, an API management platform becomes indispensable. Platforms like APIPark provide an open-source AI gateway and API management solution that simplifies the entire API lifecycle, from design and publication to monitoring and access control. It can significantly offload the burden of implementing complex security, rate limiting, and traffic management logic within individual services, allowing developers to focus on core business logic while APIPark handles the underlying infrastructure for robust API interactions. An API gateway can centralize concerns such as authentication, authorization, rate limiting, request/response transformation, and caching, providing a unified entry point to your backend services. This not only enhances security and performance but also simplifies client applications and enables consistent API governance across your entire ecosystem.
By diligently addressing these advanced considerations, Java developers can move beyond simply making API calls to building truly robust, resilient, and high-performing applications that gracefully handle the complexities of distributed systems and external service dependencies.
Choosing the Right Strategy
With various mechanisms available for handling Java API request completion, deciding on the most appropriate strategy depends on several factors: the complexity of your application, the nature of the API calls, performance requirements, team expertise, and existing technology stack. There's no one-size-fits-all answer.
Let's summarize the strengths and weaknesses of the main approaches in a comparative table.
| Feature / Strategy | Basic Threading (Thread.start()) | ExecutorService (Future.get()) | CompletableFuture (thenApply, allOf) | Reactive Programming (Mono/Flux) |
|---|---|---|---|---|
| Concurrency | Manual & error-prone | Managed thread pool | Highly managed & composable | Asynchronous & non-blocking |
| Blocking Nature | Thread.join() is blocking | Future.get() is blocking | Non-blocking composition; join()/get() are blocking exit points | Inherently non-blocking (except block()) |
| Error Handling | Manual (UncaughtExceptionHandler, shared state) | ExecutionException from get() | Declarative (exceptionally, handle) | Declarative (onErrorResume, retry) |
| Result Retrieval | Manual (shared state) | Future<T> | CompletableFuture<T> | Subscriber / block() / toFuture() |
| Chaining/Composition | Very difficult | Limited (manual chaining of get()) | Excellent (declarative, fluent API) | Excellent (operators, highly composable) |
| Resource Management | Poor (manual thread lifecycle) | Good (managed thread pools) | Good (managed thread pools) | Excellent (event-loop friendly, minimal threads) |
| Learning Curve | Low (conceptually simple) | Medium | Medium to high | High |
| Use Cases | Very simple, isolated tasks (rarely recommended) | Independent background tasks, batch processing | Dependent API call pipelines, parallel processing with aggregation | High-throughput microservices, event-driven systems, continuous streams |
| Typical Overhead | High for many threads | Moderate | Low to moderate | Very low (per task) |
| Resilience Features | Manual | Manual (with external libraries) | Good (timeouts, basic retries) | Excellent (built-in retry, timeout, fallback) |
Decision Matrix
- For simple, isolated, non-critical background tasks without return values: While not ideal, a simple Runnable submitted to a single-thread ExecutorService might suffice. Avoid raw Thread.start().
- For independent, parallel API calls where you need to wait for all to complete and collect results: ExecutorService with Future.get() or CountDownLatch is effective. However, CompletableFuture.allOf() provides a cleaner, non-blocking way to achieve this.
- For chained API calls where the output of one call feeds into the input of the next: CompletableFuture with thenCompose() is the ideal choice due to its non-blocking composition.
- For complex asynchronous workflows, event-driven architectures, or high-throughput microservices: Reactive programming with Project Reactor (especially with Spring WebFlux's WebClient) offers the most scalable and robust solution. It demands a higher learning investment but pays off in highly performant and resilient systems.
- For limiting concurrent access to an API (e.g., rate limiting): Semaphore is the specialized tool. It can be used in conjunction with any of the above mechanisms to enforce concurrency constraints.
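A Semaphore composes naturally with CompletableFuture.allOf(): the semaphore caps how many calls run at once, while allOf() gathers completion of all of them. This is a sketch with simulated calls and illustrative names and numbers:

```java
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.Semaphore;
import java.util.stream.Collectors;
import java.util.stream.IntStream;

public class RateLimitDemo {
    // Runs n simulated API calls, allowing at most maxConcurrent in flight at once.
    static List<String> fetchAll(int n, int maxConcurrent) {
        Semaphore permits = new Semaphore(maxConcurrent);
        List<CompletableFuture<String>> futures = IntStream.range(0, n)
                .mapToObj(i -> CompletableFuture.supplyAsync(() -> {
                    try {
                        permits.acquire();   // enforce the concurrency cap
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                        throw new IllegalStateException(e);
                    }
                    try {
                        Thread.sleep(50);    // stand-in for the API round trip
                        return "result-" + i;
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                        throw new IllegalStateException(e);
                    } finally {
                        permits.release();   // free the slot for the next call
                    }
                }))
                .collect(Collectors.toList());
        // allOf composes completion of every call; join() is the single blocking exit point.
        CompletableFuture.allOf(futures.toArray(new CompletableFuture[0])).join();
        return futures.stream().map(CompletableFuture::join).collect(Collectors.toList());
    }

    public static void main(String[] args) {
        System.out.println(fetchAll(5, 2)); // five results, at most two calls in flight at a time
    }
}
```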
In modern Java development, CompletableFuture has become the de facto standard for asynchronous operations when explicit reactive streams are not yet necessary. It strikes a good balance between power, expressiveness, and a manageable learning curve. Reactive programming is the logical next step for applications pushing the boundaries of scalability and responsiveness.
Best Practices for Waiting for API Completion
Regardless of the chosen strategy, adhering to a set of best practices will ensure your API interactions are reliable, efficient, and maintainable.
- Prefer Asynchronous and Non-Blocking Approaches: Always favor non-blocking I/O and asynchronous patterns (Executor Framework, CompletableFuture, Reactive Streams) over synchronous blocking calls, especially in server-side or UI applications. This is the single most important practice for scalability and responsiveness.
- Implement Sensible Timeouts: Never make an API call without a timeout. This protects your application from unresponsive external services, prevents resource exhaustion, and allows for graceful degradation or retry strategies. Apply timeouts at the HTTP client, Future.get(), and CompletableFuture/reactive stream levels.
- Handle Exceptions Gracefully: API calls are inherently prone to network issues, service unavailability, and unexpected responses. Implement robust error handling, including specific exception types, retry logic, and fallback mechanisms. Log errors comprehensively.
- Manage Thread Pools Effectively: When using ExecutorService or CompletableFuture with custom executors, configure thread pools appropriately for your workload. For I/O-bound tasks like API calls, a fixed thread pool or Reactor's boundedElastic scheduler with a pool size related to the number of concurrent API calls you can handle (rather than CPU cores) is often suitable. Always shut down ExecutorService instances to prevent resource leaks.
- Reuse HTTP Client Instances: Create a single HttpClient or WebClient instance (configured with a connection pool) and reuse it throughout your application. Creating new client instances for every API call is extremely inefficient.
- Use Correlation IDs for Tracing: When interacting with multiple services or APIs, especially in a microservices environment, generate and propagate correlation IDs. This allows you to trace a single logical request across all services and logs, invaluable for debugging and monitoring.
- Consider Circuit Breakers and Bulkheads: For critical API dependencies, integrate resilience patterns like Circuit Breakers (to fail fast when a service is unhealthy) and Bulkheads (to isolate components and prevent cascading failures) using libraries like Resilience4j.
- Monitor API Performance and Health: Implement comprehensive monitoring for all external API calls. Track latency, success rates, error rates, and throughput. This provides early warning signs of issues and helps in capacity planning.
- Document API Interaction Logic: Clearly document how your application interacts with external APIs, including expected behaviors, error codes, retry policies, and authentication mechanisms. This is especially critical for maintenance and onboarding new team members.
- Test Thoroughly: Unit test your API client logic with mock servers. Conduct integration tests against actual (test) APIs. Perform load testing to identify bottlenecks and validate your concurrency strategies under stress.
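Two of these practices — sizing an I/O-bound pool by its concurrency target rather than CPU cores, and always shutting the pool down — can be sketched together. The pool size here is illustrative, not a recommendation:

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class PoolLifecycleDemo {
    public static void main(String[] args) throws InterruptedException {
        // For I/O-bound API calls, size the pool by how many calls you want
        // in flight at once, not by Runtime.getRuntime().availableProcessors().
        ExecutorService apiPool = Executors.newFixedThreadPool(20);

        apiPool.submit(() ->
                System.out.println("simulated API call on " + Thread.currentThread().getName()));

        // Always shut the pool down, or its non-daemon threads leak and
        // keep the JVM alive after your work is done.
        apiPool.shutdown();
        boolean finished = apiPool.awaitTermination(5, TimeUnit.SECONDS);
        System.out.println("terminated cleanly: " + finished);
    }
}
```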
By integrating these best practices into your development workflow, you can build Java applications that not only correctly wait for API request completion but do so in a way that is highly performant, scalable, and resilient to the inherent challenges of distributed systems.
Conclusion
Navigating the complexities of Java API request completion is a cornerstone of modern software development. As applications become increasingly distributed and reliant on external services, the ability to efficiently and robustly manage API interactions directly impacts their responsiveness, scalability, and overall user experience. We have journeyed through various approaches, starting from the basic yet often problematic synchronous calls, moving to the more controlled thread management of the Executor Framework, embracing the declarative power of CompletableFuture for complex asynchronous workflows, and finally exploring the highly scalable world of reactive programming with Project Reactor.
Each method offers a distinct set of trade-offs, providing developers with a rich toolkit. While synchronous calls offer simplicity for trivial tasks, they quickly become bottlenecks. The Executor Framework provides managed concurrency, but Future.get() still introduces blocking. CompletableFuture emerges as a versatile and modern solution, enabling non-blocking composition and robust error handling without the full paradigm shift of reactive programming. For the most demanding, high-throughput, and event-driven systems, reactive libraries like Project Reactor represent the pinnacle of asynchronous control, albeit with a steeper learning curve.
Beyond the core mechanisms, a truly masterly approach to API interactions incorporates advanced considerations such as intelligent timeouts, resilient retry mechanisms, comprehensive error handling, efficient connection pooling, and robust monitoring. Furthermore, for organizations dealing with a myriad of APIs, leveraging an API management platform like APIPark can centralize governance, security, and performance optimization, allowing developers to focus on core logic rather than infrastructure concerns.
Ultimately, the choice of strategy hinges on the specific context of your application. However, the overarching principle remains clear: embrace asynchronicity. By understanding and effectively applying the patterns and practices discussed in this guide, Java developers can transcend the challenges of waiting for API responses, building applications that are not only functional but also responsive, scalable, and resilient in the face of an ever-interconnected digital world.
Frequently Asked Questions (FAQ)
1. What is the fundamental difference between synchronous and asynchronous API calls in Java, and why does it matter?
Synchronous API calls block the executing thread, meaning the thread pauses and waits for the API response before moving to the next line of code. This is simple for single calls but leads to unresponsive UIs and poor scalability in concurrent environments. Asynchronous API calls, on the other hand, initiate the API request and immediately return control to the calling thread. The actual waiting and processing happen in the background, typically on a different thread or using non-blocking I/O. This matters because asynchronous calls enable higher responsiveness, better resource utilization, and superior scalability by preventing threads from idling while waiting for I/O operations to complete.
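The contrast can be shown in a few lines of Java; the sleep stands in for network I/O, and the names are invented for the example:

```java
import java.util.concurrent.CompletableFuture;

public class SyncVsAsync {
    // Stand-in for an API call that takes time; the sleep simulates network I/O.
    static String fetch() {
        try {
            Thread.sleep(200);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return "data";
    }

    public static void main(String[] args) {
        // Synchronous: the calling thread is blocked for the whole round trip.
        String sync = fetch();

        // Asynchronous: control returns immediately; the continuation runs later
        // on another thread while the caller is free to do other work.
        CompletableFuture<String> async = CompletableFuture.supplyAsync(SyncVsAsync::fetch)
                .thenApply(d -> d + "-processed");

        System.out.println(sync);         // data
        System.out.println(async.join()); // data-processed
    }
}
```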
2. When should I use CompletableFuture versus the traditional Future with an ExecutorService?
You should prefer CompletableFuture over traditional Future in most modern Java applications, especially when dealing with complex asynchronous workflows or dependent API calls. While Future allows you to submit tasks to an ExecutorService and retrieve results, its get() method is blocking. CompletableFuture extends Future by providing powerful non-blocking composition methods (like thenApply, thenCompose, allOf, anyOf) that allow you to chain operations, combine results, and handle errors declaratively without blocking threads, leading to more readable, robust, and scalable code. Use Future if your needs are very basic and you only need to submit a task and block once for its result.
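A short sketch of the non-blocking chaining that a plain Future cannot express — the two fetch methods are hypothetical stand-ins for dependent API calls:

```java
import java.util.concurrent.CompletableFuture;

public class ComposeDemo {
    // Hypothetical two-step pipeline: fetch a user id, then fetch that user's profile.
    static CompletableFuture<String> fetchUserId() {
        return CompletableFuture.supplyAsync(() -> "42");
    }

    static CompletableFuture<String> fetchProfile(String id) {
        return CompletableFuture.supplyAsync(() -> "profile-of-" + id);
    }

    public static void main(String[] args) {
        String profile = fetchUserId()
                .thenCompose(ComposeDemo::fetchProfile) // chains without blocking any thread
                .join();                                // single blocking exit point at the edge
        System.out.println(profile); // profile-of-42
    }
}
```

With a plain Future, the equivalent would require calling get() on the first future (blocking a thread) before the second call could even be submitted.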
3. How can I handle multiple parallel API calls and wait for all of them to complete in Java?
There are several effective ways:
- ExecutorService with Future.get(): Submit each API call as a Callable to an ExecutorService, get a Future for each, and then iterate through the Futures calling get() on each (preferably with a timeout). This approach blocks when retrieving results.
- CountDownLatch: Initialize a CountDownLatch with the number of API calls. Each worker thread making an API call invokes countDown() when finished; the main thread calls latch.await() to block until all calls complete.
- CompletableFuture.allOf(): Often the most modern and non-blocking way. Launch all API calls as CompletableFutures, then use CompletableFuture.allOf(future1, future2, ...).join() (or whenComplete) to wait for all to finish. You then retrieve individual results from each CompletableFuture.
- Reactive Programming (Project Reactor/RxJava): Use Flux.zip() or Mono.zip() to combine multiple Monos/Fluxes, which complete when all source streams emit their elements.
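The CountDownLatch option can be sketched as follows. The worker threads stand in for API calls; in real code you would submit tasks to a thread pool rather than spawn raw threads:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.CountDownLatch;

public class LatchDemo {
    // Fires n workers (each a stand-in for an API call) and blocks until all finish.
    static List<String> gather(int n) {
        CountDownLatch latch = new CountDownLatch(n);
        List<String> results = Collections.synchronizedList(new ArrayList<>());
        for (int i = 0; i < n; i++) {
            int id = i;
            new Thread(() -> {
                try {
                    results.add("response-" + id); // the simulated API response
                } finally {
                    latch.countDown();             // always count down, even on failure
                }
            }).start();
        }
        try {
            latch.await(); // blocks the caller until the count reaches zero
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return results;
    }

    public static void main(String[] args) {
        System.out.println(gather(3).size()); // 3
    }
}
```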
4. What are some essential best practices for ensuring resilience in API interactions?
Key best practices for resilience include:
- Implement Comprehensive Timeouts: Set connection, request, and overall operation timeouts at the HTTP client, Future, and reactive stream levels to prevent indefinite hangs.
- Automate Retry Mechanisms: Use intelligent retry logic (e.g., with exponential backoff and jitter) to handle transient API failures.
- Apply Circuit Breaker Patterns: Integrate libraries like Resilience4j to quickly fail requests to unhealthy services, protecting both your application and the external API from cascading failures.
- Implement Fallbacks: Define alternative actions or default values when an API call fails to ensure graceful degradation of functionality.
- Use Connection Pooling: Reuse HttpClient instances with connection pooling to optimize network resource usage and reduce latency.
- Monitor and Log: Implement robust logging and monitoring (metrics, distributed tracing) to quickly identify and diagnose API-related issues in production.
5. When should I consider using an API Gateway or an API Management platform like APIPark?
You should consider an API Gateway or an API Management platform when your application or organization starts to manage a significant number of internal and/or external APIs, especially in microservices architectures. These platforms centralize critical cross-cutting concerns that are tedious and error-prone to implement in every service, such as:
- Security: Authentication, authorization, rate limiting, and traffic control.
- Traffic Management: Load balancing, routing, and versioning.
- Monitoring and Analytics: Centralized logging, performance metrics, and usage analytics.
- Transformation: Request/response payload transformations.
- Developer Portal: Publishing APIs, managing access, and providing documentation for consumers.

Platforms like APIPark offer comprehensive solutions for the entire API lifecycle, including specialized features for integrating and managing AI models, providing significant benefits in terms of efficiency, security, and scalability for managing your API ecosystem.
You can securely and efficiently call the OpenAI API on APIPark in just two steps:
Step 1: Deploy the APIPark AI gateway in 5 minutes.
APIPark is developed based on Golang, offering strong product performance and low development and maintenance costs. You can deploy APIPark with a single command line.
curl -sSO https://download.apipark.com/install/quick-start.sh; bash quick-start.sh

In our experience, the successful deployment interface appears within 5 to 10 minutes. You can then log in to APIPark using your account.

Step 2: Call the OpenAI API.
