By apipark — 07 Nov 2025

Mastering Kubernetes Controllers: Watching CRD Changes

controller to watch for changes to crd

Kubernetes has irrevocably transformed the landscape of cloud-native application deployment and management. At its core, Kubernetes operates on a declarative model: you tell it what you want, and it works tirelessly to make it happen. This paradigm is powered by an intricate dance of "controllers" that continuously observe the cluster's actual state, compare it against your desired state, and take corrective actions to bridge any discrepancies. While Kubernetes comes packed with a rich set of built-in resources like Pods, Deployments, and Services, the true power of its extensibility shines through Custom Resource Definitions (CRDs). CRDs allow users to define their own resource types, effectively extending the Kubernetes API to manage domain-specific objects directly within the cluster.

However, defining a CRD is only half the battle. To bring these custom resources to life, you need a custom controller – an operator that knows how to interpret your CRD, react to its changes, and orchestrate the underlying infrastructure or application logic accordingly. This journey into mastering Kubernetes controllers, particularly those tasked with watching CRD changes, is fundamental for anyone looking to build robust, Kubernetes-native applications and operators. It delves into the very heart of how Kubernetes maintains its desired state, offering a profound understanding of its internal mechanisms and the power it vests in its users to tailor its capabilities to virtually any workload.

This comprehensive guide will unpack the intricacies of Kubernetes controllers, the foundational role of the Kubernetes API server, the mechanics of CRDs, and, most importantly, the sophisticated art of efficiently watching for changes in these custom resources. We will explore the tools, patterns, and best practices involved in crafting a resilient and effective controller, ensuring that your custom logic responds promptly and accurately to the dynamic ebb and flow of your Kubernetes environment.

The Foundation: Kubernetes Controllers and the Desired State

At the heart of Kubernetes' self-healing and automation capabilities lies the concept of a "controller." A Kubernetes controller is essentially a control loop that constantly monitors the state of your cluster and makes changes to move the current state towards the desired state. This is a continuous, never-ending process. Think of it as a vigilant guardian, always observing, always adjusting.

Each built-in Kubernetes resource—be it a Pod, Deployment, ReplicaSet, or Service—is managed by a specific controller. For instance, the Deployment controller watches Deployment objects. When it detects a new Deployment, it creates a ReplicaSet to manage a specified number of Pods. If a Pod dies, the ReplicaSet controller detects the discrepancy and creates a new Pod to maintain the desired count. This elegant feedback loop is what makes Kubernetes so powerful and resilient.

The Anatomy of a Control Loop

A typical control loop, at a high level, follows these steps:

Observe: The controller watches for changes in a specific set of resources within the Kubernetes cluster. This observation is typically done by interacting with the Kubernetes API server.
Analyze: Upon detecting a change (or periodically), the controller fetches the current state of the relevant resources.
Compare: It then compares the current state with the desired state, which is usually defined in the resource's specification.
Act: If a discrepancy is found, the controller takes action to reconcile the difference, moving the current state closer to the desired state. This might involve creating, updating, or deleting other Kubernetes resources, or interacting with external systems.
Repeat: The loop then continues, ensuring that the system continuously converges towards its intended configuration.

This iterative process is fundamental to how Kubernetes operates. Without controllers, Kubernetes would merely be a static configuration store; it's the controllers that imbue it with its dynamic, self-managing intelligence.

The Central Hub: The Kubernetes API Server

Every interaction with a Kubernetes cluster, whether initiated by a user, an administrator, or another controller, flows through the Kubernetes API server. This server is the front-end for the Kubernetes control plane, exposing the Kubernetes API itself. It's the only component that directly communicates with the etcd data store, where all cluster data is persistently stored.

The API server serves several critical functions:

RESTful Interface: It provides a consistent, RESTful API through which all cluster operations are performed. This API is both human-readable (via kubectl) and machine-consumable, making it ideal for automation and programmatic interaction.
Authentication and Authorization: Before any request is processed, the API server authenticates the client and authorizes the requested action based on Kubernetes' Role-Based Access Control (RBAC) policies. This ensures that only authorized entities can perform specific operations.
Admission Control: After authentication and authorization, admission controllers intercept requests to the API server. These webhooks can mutate (change) or validate (reject) requests before they are persisted in etcd. This provides an additional layer of policy enforcement and configuration management.
Data Validation: The API server validates the incoming data structure against the schema defined for each resource type, ensuring data integrity.
Event Publishing: Crucially for controllers, the API server publishes events whenever a resource changes. Controllers subscribe to these event streams to detect alterations in the cluster state.

Resources: The Building Blocks of Kubernetes

Everything in Kubernetes is represented as a resource. A resource is an endpoint in the Kubernetes API that stores a collection of API objects of a certain kind. For example, pods is a resource that stores Pod objects. These resources can be broadly categorized into:

Built-in Resources: These are the standard resources that come with Kubernetes out-of-the-box, such as Pods, Deployments, Services, ConfigMaps, Secrets, etc. They are defined within the core Kubernetes project and are fundamental to its operation.
Custom Resources: These are resources that users define themselves using Custom Resource Definitions (CRDs). They allow extending Kubernetes' capabilities to manage application-specific or domain-specific objects as if they were native Kubernetes resources.

The API server treats custom resources with the same respect as built-in ones. Once a CRD is registered, the API server exposes endpoints for managing instances of that custom resource, allowing kubectl and other clients to interact with them seamlessly. This unification under a single API is a cornerstone of Kubernetes' power and flexibility.

Deep Dive into Custom Resource Definitions (CRDs)

Custom Resource Definitions (CRDs) are the mechanism by which you can extend the Kubernetes API with your own custom resource types. Before CRDs, extending Kubernetes often involved API aggregation, which was more complex. CRDs simplify this process dramatically, allowing developers to define new "kinds" of objects that behave like native Kubernetes objects.

When you create a CRD, you're essentially telling the Kubernetes API server about a new resource kind that it should manage. This doesn't, however, teach Kubernetes how to do anything with that resource; it merely provides a schema and endpoint for storing and retrieving objects of that kind. The "doing" part falls to your custom controller.

Defining a CRD

A CRD is itself a Kubernetes resource, specified in YAML. Here's a conceptual breakdown of its key components:

apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: myresources.stable.example.com
spec:
  group: stable.example.com # The API group for your resource
  versions:
    - name: v1 # The API version
      served: true
      storage: true
      schema:
        openAPIV3Schema: # OpenAPI v3 schema for validation
          type: object
          properties:
            spec:
              type: object
              properties:
                image:
                  type: string
                  description: The image to use for the resource.
                replicas:
                  type: integer
                  minimum: 1
                  description: The number of desired replicas.
            status: # Optional: for controllers to report current state
              type: object
              properties:
                availableReplicas:
                  type: integer
                  description: The number of available replicas.
  scope: Namespaced # Or Cluster, if it's a cluster-scoped resource
  names:
    plural: myresources
    singular: myresource
    kind: MyResource
    shortNames:
      - mr

Let's dissect this:

group: This defines the API group for your custom resource. It typically follows a reverse domain name convention (e.g., stable.example.com). This helps avoid naming collisions and provides logical grouping.
versions: CRDs support multiple API versions (e.g., v1alpha1, v1beta1, v1). Each version can have its own schema. You must designate one version as storage: true (the version stored in etcd) and one or more as served: true (available via the API server).
scope: This specifies whether the custom resource is Namespaced (like Pods) or Cluster (like Nodes).
names: These define the various names for your resource:
- plural: The plural name used in API URLs (e.g., kubectl get myresources).
- singular: The singular name used for single instances.
- kind: The CamelCase name of your resource object (e.g., kind: MyResource).
- shortNames: Optional, convenient aliases for kubectl.
schema (openAPIV3Schema): This is crucial. It defines the structure and validation rules for your custom resource's spec (desired state) and status (current observed state). The Kubernetes API server uses this schema to validate objects created with your CRD, ensuring data integrity. Strong schema validation is vital for robust controllers.
status Subresource: By adding subresources: { status: {} } to a version definition, you enable a separate /status endpoint for your custom resource. This allows controllers to update the status field independently without requiring full write access to the entire resource, improving security and concurrency. This separation is a best practice, as user-defined spec and controller-managed status should ideally be distinct.
scale Subresource: Similarly, subresources: { scale: {} } enables the /scale endpoint, allowing Horizontal Pod Autoscalers (HPA) to interact with your custom resource to adjust its desired replica count.

Once a CRD is applied to a cluster, the Kubernetes API server dynamically updates to include the new resource type. kubectl get will then recognize myresources, and you can create instances of MyResource using standard Kubernetes YAML syntax.

Why CRDs are Game-Changers

CRDs are fundamental to the operator pattern. An "operator" is essentially a custom controller that manages a CRD, encapsulating operational knowledge for specific applications (e.g., a Cassandra Operator, a Prometheus Operator). They extend Kubernetes' automation capabilities beyond basic container orchestration to managing complex stateful applications, databases, or even external infrastructure components. By defining application-specific concepts as Kubernetes resources, operators can leverage Kubernetes' powerful control plane primitives (like scheduling, rolling updates, self-healing) to manage these higher-level abstractions. This allows developers to interact with complex applications using a familiar Kubernetes-native API, simplifying deployment and operational tasks dramatically.

Understanding Kubernetes Controllers in Depth: Informers, Listers, and Workqueues

Now that we understand the role of controllers and CRDs, let's peel back the layers and examine the core components and patterns used to build robust controllers, particularly how they efficiently watch for changes in the Kubernetes API. The client-go library in Go is the de-facto standard for building Kubernetes controllers, and it provides these foundational abstractions.

The Problem: Efficiently Watching for Changes

A naive approach to watching changes would be to constantly poll the API server (e.g., GET /myresources every few seconds). This is highly inefficient, places unnecessary load on the API server and etcd, and introduces latency in reaction times. A better approach is to leverage the API server's "watch" mechanism.

The Kubernetes API server supports long-lived HTTP GET requests that return a stream of events (ADDED, MODIFIED, DELETED) for a given resource type. This "watch" API is incredibly efficient for real-time notifications. However, directly managing these watch connections, handling disconnections, ensuring proper resource versions, and building a consistent local cache is complex. This is where client-go's informers come into play.

Informers: The Cornerstone of Efficient Watching

Informers are the most critical component for building reliable and performant Kubernetes controllers. An informer provides a mechanism to:

Watch Resources: Establish and maintain a watch connection to the Kubernetes API server for a specific resource type. It handles connection retries, ensures proper resourceVersion handling to avoid missing events, and keeps the local cache up-to-date.
Cache Resources Locally: Maintain a local, in-memory cache of the watched resources. This cache, often implemented using an Indexer, allows controllers to query resources without hitting the API server for every request, significantly reducing load and improving performance.
Provide Event Notifications: Call user-defined event handler functions (OnAdd, OnUpdate, OnDelete) whenever a change is detected.

The informer pattern is critical for several reasons:

Reduced API Server Load: By maintaining a local cache and only watching for deltas, informers drastically reduce the number of direct API calls made by controllers.
Improved Performance: Controllers can query their local cache almost instantly, rather than waiting for an API server round trip.
Consistency: Informers ensure that the local cache is eventually consistent with the API server. They achieve this by combining a list operation (to initially populate the cache) with a watch operation (to keep it updated). If the watch stream is interrupted, the informer re-lists the resources to resynchronize, preventing data loss or inconsistencies.
Shared Informers: For clusters with many controllers, it's common for multiple controllers to need to watch the same resource type (e.g., multiple controllers needing to know about all Pods). SharedInformers allow multiple controllers to share a single watch connection and local cache, further optimizing API server interactions and memory usage.

Listers: Querying the Local Cache

While informers are responsible for populating and maintaining the local cache, "listers" provide a convenient and type-safe way to query that cache. A lister for a specific resource type (e.g., PodLister or MyResourceLister) allows you to retrieve objects by name, namespace, or by listing all objects matching certain criteria, all without making any network calls to the API server. This makes reconciliation loops fast and efficient.

Workqueues: Decoupling Event Handling from Reconciliation

When an informer detects a change, it calls the registered event handlers. Directly processing complex reconciliation logic within these handlers is generally a bad practice. Reconciliation can be time-consuming, involve external API calls, and might fail, requiring retries. If reconciliation blocks the event handler, it can prevent other events from being processed, leading to a backlog and stale caches.

This is where "workqueues" come in. A workqueue is an in-memory queue that decouples the event handling from the actual reconciliation logic.

The typical flow is:

Event Received: An informer's event handler (OnAdd, OnUpdate, OnDelete) is triggered.
Enqueue Key: Instead of performing reconciliation immediately, the handler simply extracts a unique identifier for the changed object (e.g., namespace/name) and adds this "key" to the workqueue.
Worker Goroutines: One or more worker goroutines continuously pull keys from the workqueue.
Process Item: Each worker processes a key by calling the controller's main reconciliation function.
Retry Logic: If reconciliation fails, the key can be re-added to the workqueue with an exponential backoff, ensuring that transient errors don't cause permanent failures and avoiding busy-retries that could overload external systems. This guarantees eventual consistency.

Workqueues provide several benefits:

Concurrency: Multiple worker goroutines can process items from the queue concurrently, improving throughput.
Rate Limiting and Backoff: Workqueues often come with built-in rate-limiting capabilities, allowing you to control how frequently failed items are retried.
Error Handling: By putting items back on the queue after a failure, controllers can gracefully handle transient errors and ensure that every item is eventually processed successfully (or marked as permanently failed after too many retries).
Decoupling: They clearly separate the responsibilities of event detection (informers) and state reconciliation (workers).

The Client-Go Ecosystem

The client-go library provides these components as part of its cache and tools/cache packages. When building a custom controller, you'll typically interact with:

kubernetes.Clientset: A client for built-in Kubernetes resources.
MyResourceClient (typed clientset): A client specifically generated for your custom resource type.
InformerFactory: Used to create informers for various resource types, often shared across controllers.
SharedInformer: The actual informer that handles listing, watching, and caching.
Lister: For reading from the informer's cache.
Workqueue: For processing reconciliation requests.

By leveraging these abstractions, client-go makes it significantly easier to build robust, efficient, and idiomatic Kubernetes controllers that interact with the API server in a well-behaved manner. The complexity of managing watch connections, caching, and error handling is largely abstracted away, allowing controller developers to focus on their specific reconciliation logic.

Building a Custom Controller for CRDs: A Practical Guide

Developing a custom controller involves several distinct steps, from defining your custom resource to implementing the core reconciliation logic. This section outlines the process, emphasizing the interplay between CRDs, client-go components, and the reconciliation loop. While full code will not be provided due to length constraints, the conceptual steps and their rationale will be detailed.

Prerequisites

Before embarking on controller development, ensure you have:

Go Language Environment: Controllers are predominantly written in Go.
Kubernetes Cluster: A running Kubernetes cluster (e.g., minikube, kind, or a cloud-managed cluster) to deploy and test your CRD and controller.
kubectl: For interacting with the cluster.
controller-gen: A tool from the kubernetes-sigs/controller-tools project, essential for generating Go types from your CRD and various boilerplate code.
kustomize: Useful for managing Kubernetes manifests.

Step-by-Step Controller Development

Let's imagine we want to build a controller for a MyResource CRD that manages a simple external message queue subscription.

1. Define the Custom Resource Definition (CRD)

First, create the myresource.yaml file that defines your custom resource. This includes the apiVersion, kind, metadata, spec and status fields, and crucially, the OpenAPI v3 schema for validation.

# crds/myresource.yaml
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: subscriptions.example.com
spec:
  group: example.com
  versions:
    - name: v1
      served: true
      storage: true
      schema:
        openAPIV3Schema:
          type: object
          properties:
            spec:
              type: object
              properties:
                topic:
                  type: string
                  description: The message queue topic to subscribe to.
                endpoint:
                  type: string
                  description: The endpoint to send messages to.
              required: ["topic", "endpoint"]
            status:
              type: object
              properties:
                phase:
                  type: string
                  description: Current state of the subscription (e.g., Pending, Active, Failed).
                message:
                  type: string
                  description: Detailed status message.
  scope: Namespaced
  names:
    plural: subscriptions
    singular: subscription
    kind: Subscription
    shortNames:
      - sub

Apply this CRD to your cluster: kubectl apply -f crds/myresource.yaml

2. Generate Go Types from CRD

Use controller-gen to generate the Go struct definitions for your Subscription resource, including SubscriptionList, SchemeGroupVersion, and DeepCopy methods. This tool reads Go markers (comments like //+kubebuilder:resource:path=subscriptions,scope=Namespaced) and the CRD YAML to produce client-go-compatible types.

You'll define a basic Go struct like:

// pkg/apis/example/v1/subscription_types.go
package v1

import (
    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// +genclient
// +k8s:deepcopy-gen:interfaces=k8s.io/apimachinery/pkg/runtime.Object

// Subscription is the Schema for the subscriptions API
type Subscription struct {
    metav1.TypeMeta   `json:",inline"`
    metav1.ObjectMeta `json:"metadata,omitempty"`

    Spec   SubscriptionSpec   `json:"spec,omitempty"`
    Status SubscriptionStatus `json:"status,omitempty"`
}

// +k8s:deepcopy-gen:interfaces=k8s.io/apimachinery/pkg/runtime.Object

// SubscriptionList contains a list of Subscription
type SubscriptionList struct {
    metav1.TypeMeta `json:",inline"`
    metav1.ListMeta `json:"metadata,omitempty"`
    Items           []Subscription `json:"items"`
}

// SubscriptionSpec defines the desired state of Subscription
type SubscriptionSpec struct {
    Topic    string `json:"topic"`
    Endpoint string `json:"endpoint"`
}

// SubscriptionStatus defines the observed state of Subscription
type SubscriptionStatus struct {
    Phase   string `json:"phase,omitempty"`
    Message string `json:"message,omitempty"`
}

Then run controller-gen (typically with make generate in a scaffolded project) to create client-go specific code in pkg/client/.... This includes: * Clientset: A client library to interact with your custom resource. * Informers: Components to watch and cache your custom resources. * Listers: To query the local cache.

3. Implement the Controller Logic

The core controller implementation will reside in pkg/controller/subscription/subscription_controller.go (or similar).

a. Main Controller Structure:

Define a Controller struct that holds references to the clients, informers, and workqueue.

// pkg/controller/subscription/subscription_controller.go
package subscription

import (
    "context"
    "fmt"
    "time"

    "k8s.io/apimachinery/pkg/api/errors"
    "k8s.io/apimachinery/pkg/util/runtime"
    "k8s.io/apimachinery/pkg/util/wait"
    "k8s.io/client-go/kubernetes"
    "k8s.io/client-go/tools/cache"
    "k8s.io/client-go/util/workqueue"
    "k8s.io/klog/v2"

    clientset "your_module/pkg/client/clientset/versioned"
    informers "your_module/pkg/client/informers/externalversions/example/v1"
    listers "your_module/pkg/client/listers/example/v1"
    apiv1 "your_module/pkg/apis/example/v1"
)

// Controller is the controller for Subscription resources.
type Controller struct {
    kubeclientset    kubernetes.Interface
    subscriptionclientset clientset.Interface

    subscriptionsLister listers.SubscriptionLister
    subscriptionsSynced cache.InformerSynced

    workqueue workqueue.RateLimitingInterface
    // recorder  events.EventRecorder // Optional: for recording Kubernetes events
}

// NewController creates a new Subscription Controller.
func NewController(
    kubeclientset kubernetes.Interface,
    subscriptionclientset clientset.Interface,
    subscriptionInformer informers.SubscriptionInformer) *Controller {

    c := &Controller{
        kubeclientset:    kubeclientset,
        subscriptionclientset: subscriptionclientset,
        subscriptionsLister: subscriptionInformer.Lister(),
        subscriptionsSynced: subscriptionInformer.Informer().HasSynced,
        workqueue: workqueue.NewNamedRateLimitingQueue(workqueue.DefaultControllerRateLimiter(), "Subscriptions"),
        // recorder: recorder,
    }

    klog.Info("Setting up event handlers for Subscription")
    subscriptionInformer.Informer().AddEventHandler(cache.ResourceEventHandlerFuncs{
        AddFunc: c.enqueueSubscription,
        UpdateFunc: func(old, new interface{}) {
            oldSub := old.(*apiv1.Subscription)
            newSub := new.(*apiv1.Subscription)
            if oldSub.ResourceVersion == newSub.ResourceVersion {
                // No actual changes, just a resync
                return
            }
            c.enqueueSubscription(new)
        },
        DeleteFunc: c.enqueueSubscriptionForDelete,
    })

    return c
}

b. Event Handlers and Enqueuing:

The AddFunc, UpdateFunc, and DeleteFunc registered with the informer are lightweight. Their primary role is to extract the key (namespace/name) of the affected resource and add it to the workqueue.

func (c *Controller) enqueueSubscription(obj interface{}) {
    var key string
    var err error
    if key, err = cache.MetaNamespaceKeyFunc(obj); err != nil {
        runtime.HandleError(err)
        return
    }
    c.workqueue.Add(key)
}

func (c *Controller) enqueueSubscriptionForDelete(obj interface{}) {
    var key string
    var err error
    // Use DeletionHandlingMetaNamespaceKeyFunc for deleted objects to get the correct key even if the object is already gone from the cache.
    if key, err = cache.DeletionHandlingMetaNamespaceKeyFunc(obj); err != nil {
        runtime.HandleError(err)
        return
    }
    c.workqueue.Add(key)
}

c. The Run Loop and Worker Goroutines:

The Run method starts the controller. It waits for caches to be synced, then launches worker goroutines that continuously pull items from the workqueue.

func (c *Controller) Run(workers int, stopCh <-chan struct{}) error {
    defer runtime.HandleCrash()
    defer c.workqueue.ShutDown()

    klog.Info("Starting Subscription controller")

    klog.Info("Waiting for informer caches to sync")
    if ok := cache.WaitForCacheSync(stopCh, c.subscriptionsSynced); !ok {
        return fmt.Errorf("failed to wait for caches to sync")
    }

    klog.Info("Starting workers")
    for i := 0; i < workers; i++ {
        go wait.Until(c.runWorker, time.Second, stopCh)
    }

    klog.Info("Started workers")
    <-stopCh
    klog.Info("Shutting down workers")

    return nil
}

func (c *Controller) runWorker() {
    for c.processNextWorkItem() {
    }
}

d. Processing Work Items:

processNextWorkItem retrieves a key from the workqueue and calls the main reconcile function. It also handles error management and putting items back into the queue if reconciliation fails.

func (c *Controller) processNextWorkItem() bool {
    obj, shutdown := c.workqueue.Get()

    if shutdown {
        return false
    }

    // We call Done here so the workqueue knows we have finished
    // processing this item. We also use a defer so that even if
    // we panic, we call Done.
    defer c.workqueue.Done(obj)

    var key string
    var ok bool
    if key, ok = obj.(string); !ok {
        // As the item in the workqueue is actually of type string, we cast it to a string.
        runtime.HandleError(fmt.Errorf("expected string in workqueue but got %#v", obj))
        c.workqueue.Forget(obj) // Remove the item if it's not a string
        return true
    }

    // Run the syncHandler, passing it the namespace/name string of the
    // Foo resource to be synced.
    if err := c.syncHandler(key); err != nil {
        // Put the item back on the workqueue to handle any transient errors.
        c.handleErr(err, key)
        return true
    }

    // If no error occurs, we Forget this item so it's not retried.
    c.workqueue.Forget(obj)
    klog.Infof("Successfully synced '%s'", key)
    return true
}

func (c *Controller) handleErr(err error, key interface{}) {
    if c.workqueue.NumRequeues(key) < 5 { // Retry up to 5 times
        klog.Infof("Error syncing '%s': %v, retrying...", key, err)
        c.workqueue.AddRateLimited(key) // Add back with exponential backoff
        return
    }

    runtime.HandleError(fmt.Errorf("dropping Subscription '%s' out of workqueue after too many retries: %v", key, err))
    c.workqueue.Forget(key) // Give up after too many retries
}

e. The reconcile Function (syncHandler): The Core Logic

This is where the actual desired state reconciliation happens.

func (c *Controller) syncHandler(key string) error {
    namespace, name, err := cache.SplitMetaNamespaceKey(key)
    if err != nil {
        runtime.HandleError(fmt.Errorf("invalid resource key: %s", key))
        return nil
    }

    // Get the Subscription resource from the informer's cache.
    subscription, err := c.subscriptionsLister.Subscriptions(namespace).Get(name)
    if err != nil {
        // The Subscription resource may no longer exist, in which case we stop processing.
        if errors.IsNotFound(err) {
            klog.Infof("Subscription '%s' in work queue no longer exists, perhaps it was deleted", key)
            // Handle cleanup if necessary (e.g., delete external resources)
            return nil
        }
        return err // Requeue this item
    }

    // --- Core Reconciliation Logic ---
    // Here's where you compare desired state (subscription.Spec) with actual state (external system).

    // Example: Check if the external message queue subscription exists and matches spec.
    // You would interact with an external message queue API here.
    currentPhase := subscription.Status.Phase
    desiredPhase := "Active" // Assume active state upon successful external provisioning

    // Simulate external API call
    klog.Infof("Reconciling Subscription: %s/%s", subscription.Namespace, subscription.Name)
    if subscription.Status.Phase == "" || subscription.Status.Phase == "Pending" {
        klog.Infof("Creating external subscription for topic '%s' at endpoint '%s'", subscription.Spec.Topic, subscription.Spec.Endpoint)
        // In a real scenario, make an API call to your message queue provider (e.g., AWS SQS, Kafka, RabbitMQ)
        // and create the subscription. Handle potential errors.
        // For now, simulate success.
        time.Sleep(2 * time.Second) // Simulate network latency

        // Update the status of the Subscription resource in Kubernetes
        subscriptionCopy := subscription.DeepCopy()
        subscriptionCopy.Status.Phase = "Active"
        subscriptionCopy.Status.Message = "External subscription created successfully."

        // Use the client to update the status subresource
        _, err = c.subscriptionclientset.ExampleV1().Subscriptions(subscription.Namespace).UpdateStatus(context.TODO(), subscriptionCopy, metav1.UpdateOptions{})
        if err != nil {
            klog.Errorf("Failed to update status for Subscription '%s': %v", key, err)
            return err // Requeue
        }
        klog.Infof("Updated status of Subscription '%s' to Active", key)
        // c.recorder.Event(subscription, corev1.EventTypeNormal, "Created", "External subscription created.")

    } else if subscription.Status.Phase == "Active" {
        // Check for changes in spec and update external resource if needed
        // For example, if subscription.Spec.Endpoint changed, update the external subscription.
        klog.Infof("Subscription '%s' is already active. Verifying consistency...", key)
        // In a real scenario, query external system to ensure it matches desired state.
        // If external state differs, update external system.
        // If external state matches, nothing to do.

    }

    // Handle deletion: If the Subscription resource was deleted, our enqueueSubscriptionForDelete will add its key
    // to the workqueue. If Get() returns IsNotFound, it means the resource is gone from cache.
    // Here we'd perform cleanup of external resources.
    // This specific example doesn't demonstrate explicit external cleanup for deletion.
    // A common pattern is to use finalizers for graceful cleanup.

    return nil
}

This syncHandler is the brain of your controller. It should be idempotent, meaning it can be called multiple times with the same input and produce the same desired output state without side effects. It reads the desired state from subscription.Spec, interacts with external systems (or other Kubernetes resources) to achieve that state, and then updates the subscription.Status to reflect the observed state.

4. The `main` Function: Wiring Everything Up

The main function is responsible for setting up the Kubernetes client, creating the informer factory, initializing the controller, and starting its run loop.

// cmd/controller/main.go
package main

import (
    "flag"
    "time"

    "k8s.io/client-go/kubernetes"
    "k8s.io/client-go/tools/clientcmd"
    "k8s.io/klog/v2"

    clientset "your_module/pkg/client/clientset/versioned"
    informers "your_module/pkg/client/informers/externalversions"
    "your_module/pkg/controller/subscription"
)

var (
    masterURL  string
    kubeconfig string
)

func main() {
    klog.InitFlags(nil)
    flag.Parse()

    // Set up signals so we can handle the first shutdown signal gracefully.
    stopCh := make(chan struct{})
    // signal.SetupSignalHandler() returns a channel that is closed when an interrupt is received.
    // For simplicity, we'll assume a direct signal.Notify for this example.
    // Usually: sigCh := signal.SetupSignalHandler(); <-sigCh

    cfg, err := clientcmd.BuildConfigFromFlags(masterURL, kubeconfig)
    if err != nil {
        klog.Fatalf("Error building kubeconfig: %s", err.Error())
    }

    kubeClient, err := kubernetes.NewForConfig(cfg)
    if err != nil {
        klog.Fatalf("Error building kubernetes clientset: %s", err.Error())
    }

    subscriptionClient, err := clientset.NewForConfig(cfg)
    if err != nil {
        klog.Fatalf("Error building example clientset: %s", err.Error())
    }

    // Create SharedInformerFactory for all custom resources under example.com group
    // Resync period: The informer will periodically re-list all resources even if no events were observed.
    // This helps in reconciling potential inconsistencies.
    subscriptionInformerFactory := informers.NewSharedInformerFactory(subscriptionClient, time.Second*30)

    // Get the informer for our specific Subscription resource
    subscriptionInformer := subscriptionInformerFactory.Example().V1().Subscriptions()

    // Create our controller
    controller := subscription.NewController(kubeClient, subscriptionClient, subscriptionInformer)

    // Start the informer factory (starts all registered informers)
    subscriptionInformerFactory.Start(stopCh)

    // Run the controller
    if err = controller.Run(2, stopCh); err != nil { // 2 worker goroutines
        klog.Fatalf("Error running controller: %s", err.Error())
    }
}

func init() {
    flag.StringVar(&kubeconfig, "kubeconfig", "", "Path to a kubeconfig. Only required if running outside of a cluster.")
    flag.StringVar(&masterURL, "master", "", "The address of the Kubernetes API server. Overrides any value in kubeconfig. Only required if running outside of a cluster.")
}

This main function demonstrates how to initialize the kubeclientset (for built-in resources), your subscriptionclientset (for your CRD), and the SharedInformerFactory. It then creates your controller, passing the necessary informers, and finally starts the informers and the controller's main run loop.

Deployment

To deploy your controller:

Build Docker Image: Containerize your Go controller application into a Docker image.
Kubernetes Manifests: Create Deployment, ServiceAccount, ClusterRole, and ClusterRoleBinding (or Role/RoleBinding for namespaced controllers) YAMLs to deploy your controller in the cluster and grant it the necessary RBAC permissions to watch and manage your CRD instances, as well as any other resources it needs to interact with (e.g., Pods, Services).

A minimal ClusterRole for our example might look like:

# deploy/rbac.yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: subscription-controller-role
rules:
- apiGroups:
  - "" # Core API group
  resources:
  - events
  verbs:
  - create
  - patch
- apiGroups:
  - example.com # Our custom API group
  resources:
  - subscriptions
  - subscriptions/status
  verbs:
  - get
  - list
  - watch
  - update
  - patch

This role allows the controller to watch, get, list, update, and patch subscriptions and its status subresource. It also allows creating events, which is good practice for controller debugging.

APIPark is a high-performance AI gateway that allows you to securely access the most comprehensive LLM APIs globally on the APIPark platform, including OpenAI, Anthropic, Mistral, Llama2, Google Gemini, and more.Try APIPark now! 👇👇👇

Install APIPark – it’s free

Advanced Topics and Best Practices in Controller Development

Building a basic controller is a great start, but creating a production-ready, resilient, and secure controller requires attention to several advanced topics and best practices.

Idempotency: The Golden Rule

As mentioned, controllers must be idempotent. This means that applying the same desired state multiple times should always result in the same actual state, without causing unintended side effects. For example, if your controller creates a Pod, calling the creation logic twice for the same desired Pod should not result in two Pods. Kubernetes API calls are generally idempotent themselves (e.g., Create fails if the resource exists, Update only applies if resourceVersion matches). However, when interacting with external systems, you must design your external API calls to be idempotent or implement checks within your controller to ensure actions are only taken when truly necessary.

Event Handling and Workqueue Design

Debouncing Events: Controllers often receive multiple Update events for a single resource within a short timeframe (e.g., status updates). If your reconciliation logic is heavy, you might want to debounce these events. The workqueue's rate-limiting capabilities help with this by not immediately re-adding an item if it's already in the queue or if it was recently processed.
Handling DELETED Objects Gracefully (Finalizers): When a resource is deleted, the informer's OnDelete handler is triggered. However, by the time your reconciliation function runs, Lister.Get() will return "not found." If your controller needs to perform cleanup actions on external systems (e.g., delete an external database, unregister an external API endpoint), simply acting on OnDelete might not be enough if the controller crashes or the cleanup fails. Finalizers are the idiomatic Kubernetes way to handle this.
- When a resource with a finalizer is marked for deletion, Kubernetes doesn't immediately remove it from etcd. Instead, it adds a deletion timestamp and waits until all finalizers are removed.
- Your controller's reconciliation loop then detects the deletion timestamp, performs cleanup, and then removes its finalizer. Once all finalizers are removed, Kubernetes can safely delete the object. This pattern ensures that cleanup actions are reliably performed even in the face of controller restarts or failures.

Leader Election for High Availability

If you run multiple replicas of your controller for high availability, you must ensure that only one instance is actively reconciling at any given time to avoid race conditions and conflicting actions. This is achieved through leader election. client-go provides utilities for implementing leader election using ConfigMaps or Endpoints in Kubernetes. The active leader obtains a lease, and if it fails, another replica will acquire the lease and become the new leader. This guarantees that your control loop remains singular and consistent.

Status Subresource: Reflecting Reality

Always update the status subresource of your CRD to reflect the actual, observed state of the world as managed by your controller. The spec is the desired state, and the status is the current reality. Separating these concerns is crucial. Updating the status typically involves fetching a copy of the resource, modifying its status field, and then calling clientset.UpdateStatus(). This operation is often restricted by RBAC, allowing controllers to update status while other users might only be able to modify the spec.

Webhooks (Validating and Mutating Admission Webhooks)

While controllers react to changes in resources after they've been stored, Admission Webhooks intercept requests to the API server before they are stored.

Validating Admission Webhooks: These webhooks can reject requests if the resource does not meet certain criteria that cannot be expressed purely by the OpenAPI schema in the CRD. For example, ensuring that a referenced Secret actually exists or that a complex business rule is followed.
Mutating Admission Webhooks: These webhooks can change (mutate) incoming requests. For example, injecting default values into a resource's spec or adding labels and annotations automatically.

Webhooks extend the API server's admission control. They are synchronous and can prevent an object from ever being created or updated, offering a powerful way to enforce policies and augment resources at the point of creation/update, which is different from a controller that reacts after the fact.

Testing Custom Controllers

Rigorous testing is essential for controllers.

Unit Tests: Test individual functions and components in isolation (e.g., syncHandler logic given mock inputs).
Integration Tests: Test the controller's interaction with a real (or mock) Kubernetes API server, using client-go's fake clientsets. This verifies the control loop's overall behavior.
End-to-End (E2E) Tests: Deploy your controller and CRD to a real cluster (e.g., a test minikube instance), create custom resources, and assert that the controller correctly orchestrates the desired changes and updates the status.

Security Considerations: RBAC for Controllers

Your controller needs appropriate RBAC permissions to perform its duties. Grant it the least privilege necessary. For example:

list, watch, get permissions on its own CRD (and potentially other resources it needs to observe).
create, update, patch, delete permissions on the resources it creates or modifies (e.g., Pods, Deployments, Services) or its own CRD's status subresource.
Avoid granting * permissions unless absolutely necessary for cluster-level operators.

The API Management Perspective

As organizations increasingly leverage Kubernetes and custom resources to build sophisticated, distributed applications, the complexity of managing the underlying APIs escalates dramatically. Each microservice, each custom controller, and each external integration potentially introduces new API endpoints and interaction patterns. This proliferation of APIs necessitates a robust and centralized API management strategy to ensure consistency, security, and traceability across the entire service ecosystem.

Custom controllers, by their very nature, are often orchestrating interactions with various internal and external services, many of which expose their own APIs. For example, our Subscription controller might interact with a message queue's API to create subscriptions. If your custom controller is responsible for provisioning external services that themselves expose APIs (e.g., a controller that deploys a new instance of an AI model and exposes its inference API), then the importance of API lifecycle management becomes paramount.

For developers building advanced Kubernetes-native applications, particularly those integrating with AI services or complex microservice architectures, managing the resulting API landscape can be daunting. Products like APIPark offer comprehensive solutions for API management, providing an AI gateway and developer portal that helps streamline integration, deployment, and lifecycle management for both AI and REST services. This becomes particularly relevant when your custom controllers are orchestrating external systems that expose their own APIs, ensuring consistency, security, and traceability across your entire service ecosystem.

Consider a scenario where a custom Kubernetes controller manages the deployment and scaling of multiple AI model instances. Each instance might expose a slightly different inference API. Without a unified management layer, tracking, securing, and providing access to these dynamic AI APIs becomes a significant operational challenge. An API management platform can step in to:

Unify API Formats: Standardize how external consumers interact with various AI models, even if the underlying model APIs differ. This shields client applications from direct model changes.
Centralize Authentication and Authorization: Provide a single point for securing access to all APIs managed by your custom controllers, ensuring consistent security policies.
Manage API Lifecycle: From design and publication to versioning and decommissioning, such platforms help govern the entire lifecycle of APIs exposed by the services orchestrated by your controllers.
Monitor and Analyze API Usage: Offer detailed logging and analytics on API calls, crucial for understanding performance, identifying issues, and optimizing resource allocation—especially important in dynamic Kubernetes environments where controllers are constantly adjusting resources.

In essence, while Kubernetes controllers excel at managing the internal state of your cluster and its resources (including custom ones), an API management platform focuses on the external consumption and governance of the services and APIs these controllers (and the applications they manage) expose. The synergy between robust Kubernetes extensibility and powerful API management tools creates a resilient and manageable cloud-native infrastructure, particularly crucial as enterprises move towards hybrid and multi-cloud strategies involving diverse AI and microservice landscapes.

Challenges and Troubleshooting

Developing and operating custom controllers comes with its own set of challenges. Understanding common pitfalls and debugging strategies is crucial.

Controller "Flapping" or "Churn"

This occurs when a controller continuously makes changes that cause the desired state to diverge or cycle, rather than converging. This might be due to:

Non-Idempotent Logic: Actions that produce different results when run multiple times.
Race Conditions: Multiple controllers or external actors conflicting over the same resource.
Incorrect Status Updates: The controller's status updates cause it to re-reconcile unnecessarily.
External System Instability: Flaky external APIs causing repeated failures and retries.

Troubleshooting: Examine controller logs for repetitive reconciliation cycles. Use kubectl describe on your custom resource to see events and status changes. Ensure your reconciliation logic is truly idempotent.

ResourceVersion Conflicts

When updating a resource, you must specify its resourceVersion. If the resource has been updated by another entity since you last fetched it, your update will fail with a conflict error. This is optimistic concurrency control.

Troubleshooting: In your reconciliation loop, when fetching an object to update its status, always Get the latest version of the object before modifying and attempting an UpdateStatus. If a conflict occurs, re-fetch, re-apply your changes, and retry. The workqueue's retry mechanism is helpful here.

Slow Reconciliation Loops

A slow reconcile function can lead to lagging reactions to cluster changes.

Troubleshooting: * Profiling: Use Go's pprof to identify performance bottlenecks in your syncHandler. * External Calls: Minimize blocking external API calls. Consider using goroutines or asynchronous patterns if multiple external interactions are needed. * Excessive Logging: While good for debugging, verbose logging can slow down tight loops. Adjust log levels. * Cache Utilization: Ensure you are reading from the informer's local cache (Lister) as much as possible, rather than making direct API server calls for known resources.

Informer Synchronization Issues

If an informer fails to sync its cache, the controller might operate on stale data or never start processing events.

Troubleshooting: The controller.Run() function typically includes cache.WaitForCacheSync(). If this fails, investigate network issues between the controller and the API server, or RBAC permissions preventing the controller from listing/watching the required resources.

Debugging Techniques

Comprehensive Logging: Use klog/v2 for structured and configurable logging. Log entry and exit points of your syncHandler, significant decisions, and interactions with external systems.
Kubernetes Events: Use the EventRecorder provided by client-go to publish Kubernetes events on your custom resources. These events are visible with kubectl describe <my-resource>, offering a timeline of controller actions.
Metrics: Expose Prometheus metrics from your controller (e.g., workqueue_depth, reconciliation_duration_seconds). This provides operational visibility into its health and performance.
Local Debugging: Run your controller locally (outside the cluster) by pointing it to your Kubernetes cluster's API server using your kubeconfig. This allows for easier debugging with IDEs.

Conclusion

Mastering Kubernetes controllers, especially those that watch Custom Resource Definitions, is an indispensable skill for anyone deeply involved in cloud-native development and operations. It unlocks the full potential of Kubernetes as an extensible, programmable platform, enabling you to tailor its automation capabilities to your unique domain-specific needs. By understanding the foundational concepts of the Kubernetes API server, the mechanics of CRDs, and the elegant design patterns employed by client-go (Informers, Listers, Workqueues), you gain the power to build sophisticated operators that bring complex applications to life within the Kubernetes ecosystem.

The journey from defining a declarative API with a CRD to implementing a robust controller that continuously reconciles the desired state is one of deep learning and practical application. It requires careful consideration of idempotency, error handling, performance optimization, and security. Furthermore, as your Kubernetes landscape grows, the need for effective API management becomes undeniable, providing a crucial layer of governance and observability for the multitude of APIs orchestrated by your custom controllers and services.

Embrace the controller pattern, leverage the extensibility of CRDs, and build intelligent, self-managing systems that truly harness the power of Kubernetes, creating a more automated, resilient, and scalable future for your applications.

Glossary of Key Components

Here's a table summarizing the key client-go components discussed in the context of building Kubernetes controllers:

Component	Purpose	Key Benefits
Informer	Establishes a watch connection to the Kubernetes API server, continuously fetching updates for a specific resource type and maintaining a local, in-memory cache. Handles watch stream disconnections and resynchronizations.	Reduces API server load, provides real-time event notifications, ensures eventual consistency of local cache, handles network complexities.
Lister	Provides a read-only, type-safe interface to query the local cache maintained by an Informer.	Enables fast, local retrieval of resources without making network calls to the API server, improving controller performance.
Workqueue	An in-memory queue used to decouple event handling from the actual reconciliation logic. Event handlers enqueue item keys, and worker goroutines pull keys for processing.	Improves controller concurrency and throughput, provides built-in rate-limiting and exponential backoff for retries, gracefully handles transient errors, ensures eventual processing.
ClientSet	A generated Go client library that provides type-safe methods for interacting with Kubernetes API resources (both built-in and custom).	Simplifies interaction with the Kubernetes API, provides strong typing, handles serialization/deserialization, manages authentication and API calls.
Controller	The main control loop logic. Observes resource changes (via Informer), processes them (via Workqueue), and reconciles the cluster's actual state towards the desired state.	Implements the core automation logic for managing resources and maintaining desired states.
CRD	Custom Resource Definition. An API object that allows users to define their own resource types, extending the Kubernetes API.	Enables Kubernetes extensibility, allows managing domain-specific objects natively, forms the basis for the Operator pattern.

5 Frequently Asked Questions (FAQs)

1. What is the fundamental difference between a Kubernetes Controller and a Custom Resource Definition (CRD)? A CRD defines a new type of resource that Kubernetes can store and validate, extending the Kubernetes API. It describes the schema and metadata of your custom object (e.g., MyResource). A Kubernetes Controller, on the other hand, is the logic that brings a resource type to life. It observes instances of a specific resource (whether built-in or custom), compares their actual state to their desired state (defined in the resource's spec), and takes actions to reconcile any differences. So, a CRD is like a blueprint for a new object, while a controller is the active agent that understands and manipulates objects based on that blueprint.

2. Why are Informers and Workqueues essential for building robust Kubernetes controllers? Can't I just poll the API server? Informers and Workqueues are crucial for efficiency and reliability. Polling the API server (repeatedly querying for the current state) is highly inefficient, places significant load on the API server and etcd, and introduces latency in reaction times. Informers establish a single, long-lived "watch" connection to the API server, receiving real-time events for changes. They also maintain a local cache, allowing controllers to query resources without hitting the API server, drastically reducing load. Workqueues decouple the event handling from the reconciliation logic, allowing for concurrent processing, rate-limiting, and graceful error handling with retries, ensuring that transient issues don't lead to permanent inconsistencies and improving the controller's overall resilience.

3. What is the role of the resourceVersion in Kubernetes API interactions, particularly for controllers? The resourceVersion is a string that represents a specific point in time or a specific version of a resource in the Kubernetes cluster. It's an opaque value that the API server uses to ensure optimistic concurrency. When a controller (or any client) performs an update operation, it typically includes the resourceVersion of the object it last retrieved. If the resource on the API server has a different (newer) resourceVersion, it means another client modified the resource since the controller fetched it, and the update request will be rejected with a conflict error. This prevents unintended overwrites and helps controllers ensure they are always working with the most up-to-date information, crucial for maintaining desired state consistency.

4. How does a custom controller manage external resources, and what are Finalizers? A custom controller manages external resources by interacting with their respective APIs based on the desired state defined in its custom resource's spec. For example, a controller for a Database CRD might make API calls to a cloud provider to provision a database instance. Finalizers are a critical mechanism for ensuring graceful cleanup of these external resources when the custom resource (e.g., the Database object) is deleted from Kubernetes. When a resource with finalizers is marked for deletion, Kubernetes doesn't immediately remove it from its database. Instead, it adds a deletion timestamp. The controller detects this timestamp, performs the necessary cleanup of the external resource (e.g., de-provisioning the database), and then removes its finalizer from the custom resource. Only after all finalizers are removed will Kubernetes finally delete the custom resource object. This prevents orphaned external resources and ensures controlled teardown.

5. How does an API Management Platform like APIPark fit into a Kubernetes environment using custom controllers? While Kubernetes controllers excel at managing the internal state of your cluster and the resources within it (including custom ones), an API Management Platform like APIPark addresses the challenges of managing the external consumption and governance of the services and APIs exposed by your applications. If your custom controllers orchestrate services that expose their own APIs (e.g., an AI model inference API, a microservice API), APIPark can provide a unified gateway for these diverse APIs. It centralizes authentication, authorization, traffic management, versioning, and provides a developer portal. This ensures consistency, security, and traceability for all the APIs your Kubernetes-native applications, driven by custom controllers, expose to external consumers, making the entire ecosystem more manageable and robust.

🚀You can securely and efficiently call the OpenAI API on APIPark in just two steps:

Step 1: Deploy the APIPark AI gateway in 5 minutes.

APIPark is developed based on Golang, offering strong product performance and low development and maintenance costs. You can deploy APIPark with a single command line.

curl -sSO https://download.apipark.com/install/quick-start.sh; bash quick-start.sh

In my experience, you can see the successful deployment interface within 5 to 10 minutes. Then, you can log in to APIPark using your account.

Step 2: Call the OpenAI API.