An Update on OpenAPI in Openstack

I previously presented on our work to bring OpenAPI to OpenStack as part of the 2024 OpenInfra Summit Asia, the slides for which you can find here. Since that talk, another release cycle has come and gone and our work has continued to progress. Below is a summary of the current “state of play” for OpenAPI support in OpenStack and a reminder of our long-term goals in the area.

Overview

The work to add OpenAPI schemas can be broken into two parts: adding schemas to services and writing tooling to consume these schemas and generate both documentation and clients.

Services

The tl;dr: of this section is that we are relying on the fact that OpenAPI 3.1 is a superset of JSON Schema Draft 2020-12 and are adding JSON Schema schemas to as many services as possible. When put this way, it sounds pretty simple but as is often the case, the devil is in the detail. OpenStack is made up of multiple independent services maintained by different groups of people, and while the Oslo project has helped ensure some level of consistency around things like configuration (oslo.conf), database connectivity (oslo.db) and messaging (oslo.messaging), there is huge variance in the API frameworks used:

Nova, Cinder, Glance, and Manila use a combination of Routes (for routing), WebOb (for request/response models), Paste (for application dispatch) and PasteDeploy (for middleware).
Neutron uses the same Routes + WebOb + Paste + PasteDeploy combo used by Nova, Cinder, Glance, and Manila, but with the addition of Pecan in place of the “homegrown” WSGI frameworks used in those projects.
Keystone uses Flask and Flask-Restful.
Swift uses PasteDeploy (but I admittedly know very little about Swift).
Ironic uses Pecan (which in turn uses WebOb, so you’ll see references to this too)
etc etc…

Fortunately, despite the wide differences in frameworks, they pretty much all have the same building blocks:

Controllers (or Applications) which are responsible for a given resource and handles calls for same. These are typically classes with methods for each HTTP verb that the API supports.
Middleware that inspects and/or modifies requests and responses.
Routers that map a HTTP request to a controller. This is typically implemented as a special type of middleware.

In this model, both parsing of requests and generation of responses happens in the controller methods, which makes this a natural place to add validation for same. And there are a few things we want to validate:

API version (for services that uses API microversions)
Request path and query string parameters
Request and response bodies

By wrapping the controller methods with decorators, we we are able to inspect both the request or response objects, comparing them against schemas for same.

Tooling

Having schemas in place for all services is of little help if we don’t do anything with them. To this end, we want tooling that can inspect services and extract their JSON Schema schemas, combining them to produce OpenAPI schemas.

Once we have these OpenAPI schemas, we can start generating (self-validating) documentation and clients/libraries. For the former, OpenAPI will replace the os-api-ref Sphinx extension currently used across OpenStack. os-api-ref allows us to describe our APIs in reStructuredText and it is a tool that has worked relatively well for us, but the lack of machine readability means it’s hard to validate against the code. For the latter, we hope to lessen the burden of maintaining libraries and clients in multiple languages, as this has proven very challenging to do given the very large number of OpenStack APIs in multiple. Once again, OpenAPI is better suited to this challenge than os-api-ref or anything else we have.

Current status and future plans

Nova

Nova started from the best position, since it was already using JSON Schema for request validation. However, there were a couple of issues to overcome:

The version of JSON Schema used for the schemas, Draft 04, is over 12 years old, meaning the schemas needed migrating to Draft 2020-12.
There were two different mechanisms for API versioning: if-else checks inside the controller methods, plus a decorator that relied on the descriptor protocol to allow us to define version-specific controller methods. The latter made inspection of the API router more difficult than necessary and had to be replaced.
While many APIs had request query string and request body schemas, not all of them did.
Most importantly, there were no schemas for response bodies.

Most of these have now been resolved. We have bumped our schemas to Draft 2020-12, we have added all the missing request schemas, and we have added response body schemas for a number of resources. The outstanding changes to address the API versioning issues and add the remaining response body schemas are all ready and just waiting for review, so with any luck we will be able to close this out in the 2025.2 (Flamingo) release. Once this work has merged, the final step will be to start generating api-ref documentation from the OpenAPI schemas instead of using the os-api-ref tool we currently use. The patches for Nova can be found here.

Cinder

Cinder was in a very similar position to Nova thanks to their shared lineage. This means it had the same advantage - pre-existing use of JSON Schema for request validation - and the same issues. However, it also had the added issue of having a mountain of tech debt mainly related to the support for multiple API versions that Cinder offered until relatively recently. This has necessitated a lot of additional cleanup patches to do things like remove the cinder.api.v2 module and consolidate everything under cinder.api.v3.

Unfortunately, while the bulk of the changes have been written, none of them have merged. I’m hoping we can double down on this in 2025.2 (Flamingo) release and start making some progress. The patches for Cinder can be found here.

Keystone

Once again, Keystone had the good fortune of having existing use of JSON Schema for request query string and request body validation, but this was done inconsistently and didn’t cover response body schemas. Most of the work here has focused on adding these missing schemas and making the router easier to inspect by moving validation from inside the controller methods to decorators and splitting methods that previously handling multiple endpoints (e.g. GET /foo and GET /foo/123) into multiple single-purpose methods.

Many resources have been fully specced by now but some remain. In addition, there are some decisions to be made regarding support for undocumented API options that have been (re)discovered during this work. None of it should be insurmountable, however. The patches for Keystone can be found here.

Manila

Yet another service with JSON Schema already in place. Once again, the work here consists of adding missing schemas and making the router easier to inspect. A small number of patches have merged here and more are in-flight. The patches for Manila can be found here.

Ironic

Ironic hasn’t historically used JSON Schema for validation, instead using its own homegrown validation framework and taking advantage of some Pecan functionality to maintain API versioning. As a result, there’s quite a bit of work needed to get schemas into Ironic:

Move API versioning to decorators
Rework request path, query string and body parameter validation to use JSON Schema schemas (via decorators) instead of the homegrown framework
Add request body parameter validation

This work is in very early stages, but there are people working on it and there appears to be broad buy-in from the Ironic team, so I hope to see significant progress over the course of the 2025.2 (Flamingo) cycle. The patches for Ironic can be found here.

Other services (Neutron, Swift, Glance, Octavia, …)

To the best of my knowledge, no work has been started in other projects. Of the “core” projects, I expect the Glance effort to be relatively small since they already have formal schemas (exposed via the API!) for most of their resources. Conversely, I expect the Neutron effort to be both large and complicated, given the number and highly dynamic nature of Neutron’s API, driven by it’s extension-based “versioning” system. I have somehow never worked on Swift and haven’t a clue about that.

Tooling

There are two tooling-related efforts ongoing:

Development of the codegenerator project.
Improvements to the openstacksdk project, python-openstackclient project, and related projects.

The codegenerator project provides a collection of utilities for generating OpenAPI schemas and a (Rust-based 🦀) client from these schemas. Development on this tool first started in earnest during the 2024.1 (Caracal) cycle, but the tool has continued to evolve over the course of the 2024.2 (Dalmatian) and 2025.1 (Epoxy) cycles. I expect to see more progress as schemas continue to get added to the Nova project. If you’re interested in learning more about this work, I’d encourage to review the following material from my upstream colleague, Artem Goncharov (gtema), who is driving much of it.

The openstacksdk and python-openstackclient projects, meanwhile, are far more established. As a quick reminder, the openstacksdk project provides the primary OpenStack client library, while the python-openstackclient project is the primary OpenStack CLI. In recent releases, we’ve been undertaking work to prepare them for a future when they can both be at least partially auto-generated. This has taken the form of the addition of typing and a more general “removal of weirdness” goal.

The addition of typing is probably easier to grok and easier to measure. To achieve this, we have been adding type hints to openstacksdk and python-openstackclient, as well as dependencies of same. As of the 2025.1 (Epoxy) release, we have full type coverage for keystoneauth (a dependency of openstacksdk) and for both cliff and osc-lib (dependencies of python-openstackclient). We also have initial coverage of both openstacksdk and python-openstackclient themselves, with more comprehensive coverage of some of the non-core aspects of the former. Completely typing openstacksdk in particular though is more challenging due to the aforementioned weirdness, but we will continue to make progress on this over the 2025.2 (Flamingo) cycle.

The “removal of weirdness” goal is, as we noted, a little more general and cover a few related issues we’re facing. The two biggest of these, however, is (a) the over-use of variadic arguments and keyword arguments (*args and **kwargs) in the proxy and cloud layers in particular, and (b) the munging of request and response body parameters in the Resource class used throughout openstacksdk. The former is hopefully self-explanatory so let’s focus on the latter. By way of a demonstration of the issue, consider the compute service’s /servers API. This API expects you to pass networks and block_device_mapping_v2 fields in the request for POST /servers, to configure networking and block devices respectively. However, it returns this information - in a different form, no less - via the addresses and os-extended-volumes:volumes_attached in response to GET /servers/{serverID} and GET /servers/detail. Because we currently munge the request and response body fields, our Server class therefore looks like this (in highly minimized form):

class Server(resource.Resource):
    addresses = resource.Body('addresses', type=dict)
    attached_volumes = resource.Body(
        'os-extended-volumes:volumes_attached',
        aka='volumes',
        type=list,
        list_type=volume_attachment.VolumeAttachment,
        default=[],
    )
    block_device_mapping = resource.Body('block_device_mapping_v2')
    networks = resource.Body('networks')

But if you attempt to set either addresses or attached_volumes before a Server.create call, you will get an error. On the other hand, if you do a Server.fetch or Server.list call, the block_device_mapping and networks attributes will always be set to None since they don’t form part of the response body. This has necessitated a whole load of special casing in python-openstackclient, and special casing is bad for things you wish to auto-generate. By separating these, we make it easier to type and easier to auto-generate. While a number of PoCs have been produced, none have been implemented yet. We expect work on this to continue during the 2025.2 (Flamingo) cycle.

Questions?

Come discuss these topics in the OpenStack Virtual PTG, running from 07-11 April 2025. More information of the event and sessions can be found at ptg.opendev.org.