The Wapiti Swagger Parser is a Python library designed to parse Swagger (OpenAPI) specifications and extract the necessary information to generate valid HTTP requests. It focuses on providing a clean, programmatic interface for analyzing Swagger files and creating request templates without relying on external tools for OpenAPI processing.
This library is particularly useful for scenarios where only the request generation requirements are of interest, such as:
- Automated testing and validation of APIs.
- Dynamic request generation for web vulnerability scanners (like Wapiti).
- Custom API tooling.
The library is lightweight, relying only on Python's built-in json
library and PyYAML
for file parsing, and it avoids heavy dependencies on larger OpenAPI frameworks.
pip install wapiti-swagger
- Request Extraction:
- Parses all HTTP requests (methods, paths, parameters) defined in the Swagger file.
- Schema Handling:
- Resolves
$ref
references in schemas, including handling circular references gracefully.
- Resolves
- Custom Types:
- Identifies and retains custom types (e.g., enumerated values, objects) for enhanced request understanding.
- Request Body Generation:
- Automatically generates example request bodies based on schema definitions.
- Metadata Extraction:
- Captures root-level metadata like
host
,basePath
,servers
, andschemes
.
- Captures root-level metadata like
- Supports Swagger 2.0 and OpenAPI 3.x:
- Works with both specification versions seamlessly.
from wapiti_swagger.parser import parse, generate_request_body_from_schema
# Load and parse a Swagger file
parsed = parse("swagger.json")
# List all available requests
for request in parsed.requests:
print(request)
# Generate an example request body for a specific request (here one expecting JSON input)
request_body = generate_request_body_from_schema(
schema=request.parameters[0].schema, # Use the schema of the first parameter
resolved_components=parsed.components
)
print("Example request body:", request_body)
Unlike general-purpose OpenAPI parsers, this library is optimized for specific use cases like generating valid requests for API testing, scanning, or mocking. It is lightweight, customizable, and avoids unnecessary processing of response definitions or additional metadata unrelated to request generation.
Parses a Swagger/OpenAPI specification file and returns a ParsedSwagger
object containing the following:
- Requests: List of
SwaggerRequest
objects extracted from paths. - Components: Preprocessed and resolved components (e.g., schemas, parameters).
- Metadata: High-level metadata like
host
,basePath
, andservers
.
Parameters:
file_path
(str): Path to the Swagger/OpenAPI file (JSON or YAML).
Returns:
ParsedSwagger
: Object containing parsed requests, components, and metadata.
Extracts all HTTP requests from the paths
section of the Swagger specification.
Parameters:
data
(dict): The full Swagger/OpenAPI specification as a dictionary.
Returns:
List[SwaggerRequest]
: A list of requests with details like method, path, parameters, and request bodies.
Extracts parameters from the requestBody
section of a Swagger path operation.
Handles multiple media types (e.g., application/json
, text/json
).
Parameters:
request_body
(dict): TherequestBody
definition for a path operation.
Returns:
List[Parameter]
: A list of parameters with details like media type, schema, and custom type.
Parses a single parameter from the parameters
section of a Swagger path operation.
Parameters:
param
(dict): The parameter definition.
Returns:
Parameter
: Object representing the parameter with details like name, location, type, and schema.
Resolves and preprocesses all components (e.g., schemas, parameters) from the components
section of the Swagger specification.
Parameters:
components
(dict): Thecomponents
section of the Swagger specification.
Returns:
Dict[str, Dict]
: Resolved and preprocessed components organized by type (e.g., schemas, parameters).
Recursively resolves $ref
references in schemas while avoiding circular references.
Parameters:
schema
(dict): The schema to resolve.resolved_components
(dict): Preprocessed components for reference resolution.visited_refs
(set): Tracks references to avoid circular references.
Returns:
dict
: Fully resolved schema.
Extracts high-level metadata from the root of the Swagger specification, such as host
, basePath
, and servers
.
Parameters:
data
(dict): The full Swagger/OpenAPI specification.
Returns:
Dict[str, Any]
: Metadata likehost
,basePath
,schemes
, andservers
.
8. generate_request_body_from_schema(schema: dict, resolved_components: dict) -> Optional[Union[dict, list, str, int, bool]]
Generates an example request body based on a schema definition.
Parameters:
schema
(dict): The schema definition.resolved_components
(dict): Resolved components for reference resolution.
Returns:
Optional[Union[dict, list, str, int, bool]]
: An example request body.