Architecture & Invariants

This page describes the gateway’s internal architecture — the request lifecycle, security boundaries, and the invariants that make multi-tenant isolation structural rather than behavioral.

Read time: ~10 minutes.

Recent hardening:

Policy evaluation now avoids premature deny outcomes when later matching policies allow the operation.
Handler execution paths consistently run command optimization before DB execution.

Request Lifecycle

Every HTTP request flows through a fixed, ordered pipeline:

 Client Request
       │
 ┌─────▼─────────────────────────────────────────────────────┐
 │  1. CORS + Security Headers                               │
 │     X-Content-Type-Options: nosniff                       │
 │     X-Frame-Options: DENY                                 │
 │     Body size limit: 2 MiB                                │
 ├───────────────────────────────────────────────────────────┤
 │  2. Rate Limiter                                          │
 │     Token bucket per IP (configurable burst + rate)       │
 │     → 429 Too Many Requests on exhaustion                 │
 ├───────────────────────────────────────────────────────────┤
 │  3. Authentication                                        │
 │     JWT validation (HS256) → extract tenant_id,           │
 │     user_id, role from claims                             │
 │     Fallback: header-based (dev only, no JWT_SECRET)      │
 ├───────────────────────────────────────────────────────────┤
 │  4. Policy Engine                                         │
 │     YAML-defined per-table, per-role access control       │
 │     Column filtering at AST level                         │
 │     → 403 Forbidden on policy violation                   │
 ├───────────────────────────────────────────────────────────┤
 │  5. Tenant Concurrency Gate                               │
 │     Per-tenant semaphore (configurable permits)           │
 │     Prevents one tenant from consuming all connections    │
 │     → 429 on tenant saturation                            │
 ├───────────────────────────────────────────────────────────┤
 │  6. Connection Acquisition                                │
 │     pool.acquire_with_rls(RlsContext) or                  │
 │     pool.acquire_with_rls_timeout(RlsContext, timeout)    │
 │     Sets PostgreSQL GUCs:                                 │
 │       set_config('app.current_tenant_id', '...', false)   │
 │       set_config('app.current_operator_id', '...', false) │
 │       set_config('app.is_super_admin', '...', false)      │
 ├───────────────────────────────────────────────────────────┤
 │  7. EXPLAIN Pre-Check (reads only)                        │
 │     EXPLAIN on generated SQL before execution             │
 │     Rejects if cost > explain_max_cost                    │
 │              or rows > explain_max_rows                   │
 │     → QUERY_TOO_EXPENSIVE with structured JSON detail     │
 ├───────────────────────────────────────────────────────────┤
 │  8. Query Execution                                       │
 │     AST → SQL transpilation → prepared statement cache    │
 │     PostgreSQL RLS policies filter rows invisibly         │
 │     Result row cap: max_result_rows (configurable)        │
 ├───────────────────────────────────────────────────────────┤
 │  9. Connection Release                                    │
 │     COMMIT → resets txn-local RLS + statement_timeout      │
 │     Prepared statement caches remain hot for reuse         │
 │     Connection returned to pool in clean state            │
 └───────────────────────────────────────────────────────────┘
       │
 JSON Response + X-Request-Id + X-Response-Time

Connection Safety Model

The connection pool enforces a strict lifecycle:

Acquisition

Four public methods:

Method	RLS	Timeout	Use Case
`acquire_with_rls(ctx)`	✅ Set	Default	Normal tenant queries
`acquire_with_rls_timeout(ctx, ms)`	✅ Set	Custom	Gateway with `statement_timeout_ms`
`acquire_system()`	✅ Empty	Default	Schema introspection, migrations
`acquire_raw()` ⚠️	❌ None	Default	Advanced/internal paths that set RLS immediately

Release

Every connection release executes:

COMMIT;   -- Ends txn opened by RLS setup; resets transaction-local GUCs

With transaction-local settings (set_config(..., true) + SET LOCAL), this resets:

RLS context GUCs
statement_timeout

Prepared statement caches are intentionally preserved for performance while tenant context is reset on each release.

Why `acquire_raw()` Is Restricted

acquire_raw() returns a connection with no RLS context. If used for tenant queries, it would bypass row-level security entirely.

Every internal call site must include a // SAFETY: comment explaining why raw acquisition is justified (typically: “RLS context is set immediately on the next line”). This convention is enforced via CI:

# Must return empty — every acquire_raw() must have a SAFETY comment
grep -rn "acquire_raw" pg/src/ | grep -v "// SAFETY:"

COMMIT resets transaction-local RLS state and timeout
New RLS context is set for the new tenant
Query executes under the new tenant context

The integration test test_pool_connection_recycling_isolation verifies this with a pool of size 1, forcing the same physical connection to serve two different tenants sequentially.

4. Bounded Resource Consumption

Resource	Bound	Mechanism
Connections	`max_connections`	Pool size cap
Per-tenant connections	`tenant_max_concurrent`	Semaphore per tenant
Result rows	`max_result_rows`	Row cap per query
Query duration	`statement_timeout_ms`	PostgreSQL `SET LOCAL`
Query cost	`explain_max_cost`	EXPLAIN pre-check
Request body	2 MiB	Axum body limit layer
Cache entries	`max_entries`	moka TinyLFU eviction
Cache entry TTL	`ttl_secs`	moka time-to-live

5. Graceful Shutdown

On SIGTERM or Ctrl+C:

Stop accepting new connections
Wait for in-flight requests to complete
Drain the connection pool
Exit cleanly

No request is silently dropped. No connection is leaked.

Component Map

GatewayState (shared Arc across all handlers)
├── pool: PgPool                    — Connection pool with RLS-aware acquisition
├── config: GatewayConfig           — All configuration (qail.toml + env overrides)
├── policy_engine: PolicyEngine     — YAML-defined per-table, per-role policies
├── schema: SchemaRegistry          — Auto-discovered table/column/FK metadata
├── cache: QueryCache               — moka-backed LRU with table-level invalidation
├── rate_limiter: RateLimiter       — Token bucket rate limiting
├── explain_cache: ExplainCache     — Cached EXPLAIN results per query shape
├── explain_config: ExplainConfig   — Cost/row thresholds for EXPLAIN pre-check
├── tenant_semaphore: TenantSemaphore — Per-tenant concurrency limiter
├── event_engine: EventTriggerEngine  — Webhook triggers on mutations
└── user_operator_map: HashMap      — JWT user_id → tenant_id resolution cache

Where to Hook Observability

Signal	Endpoint / Header	Format
Metrics	`GET /metrics`	Prometheus
Request ID	`X-Request-Id` response header	UUID
Latency	`X-Response-Time` response header	Duration string
Health	`GET /health`	`200 OK` or error
Schema	`GET /api/_schema`	JSON
OpenAPI	`GET /api/_openapi`	OpenAPI 3.0 JSON
Cache stats	Via `/metrics`	hit/miss/entries/weighted_size

QAIL Documentation

Architecture & Invariants

Request Lifecycle

Connection Safety Model

Acquisition

Release

Why `acquire_raw()` Is Restricted

Core Invariants

1. Fail-Closed RLS

2. Cost-Bounded Execution

3. Tenant Isolation on Connection Reuse

4. Bounded Resource Consumption

5. Graceful Shutdown

Component Map

Where to Hook Observability

Keyboard shortcuts

QAIL Documentation