vgi-rpc Wire Protocol Specification¶
Version: 1 Status: Normative Audience: Cross-language implementors (Go, Rust, TypeScript, C++, etc.)
This document specifies the vgi-rpc wire protocol at byte level. A conforming implementation can interoperate with the Python reference without reading Python source code. The protocol is transport-agnostic; specific transport bindings (pipe, HTTP, shared memory) are described in later sections.
1. Overview & Conventions¶
vgi-rpc is an RPC framework where:
- Serialization uses Apache Arrow IPC Streaming Format.
- All integers are little-endian unless stated otherwise.
- All metadata strings are UTF-8 encoded.
- Wire protocol version:
"1"(the single ASCII byte0x31). - Metadata keys and values in Arrow IPC custom metadata are byte strings. Keys in the
vgi_rpc.*namespace are framework-reserved.
Terminology¶
| Term | Definition |
|---|---|
| IPC stream | A complete Arrow IPC streaming-format message sequence: schema message, zero or more record batch messages, terminated by an EOS marker. |
| Batch | An Arrow RecordBatch — zero or more rows conforming to a schema. |
| Custom metadata | Per-batch KeyValueMetadata attached to individual record batches within an IPC stream (distinct from schema-level metadata). |
| Zero-row batch | A batch with num_rows == 0. Used for log messages, error signals, pointer batches, and stream-completion markers. |
| Data batch | A batch with num_rows > 0, or a zero-row batch that lacks log/error metadata keys (e.g., void return). |
2. Arrow IPC Framing¶
Each logical message exchange uses one or more IPC streams written sequentially on the same byte stream (pipe, TCP socket, HTTP body, etc.).
An IPC stream consists of:
- Schema message — describes the columns and their Arrow types.
- Zero or more RecordBatch messages — each optionally carrying per-batch custom metadata.
- EOS marker — the 8-byte sequence
0xFF 0xFF 0xFF 0xFF 0x00 0x00 0x00 0x00(continuation token0xFFFFFFFFfollowed by 4 zero bytes for metadata length).
Multiple IPC streams are written sequentially on the same underlying byte stream. Each reader opens one stream, reads until EOS, and stops. The next reader picks up immediately after the EOS marker.
Refer to the Apache Arrow IPC specification for the byte-level encoding of schema messages, record batch messages, dictionary messages, and the encapsulated message format.
3. Metadata Key Reference¶
All framework-reserved metadata keys, their wire-format byte representations, where they appear, and their semantics:
Request metadata (on the request batch's custom metadata)¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.method |
UTF-8 method name | Target RPC method to invoke. Required. |
vgi_rpc.request_version |
"1" (ASCII 0x31) |
Wire protocol version. Required. |
vgi_rpc.request_id |
UTF-8 string (16-char hex) | Per-request correlation ID. Optional; if absent, the server generates a new 16-char hex ID. |
traceparent |
W3C Trace Context string | OpenTelemetry trace propagation. Optional. |
tracestate |
W3C Trace Context string | OpenTelemetry trace state. Optional. |
vgi_rpc.shm_segment_name |
UTF-8 OS name | Shared memory segment name (session-level). Optional. |
vgi_rpc.shm_segment_size |
Decimal integer string | Shared memory segment total size in bytes. Optional. |
Response / log / error metadata (on response batch custom metadata)¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.log_level |
One of: EXCEPTION, ERROR, WARN, INFO, DEBUG, TRACE |
Severity level. Present on log and error batches. |
vgi_rpc.log_message |
UTF-8 string | Human-readable message text. |
vgi_rpc.log_extra |
JSON string | Additional structured data. Optional. |
vgi_rpc.server_id |
UTF-8 string (12-char hex) | Server instance identifier for distributed tracing. |
vgi_rpc.request_id |
UTF-8 string | Echoed request correlation ID. |
Stream state (HTTP transport)¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.stream_state |
Opaque binary (signed token) | Serialized stream state for stateless HTTP exchanges. |
Shared memory pointer batch metadata¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.shm_offset |
Decimal integer string | Absolute byte offset in the SHM segment. |
vgi_rpc.shm_length |
Decimal integer string | Number of bytes of the serialized batch. |
vgi_rpc.shm_source |
UTF-8 SHM segment name | Provenance indicator on resolved batches (diagnostics). |
External storage pointer batch metadata¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.location |
UTF-8 URL | URL to fetch the externalized batch data. |
vgi_rpc.location.fetch_ms |
Decimal float string (e.g. "42.3") |
Fetch duration in milliseconds (diagnostics, on resolved batches). |
vgi_rpc.location.source |
UTF-8 URL | Original fetch URL (diagnostics, on resolved batches). |
Introspection batch metadata (on __describe__ response batch custom_metadata)¶
| Key (bytes) | Value | Description |
|---|---|---|
vgi_rpc.protocol_name |
UTF-8 string | Protocol class name. |
vgi_rpc.request_version |
"1" |
Wire protocol version. |
vgi_rpc.describe_version |
"2" |
Introspection format version. |
vgi_rpc.server_id |
UTF-8 string | Server instance identifier. |
4. Type Mapping¶
RPC method parameters and return values are serialized as Arrow columns. The following table defines the canonical mapping from abstract types to Arrow types. Cross-language implementations MUST use these Arrow types for interoperability.
| Abstract type | Arrow type | Serialization notes |
|---|---|---|
string |
utf8 |
UTF-8 encoded. |
bytes / binary |
binary |
Raw byte sequence. |
int / integer |
int64 |
64-bit signed integer. |
float / double |
float64 |
IEEE 754 double precision. |
bool |
bool |
— |
list[T] |
list(T) |
Recursive. |
dict[K, V] / map |
map(K, V) |
Serialized as list of (key, value) tuples. Deserialized back to map/dict. |
frozenset[T] / set[T] |
list(T) |
Serialized as list (order undefined). Deserialized back to set. |
enum |
dictionary(int16, utf8) |
Serialized as the enum member name (string). Deserialized by name lookup. |
optional[T] / T? |
Same as T, but with nullable = true on the Arrow field. |
null represents the absent value. |
dataclass (nested) |
binary |
Serialized as a complete Arrow IPC stream (schema + 1-row batch + EOS) in a binary column. See note below. |
Nested dataclass type context: This
binarymapping applies at the RPC method parameter/return level — each dataclass parameter or return value is a binary blob containing a serialized IPC stream. Within that IPC stream, the dataclass's own Arrow schema uses the same type mapping for primitive fields (string→utf8, int→int64, etc.), but sub-dataclass fields use Arrowstructtype (notbinary), since they are embedded inline rather than serialized as separate IPC streams.
Serialization transforms¶
When writing a value to an Arrow column:
- Enum → write the member's name as a UTF-8 string (not its value).
- dict → convert to a list of
(key, value)tuples, then write asmap(K, V). - frozenset/set → convert to a list, then write as
list(T). - Nested dataclass → serialize to Arrow IPC bytes, write as
binary.
When reading:
- Enum → look up the string by member name (
Enum["NAME"]). Name-based lookup is the normative wire format. (The Python reference implementation also supports a value-based fallback internally for nested dataclass fields, but cross-language implementations need only implement name-based lookup.) - map → convert list of tuples back to dict.
- list (when target is set) → convert to frozenset/set.
- binary (when target is dataclass) → deserialize from Arrow IPC bytes.
5. Request Batch Format¶
Every RPC request is a single IPC stream containing exactly one batch with one row:
IPC Stream:
Schema message:
- One field per method parameter, named after the parameter
- Field types per the type mapping (Section 4)
- Optional parameters have nullable = true
RecordBatch message:
- Exactly 1 row
- custom_metadata:
vgi_rpc.method = "<method_name>" (REQUIRED)
vgi_rpc.request_version = "1" (REQUIRED)
vgi_rpc.shm_segment_name = "<name>" (optional, SHM transport)
vgi_rpc.shm_segment_size = "<size>" (optional, SHM transport)
traceparent = "<W3C trace context>" (optional)
tracestate = "<W3C trace state>" (optional)
EOS marker
For methods with no parameters, the schema has zero fields and the batch has one row with zero columns.
Row count validation: The server only enforces
num_rows == 1when the schema has one or more fields. For zero-field (parameterless) methods, the server accepts batches with any row count (including 0). Conforming clients SHOULD send 1 row for consistency.
Worked example¶
Given an RPC method add(a: float, b: float) -> float:
Schema:
Batch (1 row, calling add(a=1.0, b=2.0)):
Column "a": [1.0]
Column "b": [2.0]
custom_metadata: {
"vgi_rpc.method": "add",
"vgi_rpc.request_version": "1"
}
Default values: When a parameter has a default and the caller omits it, the client merges the default into the kwargs before serialization. The server sees a complete row in all cases.
6. Response Format (Unary)¶
A unary response is a single IPC stream on the result schema:
IPC Stream:
Schema message:
- For methods returning a value: single field named "result"
- For void methods (-> None): zero fields (empty schema)
0..N log batches (zero-row, with log metadata — see Section 8)
1 result or error batch:
- Result: 1-row batch with the return value in column "result"
- Void: 0-row batch on empty schema
- Error: 0-row batch with EXCEPTION-level log metadata (see Section 8)
EOS marker
Log batches MUST appear before the result/error batch. They share the same schema as the result batch (the zero-row log batches conform to the response stream's schema).
Void return¶
When the method has no return value (-> None), the response schema is
empty (pa.schema([])) and the result batch has zero rows and zero columns.
7. Batch Classification Algorithm¶
When receiving any batch from a response stream, classify it using this decision tree:
receive(batch, custom_metadata):
IF custom_metadata is NULL:
→ DATA batch
IF batch.num_rows > 0:
→ DATA batch
// At this point: num_rows == 0 AND custom_metadata exists
IF custom_metadata contains "vgi_rpc.log_level"
AND custom_metadata contains "vgi_rpc.log_message":
level = custom_metadata["vgi_rpc.log_level"]
IF level == "EXCEPTION":
→ ERROR batch → raise RpcError (see Section 8)
ELSE:
→ LOG batch → deliver to on_log callback
IF custom_metadata contains "vgi_rpc.location":
→ EXTERNAL POINTER batch → resolve via URL fetch (see Section 12)
IF custom_metadata contains "vgi_rpc.shm_offset":
→ SHM POINTER batch → resolve via shared memory (see Section 11)
IF custom_metadata contains "vgi_rpc.stream_state":
→ STATE TOKEN batch → stream continuation (see Section 10)
// Zero-row batch with unrecognized metadata
→ DATA batch (e.g., void return, stream-finish marker)
Note: Log-level keys take priority. A zero-row batch that has both
vgi_rpc.log_levelandvgi_rpc.shm_offset(orvgi_rpc.location) is classified as a log batch, not a pointer. This is by design — pointer detection explicitly excludes batches with log-level keys. A batch cannot be both an external pointer and an SHM pointer simultaneously, so the check order between those two does not matter functionally.
8. Log & Error Batch Format¶
Log batches¶
A log batch is a zero-row batch on the response stream's schema, with the following custom metadata keys:
| Key | Required | Value |
|---|---|---|
vgi_rpc.log_level |
Yes | One of: EXCEPTION, ERROR, WARN, INFO, DEBUG, TRACE |
vgi_rpc.log_message |
Yes | Human-readable message text (UTF-8) |
vgi_rpc.log_extra |
No | JSON object with additional structured data |
vgi_rpc.server_id |
No | Server instance identifier |
vgi_rpc.request_id |
No | Request correlation ID |
Error batches (EXCEPTION level)¶
When vgi_rpc.log_level is "EXCEPTION", the batch represents a server-side
error. The client MUST raise/throw an error with the following fields
extracted from the metadata:
- error_type:
log_extra.exception_type(string) or the level string"EXCEPTION"as fallback. - error_message:
vgi_rpc.log_messagevalue. - remote_traceback:
log_extra.traceback(string) or empty string. - request_id:
vgi_rpc.request_idvalue or empty string.
log_extra JSON structure for EXCEPTION¶
{
"exception_type": "ValueError",
"exception_message": "invalid input",
"traceback": "Traceback (most recent call last):\n ...",
"frames": [
{
"file": "/path/to/module.py",
"line": 42,
"function": "my_method",
"code": "raise ValueError('invalid input')"
}
],
"cause": "Traceback ... (optional, from __cause__)",
"context": "Traceback ... (optional, from __context__)"
}
| Field | Type | Description |
|---|---|---|
exception_type |
string | Exception class name. |
exception_message |
string | str(exception). |
traceback |
string | Formatted traceback. Truncated at 16,000 characters with "\n… <traceback truncated>" suffix. |
frames |
array of objects | Last 5 stack frames (most recent at end). |
frames[].file |
string | Source file path. |
frames[].line |
integer | Line number. |
frames[].function |
string | Function/method name. |
frames[].code |
string or null | Source code at that line. |
cause |
string (optional) | Formatted __cause__ traceback. Truncated at 16,000 chars. |
context |
string (optional) | Formatted __context__ traceback (only when not suppressed). Truncated at 16,000 chars. |
Non-exception log batches¶
For levels other than EXCEPTION, the log_extra JSON structure is
freeform — it contains whatever key-value pairs the server method attached.
Clients should deliver these to the on_log callback without attempting to
parse them as error structures.
9. Stream Protocol (Pipe / Subprocess Transport)¶
Streaming methods use a multi-phase exchange over a bidirectional byte stream (two pipes: client→server and server→client).
Phase 1: Request parameters¶
Identical to a unary request (Section 5):
Phase 1.5: Optional header stream¶
Only present when the stream method declares a header type. Sent by the server immediately after reading the request, before the main data exchange.
The header is a single-row batch containing serialized header data. If the method does not declare a header type, this phase is skipped entirely.
If the server encounters an error during method initialization, it writes an
error stream (with EXCEPTION-level metadata) on the empty schema in place
of the header stream. The empty schema is used because the header schema may
not be available when the error occurs (e.g., the method raised before
returning a Stream object). Clients MUST be prepared to receive an
empty-schema error stream where a header stream was expected.
Phase 2: Lockstep data exchange¶
Both directions use a single long-lived IPC stream each:
Client → Server: IPC stream (input_schema, batch₁, batch₂, ..., EOS)
Server → Client: IPC stream (output_schema, [log*+data]₁, [log*+data]₂, ..., EOS)
The exchange is lockstep: the client writes one input batch, then reads the server's response (zero or more log batches followed by exactly one data batch). This repeats until termination.
Producer streams¶
- Input schema: empty (
pa.schema([])) — the client sends zero-row "tick" batches as timing signals. - Output: the server produces one data batch per tick.
- Termination: the server signals completion by not writing a data batch
after the final log batches — the output IPC stream reaches EOS. The
client detects this as
StopIteration. - Client-initiated close: the client closes its input IPC stream (writes EOS). The server detects this as end of input and stops producing.
Client Server
| |
|--- tick (0-row, empty) ------->|
|<------ log* + data batch₁ ----|
|--- tick (0-row, empty) ------->|
|<------ log* + data batch₂ ----|
|--- tick (0-row, empty) ------->|
|<------ log* + [EOS] ----------| (server called finish())
|--- [EOS] --------------------->|
Exchange streams¶
- Input schema: a real schema matching the exchange input type.
- Output: the server produces one data batch per input batch.
- Termination: the client closes its input stream (EOS). The server drains remaining input and closes the output stream.
Client Server
| |
|--- input batch₁ ------------->|
|<------ log* + output batch₁ --|
|--- input batch₂ ------------->|
|<------ log* + output batch₂ --|
|--- [EOS] --------------------->|
|<------ [EOS] -----------------|
Error during streaming¶
If the server encounters an error during process(), it writes an
EXCEPTION-level log batch on the output stream, then the output stream
reaches EOS. The client reads the error batch, raises RpcError, and
the session is closed.
10. HTTP Transport¶
The HTTP transport maps the pipe-based protocol to stateless HTTP request/response pairs. Streaming state is serialized into signed tokens passed between exchanges.
Content type¶
All requests and responses use:
The server MUST reject requests with any other Content-Type with HTTP 415 (Unsupported Media Type).
Endpoints¶
Given a configurable URL prefix (default /vgi):
| Endpoint | HTTP Method | Description |
|---|---|---|
{prefix}/{method} |
POST | Unary RPC call |
{prefix}/{method}/init |
POST | Stream initialization (producer and exchange) |
{prefix}/{method}/exchange |
POST | Stream continuation / exchange |
{prefix}/__describe__ |
POST | Introspection (unary) |
{prefix}/__upload_url__/init |
POST | Upload URL generation (when enabled) |
{prefix}/__capabilities__ |
OPTIONS | Server capability discovery |
Capability discovery¶
The OPTIONS {prefix}/__capabilities__ endpoint returns server capabilities
as HTTP response headers only — there is no Arrow IPC body. Clients parse
the following headers:
| Header | Type | Description |
|---|---|---|
VGI-Max-Request-Bytes |
Integer | Maximum request body size the server will accept. |
VGI-Upload-URL-Support |
"true" |
Present when the upload URL endpoint is available. |
VGI-Max-Upload-Bytes |
Integer | Maximum upload size for externalized batches. |
Upload URL generation¶
When the server has an upload_url_provider configured, the
POST {prefix}/__upload_url__/init endpoint generates pre-signed
upload/download URL pairs for client-side externalization.
Request: Standard unary request with vgi_rpc.method = "__upload_url__".
| Parameter | Arrow type | Default | Description |
|---|---|---|---|
count |
int64 |
1 | Number of URL pairs to generate (1–100). |
Response schema:
| Column | Arrow type | Nullable | Description |
|---|---|---|---|
upload_url |
utf8 |
No | Pre-signed URL for uploading batch data. |
download_url |
utf8 |
No | Pre-signed URL the server uses to fetch the uploaded data. |
expires_at |
timestamp("us", tz="UTC") |
No | Expiration time of the pre-signed URLs. |
The response has one row per requested URL pair.
Request headers¶
| Header | Description |
|---|---|
Content-Type |
MUST be application/vnd.apache.arrow.stream |
X-Request-ID |
Optional. Correlation ID echoed on response. If absent, server generates one. |
Response headers¶
| Header | Description |
|---|---|
X-Request-ID |
Echoed or generated request correlation ID. |
VGI-Max-Request-Bytes |
Server-advertised maximum request body size (optional). |
VGI-Upload-URL-Support |
"true" when upload URL endpoint is available (optional). |
VGI-Max-Upload-Bytes |
Server-advertised maximum upload size (optional). |
Unary call (HTTP)¶
POST {prefix}/{method}
Request body: IPC stream (params_schema, 1 request row, EOS)
Response body: IPC stream (result_schema, 0..N log batches, 1 result/error batch, EOS)
HTTP 200: Success (even when the response contains an error batch)
HTTP 400: Protocol error (bad IPC, missing metadata, param validation failure)
HTTP 401: Authentication failure (plain-text body, NOT Arrow IPC)
HTTP 404: Unknown method
HTTP 415: Wrong Content-Type
HTTP 500: Server implementation error
The method name in the URL path MUST match the vgi_rpc.method value in
the request batch's custom metadata. A mismatch is a 400 error.
Stream initialization (HTTP)¶
The response depends on whether the stream is a producer or exchange stream:
Producer stream init response¶
The response body contains the complete producer output:
Response body:
[IPC stream: header_schema, 0..N log batches, 1 header row, EOS] (if header declared)
[IPC stream: output_schema, (log* + data)*, EOS]
All produced data batches are included inline. If the response would exceed
max_stream_response_bytes, the server truncates the output and appends a
continuation batch: a zero-row batch with vgi_rpc.stream_state in its
custom metadata. The client then follows up with /exchange requests.
Exchange stream init response¶
Response body:
[IPC stream: header_schema, 0..N log batches, 1 header row, EOS] (if header declared)
[IPC stream: output_schema, 0..N log batches, 1 zero-row batch with state token, EOS]
The zero-row batch carries the signed state token in
vgi_rpc.stream_state custom metadata.
Stream exchange (HTTP)¶
POST {prefix}/{method}/exchange
Request body: IPC stream (input_schema, 1 input batch with state token in metadata, EOS)
Response body: IPC stream (output_schema, 0..N log batches, 1 data batch with updated state token, EOS)
The request batch's custom metadata MUST contain vgi_rpc.stream_state
with the current state token.
For producer continuation, the input is a zero-row batch on empty schema with the state token. The response may contain multiple data batches and may end with another continuation token.
For exchange, the input carries real data plus the state token. The response data batch carries an updated state token for the next exchange.
The client MUST strip vgi_rpc.stream_state from the batch metadata before
exposing it to application code.
State token binary format¶
The state token is an opaque signed blob with the following wire format (v2):
Offset Size Field
0 1 version: uint8 (currently 2)
1 8 created_at: uint64 LE (seconds since Unix epoch)
9 4 state_len: uint32 LE
13 N state_bytes: Arrow IPC stream of the StreamState dataclass
13+N 4 schema_len: uint32 LE
17+N M schema_bytes: serialized output pa.Schema
17+N+M 4 input_schema_len: uint32 LE
21+N+M P input_schema_bytes: serialized input pa.Schema
21+N+M+P 32 HMAC-SHA256(signing_key, all preceding bytes)
created_at: Token creation time as seconds since the Unix epoch. Used by the server to enforce a configurable TTL (token_ttl). Whentoken_ttl > 0, tokens older thantoken_ttlseconds are rejected with HTTP 400 ("State token expired"). Settoken_ttlto0to disable expiry checking. The default TTL is 3600 seconds (1 hour).state_bytes: The stream state dataclass serialized as a complete Arrow IPC stream (schema + 1-row batch + EOS).schema_bytes: The output Arrow schema serialized viapa.Schema.serialize().input_schema_bytes: The input Arrow schema serialized viapa.Schema.serialize(). For producer streams, this is the serialized empty schema.- HMAC: SHA-256 HMAC over the entire payload (version byte through input_schema_bytes), using the server's signing key.
Verification order: The HMAC MUST be verified before inspecting any payload fields (including the version byte and timestamp) to prevent information leakage. TTL enforcement happens after HMAC verification and version check.
Authentication (HTTP)¶
When the server has an authenticate callback configured:
- The callback receives the HTTP request and returns an
AuthContext. - On failure (
ValueErrororPermissionError), the server returns HTTP 401 with a plain-text body (NOT Arrow IPC), because no method has been resolved yet and no output schema is available. - Other exceptions from the callback propagate as HTTP 500.
- Clients MUST detect 401 responses before attempting to parse Arrow IPC.
11. Shared Memory (SHM) Transport¶
The shared memory side-channel enables zero-copy batch transfer between co-located processes. It is used alongside a pipe transport — the pipe carries control messages and small batches; large batches are written to shared memory and replaced with pointer batches on the pipe.
Segment header format¶
The shared memory segment begins with a 64 KiB (65,536 byte) header, followed by a data region. All integers are little-endian.
Offset Size Field
0 4 magic: bytes "VGIS" (0x56 0x47 0x49 0x53)
4 4 version: uint32 = 1
8 8 data_size: uint64 (segment size minus 65536)
16 4 num_allocs: uint32 (number of active allocations)
20 4 padding: uint32 = 0
24 N*16 allocations: array of (offset: uint64, length: uint64)
sorted by offset, where N = num_allocs
- Maximum allocations:
(65536 - 24) / 16 = 4094. - Offsets are absolute — measured from the start of the shared memory segment (not from the data region start).
- Data region starts at byte offset 65,536 (immediately after the header).
Allocation strategy¶
The allocator uses a first-fit strategy with implicit coalescing:
- Scan the sorted allocation list for the first gap that fits the requested size.
- Gaps are computed as: before the first allocation (from offset 65536), between consecutive allocations, and after the last allocation (to segment end).
- New allocations are inserted to maintain sorted order.
- Freeing an allocation removes its entry; adjacent free space coalesces implicitly since only occupied regions are tracked.
Batch serialization in SHM¶
For non-dictionary-encoded batches: A complete Arrow IPC stream (schema + record batch + EOS) is written directly into the allocated SHM region.
For dictionary-encoded batches: The IPC stream is written to a temporary buffer, then the schema message and EOS marker are stripped — only the dictionary messages and record batch message are stored in SHM.
Dictionary batch reconstruction (reader side):
To deserialize a dictionary-encoded batch from SHM:
- Serialize the pointer batch's schema into a schema message by creating a temporary IPC stream writer (which emits a schema message + EOS), then strip the trailing 8-byte EOS marker. This yields the schema message bytes.
- Concatenate:
schema_message_bytes+shm_stored_bytes+EOS_marker(8 bytes:0xFF 0xFF 0xFF 0xFF 0x00 0x00 0x00 0x00). - Open the concatenated buffer as a standard Arrow IPC stream and read the batch.
Non-dictionary batches do not need this reconstruction — they are stored as complete IPC streams and can be read directly.
SHM pointer batch¶
A batch stored in shared memory is replaced on the pipe with a pointer batch:
- num_rows: 0
- Schema: Same as the original batch's schema.
- Custom metadata:
vgi_rpc.shm_offset: Absolute byte offset in the segment (decimal string).vgi_rpc.shm_length: Number of bytes written (decimal string).
SHM segment identity in request metadata¶
When a client owns a shared memory segment, it advertises the segment in the request batch's custom metadata:
vgi_rpc.shm_segment_name: OS name of the shared memory segment.vgi_rpc.shm_segment_size: Total segment size in bytes (decimal string).
The server dynamically attaches to the segment (read-only, untracked by the resource tracker) for the duration of the request.
Resolution algorithm¶
resolve_shm_batch(batch, custom_metadata, shm_segment):
IF shm_segment is NULL:
return (batch, custom_metadata, null)
IF batch.num_rows != 0:
return (batch, custom_metadata, null)
IF custom_metadata is NULL:
return (batch, custom_metadata, null)
IF "vgi_rpc.shm_offset" NOT IN custom_metadata:
return (batch, custom_metadata, null)
IF "vgi_rpc.log_level" IN custom_metadata:
return (batch, custom_metadata, null) // log batch, not pointer
offset = int(custom_metadata["vgi_rpc.shm_offset"])
length = int(custom_metadata["vgi_rpc.shm_length"])
buffer = shm_segment.read(offset, length)
resolved_batch = deserialize_ipc_stream(buffer, batch.schema)
// Strip pointer keys, add provenance
resolved_metadata = remove_keys(custom_metadata, "vgi_rpc.shm_offset", "vgi_rpc.shm_length")
resolved_metadata["vgi_rpc.shm_source"] = shm_segment.name
release_fn = () => shm_segment.free(offset)
return (resolved_batch, resolved_metadata, release_fn)
12. External Storage Pointer Batches¶
When batches exceed a configurable size threshold, they can be externalized to remote storage (e.g., S3, GCS) and replaced with pointer batches.
Pointer batch format¶
- num_rows: 0
- Schema: Same as the original batch's schema.
- Custom metadata:
vgi_rpc.location: URL to fetch the batch data (typically a pre-signed URL).- Must NOT contain
vgi_rpc.log_level(to distinguish from log batches).
Externalization (writing)¶
When a data batch's total buffer size exceeds the threshold:
- Serialize all batches from the current output cycle (log batches + data batch) as a single IPC stream.
- Optionally compress with zstd.
- Upload to external storage via the
ExternalStorage.upload()interface. - Replace the entire cycle with a single zero-row pointer batch containing
vgi_rpc.location.
Resolution (reading)¶
resolve_external_location(batch, custom_metadata, config):
IF config is NULL:
return (batch, custom_metadata)
IF NOT is_external_pointer(batch, custom_metadata):
return (batch, custom_metadata)
url = custom_metadata["vgi_rpc.location"]
// Validate URL (default: HTTPS only)
config.url_validator(url)
// Fetch with retries
// Default: max 3 total attempts (max_retries=2, capped at 2)
// Retry delay: 0.5s fixed between attempts
// Retryable errors: network/OS errors, Arrow parse errors, HTTP client errors
data = fetch_url(url, config.fetch_config)
// Decompress if needed (zstd)
// Open as IPC stream, dispatch log batches, extract data batch
reader = open_ipc_stream(data)
FOR each batch in reader:
IF is_log_or_error(batch):
deliver to on_log callback
CONTINUE
IF has "vgi_rpc.location":
ERROR: redirect loop detected
data_batch = batch
// Validate schema match
IF data_batch.schema != expected_schema:
ERROR: schema mismatch
// Add fetch provenance metadata
resolved_metadata["vgi_rpc.location.fetch_ms"] = elapsed_ms
resolved_metadata["vgi_rpc.location.source"] = url
return (data_batch, resolved_metadata)
Stream externalization¶
For stream methods, the server may externalize an entire output cycle (log batches + data batch) as one IPC stream. The pointer batch replaces the entire cycle. On resolution, the client reads back all batches, dispatches log batches, and returns the data batch.
13. Version Negotiation & Error Handling¶
Version checking¶
Every request batch MUST carry vgi_rpc.request_version in its custom
metadata with the value "1".
| Condition | Error |
|---|---|
vgi_rpc.request_version missing |
VersionError — server writes an error stream on the empty schema. |
vgi_rpc.request_version != "1" |
VersionError — server writes an error stream on the empty schema. |
vgi_rpc.method missing |
RpcError (ProtocolError) — server writes an error stream. |
| Unknown method name | RpcError (AttributeError) — error stream includes available method names. |
| Request batch has wrong row count (not 1, on non-empty schema) | RpcError (ProtocolError). |
| Non-optional parameter is null | TypeError — error stream on the method's result schema. |
Error stream format¶
Protocol-level errors are written as a complete IPC stream on the appropriate schema (empty schema for version/method errors, result schema for parameter validation errors):
IPC Stream (error):
Schema message (empty or result schema)
1 zero-row batch with EXCEPTION-level log metadata
EOS marker
HTTP status code mapping¶
| Error condition | HTTP status |
|---|---|
| Bad IPC, missing metadata, version mismatch, param validation | 400 Bad Request |
| Expired or tampered state token | 400 Bad Request |
| Authentication failure | 401 Unauthorized |
| Unknown method | 404 Not Found |
| Wrong Content-Type | 415 Unsupported Media Type |
| Implementation error (unary) | 500 Internal Server Error |
| Implementation error (stream init) | 500 Internal Server Error |
| Type error in implementation | 400 Bad Request |
Note: Even for HTTP 400/500 responses, the response body is a valid Arrow IPC stream containing an error batch, except for 401 (plain text) and 415 (Falcon default response).
14. Introspection (__describe__)¶
The __describe__ method is a built-in synthetic unary method that returns
machine-readable metadata about all methods exposed by the server. It is
optional for implementors.
Request¶
Standard unary request with:
- vgi_rpc.method = "__describe__"
- Empty params schema (zero fields, one row)
Response¶
A single IPC stream with one row per method. The response batch carries custom metadata:
vgi_rpc.protocol_name— Protocol class namevgi_rpc.request_version— Wire protocol version ("1")vgi_rpc.describe_version— Introspection format version ("2")vgi_rpc.server_id— Server instance identifier
Response batch schema¶
| Column | Arrow type | Nullable | Description |
|---|---|---|---|
name |
utf8 |
No | Method name |
method_type |
utf8 |
No | "unary" or "stream" |
doc |
utf8 |
Yes | Method docstring |
has_return |
bool |
No | Whether the unary method returns a value |
params_schema_ipc |
binary |
No | Serialized pa.Schema for request parameters |
result_schema_ipc |
binary |
No | Serialized pa.Schema for unary response |
param_types_json |
utf8 |
Yes | JSON: {"param_name": "type_name", ...} |
param_defaults_json |
utf8 |
Yes | JSON: {"param_name": default_value, ...} |
has_header |
bool |
No | Whether the stream method has a header type |
header_schema_ipc |
binary |
Yes | Serialized pa.Schema for the header (null if no header) |
The params_schema_ipc, result_schema_ipc, and header_schema_ipc
columns contain Arrow schemas serialized via pa.Schema.serialize().
Appendix A: IPC Stream EOS Marker¶
The end-of-stream marker is the 8-byte sequence:
This signals to the IPC stream reader that no more messages follow.
Appendix B: Empty Schema¶
The "empty schema" referenced throughout this specification is an Arrow
schema with zero fields: pa.schema([]). When serialized, it produces a
small fixed-size blob. Batches on the empty schema have zero columns.
Appendix C: Empty Schema Serialized Form¶
The empty schema (pa.schema([])) serializes to a fixed 56-byte blob via
pa.Schema.serialize(). This is useful for cross-language implementations
that need to produce or compare serialized empty schemas (e.g., for the
input_schema_bytes field in state tokens for producer streams):
ff ff ff ff 30 00 00 00 10 00 00 00 00 00 0a 00
0c 00 06 00 05 00 08 00 0a 00 00 00 00 01 04 00
0c 00 00 00 08 00 08 00 00 00 04 00 08 00 00 00
04 00 00 00 00 00 00 00
This is a Flatbuffers-encoded Arrow Schema message with zero fields. Implementations MAY hard-code this constant rather than generating it at runtime. The serialized form is stable across Arrow versions.
Appendix D: Request Metadata Location¶
A common implementation question: the vgi_rpc.method key appears in the
request batch's custom metadata (per-batch metadata), not in the
schema-level metadata. This is by design — schema-level metadata is part of
the IPC stream schema message and cannot vary between batches, while
custom metadata is per-batch and can carry request-specific values.
vgi_rpc.request_version is also in batch custom metadata.