RFC: Consistent wire-format test cases

5 Jul 2018

Currently, conjure-generators all use slightly different test-cases, each covering different edge cases. This makes code reviews tricky as it isn't always self-evident whether a change will maintain wire compatibility. This RFC proposes a single tool to test that clients adhere to a common wire spec, defined in wire.md. Verifying server compliance will be tackled in a separate, dedicated RFC.

Goals

Fearless code reviews - a conjure-generator can be drastically refactored and still demonstrate that it will maintain wire compatibility.
Eliminate duplicated work - maintainers no longer need to try to think of all possible edge cases when testing a conjure-generator or client, these should be exhaustively defined in one place.
Easy to use - it should be easy for contributors to verify their generator is compliant locally and on CI.
Easy to update the tests - the master test cases should have a compact, human-readable definition and should be published centrally so that consumers can easily upgrade.

Types of compliance

To be fully compliant, a Conjure-generated client must:

understand spec-compliant server responses
always send spec-compliant requests
reject non-compliant server responses

Note, there isn't a 'serialization' category here because client-side and server-side serialization and deserialization are subtly different. For example, clients must tolerate extra fields in a server response (otherwise server authors would never be able to add new functionality). Servers on the other hand may reject unknown fields in a JSON request body as this is likely a programmer error.

Published artifacts

A single repository will publish the following versioned artifacts:

test-cases.json - a self-contained file containing exhaustive tests for both clients and servers.
verification-api.conjure.json - clients must be generated from this conjure IR file in order to interact with the test server. This conjure definition should also contain types sufficient for deserializing the test-cases.json file.
verification-server - an executable that will produce sample responses and make assertions about the generated client's requests. This will be published as a docker image and as a standalone binary (initially just supporting Linux/Darwin).

Workflow

Prerequisites

ensure test-cases.json is available locally
ensure clients have been generated from the verification-api.conjure.json IR file
ensure verification-server is available (either as a binary or as a docker image)

In some languages it is acceptable to check-in these files and in other languages it is more appropriate to declare a dependency on the versioned artifact and download them on demand. We recommend downloading them on demand if possible, as this reduces the risk of contributors modifying these files.

Users must write some language-specific test harness - this harness must:

spin up the verification-server as an external resource. For example in JUnit, this would probably be a @BeforeClass method.
read in test-cases.json and invoke their client to make all network calls to the running verification-server.
make one final call to the verification-server to check no test-cases have been missed. In JUnit this could be an @AfterClass method.
shut down the verification-server.

Note, we recommend making the harness easy to run from within your IDE so that devs have familiar tools available if they want to step through a particular test.

`test-cases.json`

The test-cases.json file is a well-structured file that will capture behaviour and edge cases from wire.md, e.g.:

client:
  autoDeserialize:
    ...
    receiveSafeLongExample:
      positive:
        - '{"value":-9007199254740991}'
        - '{"value":0}'
        - '{"value":9007199254740991}'
      negative:
        - '{"value":null}'
        - '{}'
        - '{"value":-9007199254740992}'
        - '{"value":9007199254740992}'
        - '{"value":1.23}'
        - '{"value":"12"}'

test-cases.json file can be deserialized using the Conjure TestCases type, defined in verification-api.conjure.json. Note that this RFC doesn't specify the exact format or contents as we'll undoubtedly want to release improvements (see the 'Versioning' section below). Key topics that will be covered include:

primitives (string, boolean, datetime, safelong, etc)
collection types (list, set, map, optional)
complex types (object, union, enum)
binary uploads and responses, binary fields and path params
auth types (header, none, cookie)
headers
http methods (GET, PUT, POST, DELETE)
path and query parameters
coercing from absent fields / null fields
invalid JSON
set uniqueness, duplicate map keys

Partial compliance

Ideally generators should be 100% compliant, but new generators might not comply with all the edge cases from test-cases.json. Generators may still be usable in this state, but it is recommended to declare any ignored tests in a well-structured ignored-test-cases.json file so that potential users can easily evaluate degrees of completeness.

This file should be typed according to the IgnoredTestCases Conjure type from verification-api.conjure.json.

Where should tests be run

Ideally, these tests should run as part of continuous integration for the conjure-<lang> generator, but also on the conjure-<lang>-runtime repo (suggested in RFC 003) so that regressions are also prevented in the supporting libraries.

Versioning

Changes to the test-cases.json, verification-api.conjure.json or verification-server will be released and versioned according to SemVer - new test cases will be considered a minor bump, adding whole new categories of tests or renaming classes will be considered a break.

Alternatives considered

tests are driven by a central harness binary - instead of requiring users to write a harness as described above, the harness could be written once and then provided to all languages. In this formulation, the harness "drives" all the tests. The downside of this approach is that IDE integration is sub-optimal, so it's harder to debug failing tests. The upside is that the harness binary is the source of truth of which tests to run, so there isn't the "checkNoTestCasesHaveBeenMissed" step at the end.
generative tests - fuzzing and generative testing involve producing vast numbers of test cases that all maintain some invariant. This can catch bugs that nobody anticipated, but has the downside that tests are not always deterministic or reproducible.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!