Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Connector in Typescript course #1039

Open
wants to merge 9 commits into
base: master
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 4 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
Expand Up @@ -8,212 +8,153 @@ Let's implement aggregates in our SQLite connector.

Like we've done before, we won't implement aggregates in their full generality, and instead we're going to implement two
types of aggregates, called `star_count` and `column_count`. Other aggregates like `SUM` and `MAX` that you know from
Postgres will come under the umbrella of _custom aggregate functions_.

If we take a look at our failing tests, we see that aggregate queries are indicated by the presence of the `aggregates`
field in the query request body. Just like the `fields` property that we handled previously, each aggregate is named
with a key, and has a `type`, in this case `star_count`. So we're going to handle aggregates very similarly to fields,
by building up a SQL target list from these aggregates.

```JSON
{
"collection": "albums",
"query": {
"aggregates": {
"count": {
"type": "star_count"
}
},
"limit": 10
},
"arguments": {},
"collection_relationships": {}
Postgres will come under the umbrella of _custom aggregate functions_, and we'll cover those separately in another
tutorial.

Let's start by adding the `aggregates` capability to our capabilities response:

```typescript
return {
version: "0.1.2",
capabilities: {
query: { aggregates: {} },
mutation: {},
relationships: {}
}
}
```
Aggregate queries are indicated by the presence of the `aggregates` field in the query request body. Just like the
`fields` property that we handled previously, each aggregate is named with a key, and has a `type`, in this case
`star_count`. So we're going to handle aggregates very similarly to fields, by building up a SQL target list from these
aggregates.

[The NDC spec](https://hasura.github.io/ndc-spec/specification/queries/aggregates.html) says that each aggregate should
act over the same set of rows that we consider when returning `rows`. That is, we should apply any predicates, sorting,
pagination, and so on, and then apply the aggregate functions over the resulting set of rows.
The NDC spec says that each aggregate should act over the same set of rows that we consider when returning `rows`. That
is, we should apply any predicates, sorting, pagination, and so on, and then apply the aggregate functions over the
resulting set of rows.

So assuming we have a function called `fetch_aggregates` which builds the SQL in this way, we can fill in the
`aggregates` in the response.

If the `query` function, add this line and amend the return type to include aggregates:
`aggregates` in the response:

```typescript
const aggregates = request.query.aggregates && await fetch_aggregates(state, request);

return [{ rows,aggregates }];
```

So the final query function becomes:
```typescript
async function query(configuration: Configuration, state: State, request: QueryRequest): Promise<QueryResponse> {
console.log(JSON.stringify(request, null, 2));
Now let's start to fill in a `fetch_aggregates` function.

We'll actually copy/paste the `fetch_rows` function and create a new function for handling aggregates. It'd be possible to extract that commonality into a shared function, but arguably not worth it, since so much is already extracted out into small helper functions anyway.

const rows = request.query.fields && await fetch_rows(state, request);
const aggregates = request.query.aggregates && await fetch_aggregates(state, request);
```typescript
async function fetch_aggregates(state: State, request: QueryRequest): Promise<{
[k: string]: unknown
}> {

return [{ rows, aggregates }];
}
```

Now let's start to fill in a `fetch_aggregates` helper function.

We'll actually copy/paste the `fetch_rows` function and create a new function for handling aggregates. It would be
possible to extract that commonality into a shared function, but arguably not worth it, since so much is already
extracted out into small helper functions anyway.

The first difference is the return type. Instead of `RowFieldValue`, we're going to return a value directly from the
database, so let's change that to `unknown`.
The first difference is the return type. Instead of `RowFieldValue`, we're going to return a value directly from the database, so let's change that to `unknown`.

Next, we want to generate the target list using the requested aggregates, so let's change that.

```typescript
async function fetch_aggregates(state: State, request: QueryRequest): Promise<{ [k: string]: unknown }> {
const target_list = [];

for (const aggregateName in request.query.aggregates) {
if (Object.prototype.hasOwnProperty.call(request.query.aggregates, aggregateName)) {
const aggregate = request.query.aggregates[aggregateName];
switch(aggregate.type) {
case 'star_count':

case 'column_count':

case 'single_column':
}
const target_list = [];

for (const aggregateName in request.query.aggregates) {
if (Object.prototype.hasOwnProperty.call(request.query.aggregates, aggregateName)) {
const aggregate = request.query.aggregates[aggregateName];
switch (aggregate.type) {
case 'star_count':
// TODO
case 'column_count':
// TODO
case 'single_column':
// TODO
}
}

}
```

For now, we'll handle the first two cases here.
For now, we'll handle the first two cases here, and save the last for when we talk about custom aggregates.

In the first case, we want to generate a target list element which uses the `COUNT` SQL aggregate function.

```typescript
// ...
case 'star_count':
target_list.push(`COUNT(1) AS ${aggregateName}`);
break;
// ...
```

In the second case, we'll also use the `COUNT` function, but this time, we're counting non-null values in a single column:

```typescript
// ...
case 'column_count':
target_list.push(`COUNT(${aggregate.column}) AS ${aggregateName}`);
break;
// ...
```

We also need to interpret the `distinct` property of the aggregate object, and insert the `DISTINCT` keyword if needed:

```typescript
// ...
case 'column_count':
target_list.push(`COUNT(${aggregate.distinct ? 'DISTINCT ' : ''}${aggregate.column}) AS ${aggregateName}`);
break;
// ...
```

We'll create a new generated SQL function within `fetch_aggregates()` to use the generated target list:
Now let's update our generated SQL to use the generated target list:

```typescript
// ...
const sql = `SELECT ${target_list.join(", ")} FROM (
(
SELECT * FROM ${request.collection} ${where_clause} ${order_by_clause} ${limit_clause} ${offset_clause}
)`;
// ...
const sql = `SELECT ${target_list.length ? target_list.join(", ") : "1 AS __empty"} FROM (
(
SELECT * FROM ${request.collection} ${where_clause} ${order_by_clause} ${limit_clause} ${offset_clause}
)`;
```

Note that we form the set of rows to be aggregated first, so that the limit and offset clauses are applied correctly.

And instead of returning all rows, we're going to assume that we only get a single row back, so we can match on that and
return the single row of aggregates:
And instead of returning all rows, we're going to assume that we only get a single row back, so we can match on that and return the single row of aggregates:

```typescript
const result = await state.db.get(sql, ...parameters);

delete result.__empty;

if (result === undefined) {
throw new InternalServerError("Unable to fetch aggregates");
}

return result;
```

Here's the full function:

```typescript
async function fetch_aggregates(state: State, request: QueryRequest): Promise<{
[k: string]: unknown
}> {
const target_list = [];

for (const aggregateName in request.query.aggregates) {
if (Object.prototype.hasOwnProperty.call(request.query.aggregates, aggregateName)) {
const aggregate = request.query.aggregates[aggregateName];
switch(aggregate.type) {
case 'star_count':
target_list.push(`COUNT(1) AS ${aggregateName}`);
break;
case 'column_count':
target_list.push(`COUNT(${aggregate.distinct ? 'DISTINCT ' : ''}${aggregate.column}) AS ${aggregateName}`);
break;
case 'single_column':
throw new NotSupported("custom aggregates not yet supported");
}
}
}

const parameters: any[] = [];

const limit_clause = request.query.limit == null ? "" : `LIMIT ${request.query.limit}`;

const offset_clause = request.query.offset == null ? "" : `OFFSET ${request.query.offset}`;

const where_clause = request.query.where == null ? "" : `WHERE ${visit_expression(parameters, request.query.where)}`;

const order_by_clause = request.query.order_by == null ? "" : `ORDER BY ${visit_order_by_elements(request.query.order_by.elements)}`;

const sql = `SELECT ${target_list.join(", ")} FROM (
SELECT * FROM ${request.collection} ${where_clause} ${order_by_clause} ${limit_clause} ${offset_clause}
)`;

console.log(JSON.stringify({ sql, parameters }, null, 2));

const result = state.db.get(sql, ...parameters);

if (result === undefined) {
throw new InternalServerError("Unable to fetch aggregates");
}

return result;
}
That's it, so let's test our connector one more time, and hopefully see some passing tests this time.

```sh
ndc-test test --endpoint http://localhost:8080 --snapshots-dir snapshots

...
├ Query ...
│ ├ albums ...
│ │ ├ Simple queries ...
│ │ │ ├ Select top N ... OK
│ │ │ ├ Predicates ... OK
│ │ │ ├ Sorting ... OK
│ │ ├ Relationship queries ...
│ │ ├ Aggregate queries ...
│ │ │ ├ star_count ... OK
│ │ │ ├ column_count ... OK
│ │ │ ├ single_column ... OK
...
```

That's it, so let's test our connector one more time, and hopefully see some passing tests this time.

Remember to delete the snapshots first, so that we can generate new ones:

```bash
rm -rf snapshots
```
Note that `ndc-test` is now testing aggregates automatically, since we turned on the `aggregates` capability.

And re-run the tests with the snapshots directory:

```shell
ndc-test test --endpoint http://0.0.0.0:8100 --snapshots-dir snapshots
```

OR
```shell
cargo run --bin ndc-test -- test --endpoint http://localhost:8100 --snapshots-dir snapshots
```
And let's check that we're generating the right SQL. Picking a random example from the logs, we can see that we are indeed generating well-formed SQL:

Nice! We've now implemented the `star_count` and `column_count` aggregates, and we've seen how to generate SQL for them.
```sql
SELECT
COUNT(id) AS id_count,
COUNT(DISTINCT id) AS id_distinct_count,
COUNT(name) AS name_count,
COUNT(DISTINCT name) AS name_distinct_count
FROM (
SELECT * FROM artists LIMIT 10
)
```

This file was deleted.

Original file line number Diff line number Diff line change
Expand Up @@ -26,20 +26,23 @@ function which take a `connector` of type `Connector`.
In your `src/index.ts` file, add the following:

```typescript
const connector: Connector<RawConfiguration, Configuration, State> = {};
const connector: Connector<Configuration, State> = {};

start(connector);
```

We will also need some imports over the course of the tutorial. Paste these at the top of your index.ts file:

```typescript
import opentelemetry from '@opentelemetry/api';
import sqlite3 from 'sqlite3';
import { readFile } from 'fs/promises';
import { resolve } from 'path';
import { Database, open } from 'sqlite';
import { BadRequest, CapabilitiesResponse, CollectionInfo, ComparisonValue, Connector, ExplainResponse, InternalServerError, MutationRequest, MutationResponse, NotSupported, ObjectField, ObjectType, OrderByElement, QueryRequest, QueryResponse, RowFieldValue, ScalarType, SchemaResponse, start } from "@hasura/ndc-sdk-typescript";
import { JSONSchemaObject } from "@json-schema-tools/meta-schema";
import { ComparisonTarget, Expression } from '@hasura/ndc-sdk-typescript/dist/generated/typescript/QueryRequest';
import { BadGateway, BadRequest, CapabilitiesResponse, CollectionInfo, ComparisonTarget, ComparisonValue, Connector, ConnectorError, ExplainResponse, Expression, ForeignKeyConstraint, InternalServerError, MutationRequest, MutationResponse, NotSupported, ObjectField, ObjectType, OrderByElement, Query, QueryRequest, QueryResponse, Relationship, RowFieldValue, ScalarType, SchemaResponse, start } from "@hasura/ndc-sdk-typescript";
import { withActiveSpan } from "@hasura/ndc-sdk-typescript/instrumentation";
import { Counter, Registry } from 'prom-client';
```

You'll notice that your IDE will complain about the `connector` object not having the correct type, and
`RawConfiguration, Configuration, State` all being undefined. Let's fix that in the next section...
`Configuration, State` all being undefined. Let's fix that in the next section...
Loading