Add security best practices page. #1820

mandiwise · 2024-11-07T17:19:05Z

Addresses a todo in #41

Description

This PR adds a new security best practices page with content on transport layer security, demand control, and schema considerations.

@benjie @jorydotcom

vercel · 2024-11-07T17:19:09Z

@mandiwise is attempting to deploy a commit to the The GraphQL Foundation Team on Vercel.

A member of the Team first needs to authorize it.

benjie

This is going to be a super valuable resource and one the community will refer to time and time again - thanks so much for your work on it! ✨ It reads really well and covers most of the important stuff without getting too bogged down in technical detail; I've noted a few minor issues with precision and have made a few suggestions for improvement below.

src/pages/learn/security.mdx

Co-authored-by: Benjie <[email protected]>

benjie

Looks good to me! Excited to get this out there 🙌 🎉

vercel · 2024-11-11T13:46:32Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
graphql-github-io	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Nov 21, 2024 6:51pm

saihaj · 2024-11-14T14:38:49Z

src/pages/learn/security.mdx

+
+As a build step during development clients may submit their GraphQL documents to the server's allowlist, where each document is stored with a unique ID—usually its hash. At runtime, the client can send a document ID instead of the full GraphQL document, and the server will only execute requests for known document IDs.
+
+Trusted documents are simply _persisted documents_ that are deemed to not be malicious, usually because they were authored by your developers and approved through code review. Efforts are underway to [standardize _persisted documents_ as part of the GraphQL-over-HTTP specification](https://github.com/graphql/graphql-over-http/pull/264)—an increasing number of GraphQL clients and servers already support them (sometimes under their legacy misnomer: _persisted queries_).


people also call these persisted operations

Also "stored queries" and "stored operations" (and maybe "stored documents", but I don't recall that?) have been used. I think that's too much detail to include in these docs though; "persisted queries" is sufficiently pervasive to warrant mentioning but the others can be glossed over.

Urigo

Very exciting to have this content here!

I would also maybe take this opportunity to link from the article to sections of tools and libraries that are related to the concepts in the article?
We can add a section call "security" under https://graphql.org/community/tools-and-libraries/ and link it here

src/pages/learn/security.mdx

iCarossio

Some suggestions :)

iCarossio · 2024-11-18T16:32:51Z

src/pages/learn/security.mdx

+Depending on the client implementation, query batching can be a useful strategy for limiting the number of round trips to a server to fetch all of the required data to render a user interface. However, there should be an upper limit on the total number of queries allowed in a single batch.
+
+As with depth limiting, a GraphQL implementation may have configuration options to restrict operation breadth, field alias usage, and batching.
+


Would be nice to add paragraph about recursive fragments :D

A GraphQL API cannot have recursive fragments, as stated by the GraphQL specification:

https://spec.graphql.org/draft/#sec-Fragment-Spreads-Must-Not-Form-Cycles

Interesting thanks!

Note: seems like it’s not always implemented properly — a CVE we found with our team.

GHSA-4rx6-g5vg-5f3j

That definitely seems like a bug in that implementation, missing one of the core validation rules is quite a major diversion from the spec - I would expect that to be opt-in.

It would be really good to get the https://github.com/graphql-cats/graphql-cats project (or one like it) up and running again so it can be checked if an implementation is actually spec-compliant. If anyone was interested in taking this on, I'm fairly sure it would likely get approval from the community grant program.

iCarossio · 2024-11-18T16:40:30Z

src/pages/learn/security.mdx

+Depth, breadth, and batch limiting help prevent broad categories of malicious operations such as cyclic queries and batching attacks, but they don't provide a way to declare that a particular field is computationally expensive to resolve. So for more advanced demand control requirements, you may wish to implement rate limiting. 
+
+Rate limiting may take place in different layers of an application, for example, in the network layer or the business logic layer. Because GraphQL allows clients to specify exactly what data they need in their queries, a server may not be able to know in advance if a request includes fields that will place a higher load on its resources during execution. As a result, applying useful rate limits for a GraphQL API typically requires a different approach than simply keeping track of the total number of incoming requests over a time period in the network layer; therefore applying rate limits within the business logic layer is generally recommended.
+


The perfect common example here is trying to by pass authentication server rate limiting using aliasing or batching.

iCarossio · 2024-11-18T16:42:22Z

src/pages/learn/security.mdx

+
+Even though the overall depth of this query is shallow, the underlying data source for the API will still have to handle a large number of requests to resolve data for the aliased `hero` field.
+
+Similarly, a client may send a GraphQL document with many batched operations in a request:


Is batching standard to GraphQL? I think it's not in the official specification: https://spec.graphql.org/October2021/

It's only on some GraphQL clients.

The official specification doesn't even cover HTTP. We're in the process of specifying batching as part of the GraphQL-over-HTTP working group:

Adds Variable Batching Proposal graphql-over-http#307

Add Request Batching Appendix graphql-over-http#308

iCarossio · 2024-11-18T16:44:56Z

src/pages/learn/security.mdx

+[Introspection](/learn/introspection/) is a powerful feature of GraphQL that allows you to query a GraphQL API for information about its schema. However, GraphQL APIs that only serve first-party clients may forbid introspection queries in non-development environments.
+
+Note that disabling introspection can be used as a form of "security through obscurity," but will not be sufficient to entirely obscure a GraphQL API by itself. It may be used as a part of a broader security strategy to limit the discoverability of API information from potential attackers. For more comprehensive protection of sensitive schema details and user data, trusted documents and authorization should be used.
+


Since you already added considerations about batching which I think is not standard in GraphQL, you can also talk about Field Suggestion which exits in some GraphQL clients.

This is one of many many ways of extracting schema information despite introspection being disabled, I worry if we list this as an example then people will think they just need to solve that too and then they're good - they're not. Ultimately: GraphQL schemas should not be treated as secrets/if your schema is secret then it should not accept arbitrary operations (i.e. you should use trusted documents).

If you're interested, a little more information on this topic can be found in my GraphQLConf 2024 talk; here's a timestamped link to the relevant part: https://youtu.be/Ytt1_ZIlYdg?t=1267

"Looking over the shoulder" is a fun one ;)
But that's totally true!

And Nice talk and thanks for the various mentions 🙏

benjie · 2024-11-18T18:00:40Z

Thanks for taking the time to review @iCarossio 🙌

benjie · 2024-11-21T18:47:33Z

I believe Uri's feedback has been addressed - the remaining action point I've filed an issue about:

Create a "tools and libraries" topic for "security" and link to it from the security page #1832

@iCarossio Please follow up with any suggested edits as PRs - ideally separate PRs for each topic so we can discuss each on its own grounds. Perhaps for some of the examples you propose it might make sense to add them via <details> tags so that people who are intrigued can read more but the casual reader doesn't need to be overwhelmed with detail? (This page is generally intended for someone new to the technology.)

Thanks for your feedback everyone, and for your work on this Mandi - I'm going to go ahead and merge! 🎉

I believe Uri's suggestions have been adopted.

Co-authored-by: Benjie <[email protected]>

Add security best practices page.

20317e3

benjie requested changes Nov 8, 2024

View reviewed changes

mandiwise and others added 2 commits November 10, 2024 14:17

Apply suggestions from code review

e171ddc

Co-authored-by: Benjie <[email protected]>

Apply more suggestions from code review

b690783

Co-authored-by: Benjie <[email protected]>

benjie approved these changes Nov 11, 2024

View reviewed changes

vercel bot deployed to Preview November 11, 2024 13:51 View deployment

Add note about masking field errors.

1699210

benjie approved these changes Nov 11, 2024

View reviewed changes

vercel bot deployed to Preview November 11, 2024 17:34 View deployment

saihaj reviewed Nov 14, 2024

View reviewed changes

saihaj approved these changes Nov 14, 2024

View reviewed changes

Urigo previously requested changes Nov 15, 2024

View reviewed changes

src/pages/learn/security.mdx Outdated Show resolved Hide resolved

iCarossio reviewed Nov 18, 2024

View reviewed changes

Edit trusted documents section.

fe4d1b1

benjie mentioned this pull request Nov 21, 2024

Create a "tools and libraries" topic for "security" and link to it from the security page #1832

Open

benjie approved these changes Nov 21, 2024

View reviewed changes

vercel bot deployed to Preview November 21, 2024 18:51 View deployment

benjie merged commit f5eac14 into graphql:source Nov 21, 2024
4 checks passed

mandiwise added a commit to mandiwise/graphql.github.io that referenced this pull request Nov 22, 2024

Add security best practices page. (graphql#1820)

fc47a37

Co-authored-by: Benjie <[email protected]>


		As a build step during development clients may submit their GraphQL documents to the server's allowlist, where each document is stored with a unique ID—usually its hash. At runtime, the client can send a document ID instead of the full GraphQL document, and the server will only execute requests for known document IDs.

		Trusted documents are simply _persisted documents_ that are deemed to not be malicious, usually because they were authored by your developers and approved through code review. Efforts are underway to [standardize _persisted documents_ as part of the GraphQL-over-HTTP specification](https://github.com/graphql/graphql-over-http/pull/264)—an increasing number of GraphQL clients and servers already support them (sometimes under their legacy misnomer: _persisted queries_).

		Depending on the client implementation, query batching can be a useful strategy for limiting the number of round trips to a server to fetch all of the required data to render a user interface. However, there should be an upper limit on the total number of queries allowed in a single batch.

		As with depth limiting, a GraphQL implementation may have configuration options to restrict operation breadth, field alias usage, and batching.

		Depth, breadth, and batch limiting help prevent broad categories of malicious operations such as cyclic queries and batching attacks, but they don't provide a way to declare that a particular field is computationally expensive to resolve. So for more advanced demand control requirements, you may wish to implement rate limiting.

		Rate limiting may take place in different layers of an application, for example, in the network layer or the business logic layer. Because GraphQL allows clients to specify exactly what data they need in their queries, a server may not be able to know in advance if a request includes fields that will place a higher load on its resources during execution. As a result, applying useful rate limits for a GraphQL API typically requires a different approach than simply keeping track of the total number of incoming requests over a time period in the network layer; therefore applying rate limits within the business logic layer is generally recommended.


		Even though the overall depth of this query is shallow, the underlying data source for the API will still have to handle a large number of requests to resolve data for the aliased `hero` field.

		Similarly, a client may send a GraphQL document with many batched operations in a request:

		[Introspection](/learn/introspection/) is a powerful feature of GraphQL that allows you to query a GraphQL API for information about its schema. However, GraphQL APIs that only serve first-party clients may forbid introspection queries in non-development environments.

		Note that disabling introspection can be used as a form of "security through obscurity," but will not be sufficient to entirely obscure a GraphQL API by itself. It may be used as a part of a broader security strategy to limit the discoverability of API information from potential attackers. For more comprehensive protection of sensitive schema details and user data, trusted documents and authorization should be used.

Add security best practices page. #1820

Add security best practices page. #1820

Uh oh!

Conversation

mandiwise commented Nov 7, 2024

Description

Uh oh!

vercel bot commented Nov 7, 2024

Uh oh!

benjie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

benjie left a comment

Choose a reason for hiding this comment

Uh oh!

vercel bot commented Nov 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Urigo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

iCarossio left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

benjie commented Nov 18, 2024

Uh oh!

benjie commented Nov 21, 2024

Uh oh!

Uh oh!

Uh oh!

vercel bot commented Nov 11, 2024 •

edited

Loading