-
Notifications
You must be signed in to change notification settings - Fork 61
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Declaring summary
to have markup other than text/html
?
#620
Comments
I'm not seeing any justification in
mastodon/mastodon#32538 for why any content type
other than HTML would be useful or preferred. I don't think Claire is
expressing any sort of *preference* for a markdown summary type, just
saying that she initially misunderstood its type.
Adding the requirement to have to process *different* mime types would have
made the PR much more complicated, anyway, than just adding HTML
sanitization, since in any case—regardless of what you're producing—you'll have to handle incoming HTML.
this just feels like overcomplicating the spec for no additional benefit. I'm still not convinced there's even a good justification for allowing `content` to be different media types—are there any major non-HTML implementations?
|
This issue has been labelled as potentially needing a FEP, and contributors are welcome to submit a FEP on the topic. |
So, I think the problem with adding flags to indicate that the summary is not HTML is that it's not backwards compatible; consumers will expect I agree that a primer page makes sense. I'd also suggest a FEP for defining a new |
One thing about the primer page is that there is the question of when an object does not have a |
minimal example where HTML parsing is destructive: {
"summary": "I am trying to serialize the RDF statement <Alice> <knows> <bob> into plain-text, but a naive HTML sanitizer is stripping the statement completely"
} {
"summary": "I am trying to serialize the RDF statement into plain-text, but a naive HTML sanitizer is stripping the statement completely"
} the workaround is to HTML-escape the angle brackets which might not be unescaped by every consumer |
Description of issue
name
is defined as "A simple, human-readable, plain-text name for the object. HTML markup MUST NOT be included."summary
is defined as "A natural language summarization of the object encoded as HTML."content
includes in its definition that "By default, the value of content is HTML. ThemediaType
property can be used in the object to indicate a different content type."So to synthesize these three definitions:
name
is alwaystext/plain
content
is whatever the value ofmediaType
is, wheremediaType
defaults totext/html
summary
is alwaystext/html
?But there are cases where a producer might want to signal a different content type for
summary
; for example,text/plain
ortext/markdown
. Recently, mastodon/mastodon#32538 came up as an example of wanting to produce asummary
that is NOTtext/html
. So the question is, might it make sense to provide a mechanism for declaring thatsummary
is something other thantext/html
?Potential solutions
mediaType
to cover bothcontent
andsummary
could work, but would prevent using different formats for each of the two separately.mediaTypeOfSummary
seems clunky, but might end up making sense or being necessary.@value
can have its ownmediaType
, although this would be pretty complicated and not backwards-compatible. (It would also break JSON-LD language containers, sonameMap
/summaryMap
/contentMap
would not work.)Action items
summary
to be something other thantext/html
, with eithermediaType
extending to cover it, or definingmediaTypeOfSummary
as an analogous property.The text was updated successfully, but these errors were encountered: