Does not parse HTML properly

Our production application contains quite a few inline `<script>` tags with accumulated javascript inside. An excerpt looks like this:

```
<head>
<script>
// snip
                            if ( something < other ) {
// snip
                            // explanatory comment: we replace " and ' as late as possible
// snip
</script>

<esi:remove>This directive is not executed</esi:remove>
</head>
```

When processing this kind of content, the `esi` crate does not execute any esi-directives (at least inside `<head>` in the example, directives later in `<body>` are picked up). I guess this is due to using `quick_xml` as the parser, which expects XML, where e.g. `<` inside the script tag would have to be escaped as `&lt;`, but is getting HTML, where the [escaping rules are much more relaxed](https://html.spec.whatwg.org/multipage/syntax.html#raw-text-elements) -- and conversely, applying XML-style escapes in an HTML document results in JavaScript syntax errors, so that's not a solution. I think we really need an HTML-aware parser here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Does not parse HTML properly #18

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Does not parse HTML properly #18

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions