Can I set a limit to the maximum number of tokens allowed in a request? I.e. once the lexer knows the token count, if `count > N` fail the request.