-
Notifications
You must be signed in to change notification settings - Fork 214
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
timeout while waiting for state to become 'success' (timeout: 2m0s) #760 #765
Comments
Any updates on this? I am hitting the same error even though the API key is far from reaching the rate limit. |
I am hitting this same issue |
Is the team aware of this issue and actively working on it? Our pipeline is broken because of this bug for weeks now. Appreciate if somebody can look into it. |
seems like it might be related to how the terraform SDK handles retries [related issue] I'm receiving the same WaitForState errors the users in the issue above (and related issues) are seeing:
A fix has been submitted upstream to the SDK but has not been reviewed for nearly two years. |
This is causing major negatives including data loss |
This may be related to the new change in API limits. The best practice documented by PagerDuty should be implemented. |
You can check that by turning on debug logging before running your terraform. None of the API calls which timed out for me were even close to hitting the rate limit. Agreed that best practices outlined by PagerDuty should be followed. |
I am sure we have confirmed it isn't 429s already but the code is already aware of 429s and does a retry it looks like. If this were happening because of throttling the 429 shouldn't be hidden in the debug log, It's already giving an error, it should be the right error. Giving the actual error message for failure is pretty standard to present without requiring debugging on. I'm going to just delete the alerting entirely today b/c I have deadlines. |
@imjaroiswebdev I believe this issue is still present in both 3.2.1 and 3.2.2 I have not had any failures due to rate limiting so I'm not surprised that a fix only focused on that had no change to the behavior I and others are seeing.
The last 200 I received before this showed the following retry headers:
|
There was a patch predicted to address the timeouts but it does not appear to have been successful.
The PR, #763, seems to address the issue by making changes to the EVENTS API but the timeout we suspect is in the REST API (reference: https://developer.pagerduty.com/docs/ZG9jOjExMDI5NTUz-rate-limiting)
For full issue history, see #760
The text was updated successfully, but these errors were encountered: