RSDK-9706: Fix FTDC tracking of module restarts/crashes. #4695

Merged
dgottlieb merged 2 commits into viamrobotics:main from RSDK-9706 on Jan 10, 2025

Conversation

dgottlieb
Member

No description provided.

@dgottlieb dgottlieb requested a review from cheukt January 8, 2025 21:14
@viambot viambot added the "safe to test" label Jan 8, 2025
@@ -441,6 +441,9 @@ func main() {
nolintPrintln("reset range")
nolintPrintln("- Unset any prior range. \"zoom out to full\"")
nolintPrintln()
+nolintPrintln("r, refresh")
Member Author

Drive-by to document the "help" output for something I added in a prior PR.

@@ -867,6 +867,10 @@ func (mgr *Manager) newOnUnexpectedExitHandler(mod *module) func(exitCode int) b
"error", err)
}

+if mgr.ftdc != nil {
+mgr.ftdc.Remove(mod.getFTDCName())
Member Author

The bug fix for modules crashing -- this was just missing altogether before.
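
To illustrate why a missing `Remove` on the crash path matters, here is a minimal sketch. It is not the Viam `ftdc` API: the `registry` type, `restartAfterCrash`, and the rule that `Add` rejects duplicate names are all assumptions made for illustration. Under those assumptions, a module that crashes and restarts cannot be registered again unless its old entry is removed first.

```go
package main

import (
	"errors"
	"fmt"
)

// registry is a hypothetical stand-in for an FTDC-style collector. It maps a
// name to a function that reports a stat. It is not the actual Viam ftdc API.
type registry struct {
	statsers map[string]func() int
}

func newRegistry() *registry {
	return &registry{statsers: make(map[string]func() int)}
}

// Add rejects duplicate names (an assumption of this sketch), so a stale
// entry blocks re-registration under the same name.
func (r *registry) Add(name string, statser func() int) error {
	if _, ok := r.statsers[name]; ok {
		return errors.New("name already registered: " + name)
	}
	r.statsers[name] = statser
	return nil
}

// Remove deletes the entry for name; it is a no-op for unknown names.
func (r *registry) Remove(name string) {
	delete(r.statsers, name)
}

// restartAfterCrash models an unexpected-exit handler followed by a restart.
// With cleanup set to false it skips Remove, mimicking the missing call.
func restartAfterCrash(r *registry, name string, cleanup bool) error {
	if cleanup {
		r.Remove(name)
	}
	return r.Add(name, func() int { return 0 })
}

func main() {
	r := newRegistry()
	_ = r.Add("modA", func() int { return 0 })

	// Without cleanup the stale entry makes the re-Add fail.
	fmt.Println(restartAfterCrash(r, "modA", false) != nil) // true
	// With cleanup the module is tracked again after the restart.
	fmt.Println(restartAfterCrash(r, "modA", true) != nil) // false
}
```

Whether the real collector rejects, overwrites, or duplicates a stale entry is not stated in this PR; the sketch only shows one way a skipped removal can break tracking after a restart.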

@@ -1225,7 +1229,7 @@ func (m *module) stopProcess() error {
// of metrics will be reported. Therefore it is safe to continue monitoring the module process
// while it's in shutdown.
if m.ftdc != nil {
-m.ftdc.Remove(m.process.ID())
+m.ftdc.Remove(m.getFTDCName())
Member Author

The happy path for a stop -> restart (presumably a module upgrade falls into this category).
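
The one-line change in this hunk swaps the key passed to `Remove`. A rough sketch of why the key matters, assuming the entry was registered under the FTDC name and not the process ID: the `tracker` and `mod` types, the `"modules."` prefix, and the `entriesAfterRemove` helper are hypothetical and only illustrate the mismatch.

```go
package main

import "fmt"

// tracker is a hypothetical registry keyed by name. It is not the Viam ftdc API.
type tracker struct {
	entries map[string]bool
}

func (t *tracker) Add(name string)    { t.entries[name] = true }
func (t *tracker) Remove(name string) { delete(t.entries, name) }

// mod is a hypothetical module with two different identifiers.
type mod struct {
	name      string
	processID string
}

// ftdcName builds the key used at registration time. The "modules." prefix
// is an assumption for this sketch.
func (m mod) ftdcName() string { return "modules." + m.name }

// entriesAfterRemove registers m under its FTDC name, removes by the given
// key, and reports how many entries are left.
func entriesAfterRemove(m mod, removeKey string) int {
	t := &tracker{entries: make(map[string]bool)}
	t.Add(m.ftdcName())
	t.Remove(removeKey)
	return len(t.entries)
}

func main() {
	m := mod{name: "camera", processID: "camera-proc"}

	// Removing by a key that was never registered leaves the entry behind.
	fmt.Println(entriesAfterRemove(m, m.processID)) // 1
	// Removing by the registration key clears it.
	fmt.Println(entriesAfterRemove(m, m.ftdcName())) // 0
}
```

Deleting a missing key from a Go map is silent, so a mismatch like this produces no error at the call site; the stale entry only shows up later.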

@viambot viambot added and removed the "safe to test" label Jan 10, 2025
Member

@cheukt cheukt left a comment

LGTM

@dgottlieb dgottlieb merged commit d2b9c7e into viamrobotics:main Jan 10, 2025
16 checks passed
@dgottlieb dgottlieb deleted the RSDK-9706 branch January 10, 2025 16:09