Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AGC design principles #2076

Open
jirhiker opened this issue Jan 11, 2022 · 0 comments
Open

AGC design principles #2076

jirhiker opened this issue Jan 11, 2022 · 0 comments

Comments

@jirhiker
Copy link
Member

  1. Use high level language
  2. divide into jobs
  3. if software fails restart
  4. checkpoint good state
  5. hardware monitors software
  6. Send Telemetry

pychron already follows 1 and 2 pretty strictly

  1. if software fails restart. pychron should launch automatically on experiment computers. use information from (4) if available.
  2. checkpoint good state. extend the events system to persist experiment state
  3. Hardware monitors software. Watchdog is kinda like this. software monitoring software but in this context i think its ok
  4. Send Telemetry. Add a telemetry to the watch dog plugin. added more events and context to experiment executor so more telemetry is send to the watchdog.

If the computer/experiment fails and the system passes all checks on relaunch then restart the experiment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant