Active Monitoring
Once your inference stack is deployed you want to set up active monitoring to ensure you are alerted to any issues before they impact your users.
Once your inference stack is deployed you want to set up active monitoring to ensure you are alerted to any issues before they impact your users.
Kubernetes offers Liveness, Readiness and Startup probes to monitor the health of your applications. These can be configured in the Inference Stack to ensure your models are running correctly.