How to increase the AWS SageMaker invocation timeout while waiting for a response
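It's currently not possible to increase the timeout; this is an open issue on GitHub. Real-time endpoints require the model container to respond within 60 seconds. Looking through the issue and similar questions on SO, it seems you may be able to work around the limit by running long requests through batch transform, alongside the real-time inference endpoint for everything else.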
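To make the batch transform workaround more concrete, here is a minimal sketch using boto3's `create_transform_job`. The model name, S3 URIs, instance type, and the helper names `build_transform_request` and `submit_transform_job` are placeholders of mine, not something taken from the linked issue. As far as I can tell, `ModelClientConfig` lets a transform job wait longer per invocation than a real-time endpoint does, but check the limits against the current API reference before relying on it.

```python
import time


def build_transform_request(model_name, input_s3_uri, output_s3_uri,
                            timeout_seconds=600,
                            instance_type="ml.m5.xlarge"):
    """Build keyword arguments for SageMaker's CreateTransformJob call.

    All argument values are hypothetical; substitute your own model
    name, S3 locations, and instance type.
    """
    return {
        # Job names must be unique, so append a timestamp.
        "TransformJobName": f"{model_name}-batch-{int(time.time())}",
        "ModelName": model_name,
        # Per-invocation timeout for the transform job (assumed to be
        # the setting that replaces the fixed real-time limit).
        "ModelClientConfig": {
            "InvocationsTimeoutInSeconds": timeout_seconds,
            "InvocationsMaxRetries": 1,
        },
        "TransformInput": {
            "DataSource": {
                "S3DataSource": {
                    "S3DataType": "S3Prefix",
                    "S3Uri": input_s3_uri,
                }
            },
            "ContentType": "application/json",
        },
        "TransformOutput": {"S3OutputPath": output_s3_uri},
        "TransformResources": {
            "InstanceType": instance_type,
            "InstanceCount": 1,
        },
    }


def submit_transform_job(request):
    """Submit the job; requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so building the request needs no AWS setup

    client = boto3.client("sagemaker")
    return client.create_transform_job(**request)
```

Calling `submit_transform_job(build_transform_request("my-model", "s3://my-bucket/input/", "s3://my-bucket/output/"))` would start the job. The results land in the output S3 prefix rather than coming back in the response, so this only fits workloads that can tolerate asynchronous results.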
References
Related Stack Overflow answer: https://stackoverflow.com/a/55642675/806876
SageMaker Python SDK timeout issue: https://github.com/aws/sagemaker-python-sdk/issues/1119