Amazon SageMaker AI announces new observability capability for inference endpoints - AWS
…Amazon CloudWatch gives customers token latency, GPU utilization, inference component copy counts, scaling events, and cold-start breakdowns in a single view, with OpenTelemetry-native metrics published automatically and no instrumentation required. This…