Project author: gvanderberg
Project description:
Terraform scripts to create Azure Machine Learning services
Language: HCL
Repository: git://github.com/gvanderberg/tf_azure_ml.git
Azure Machine Learning
Disable network policies for private endpoints
az network vnet subnet update --name default --resource-group myResourceGroup --vnet-name myVirtualNetwork --disable-private-endpoint-network-policies true
Add a private endpoint to a workspace
az ml workspace private-endpoint add --resource-group myWSResourceGroup --workspace-name myWorkspaceName --pe-name myPrivateEndpoint --pe-vnet-name myVirtualNetwork --pe-subnet-name mySubnet --pe-resource-group myVNResourceGroup
Internal AKS load balancer
az ml computetarget update aks --name myInferenceCluster --load-balancer-subnet mySubnet --load-balancer-type InternalLoadBalancer --workspace myWorkspaceName --resource-group myResourceGroup
If you know the name of the deployed service, you can create a new instance of Webservice and provide the workspace and service name as parameters. The new object contains information about the deployed service:
from azureml.core import Workspace
from azureml.core.webservice import Webservice

ws = Workspace.from_config()
service = Webservice(workspace=ws, name='myServiceName')
print(service.scoring_uri)
print(service.swagger_uri)
If you know the name of the deployed service, use the az ml service show command:
az ml service show --name myServiceName --resource-group myWSResourceGroup --workspace-name myWorkspaceName
Authentication with keys
When you enable authentication for a deployment, you automatically create authentication keys.
- Authentication is enabled by default when you are deploying to Azure Kubernetes Service.
- Authentication is disabled by default when you are deploying to Azure Container Instances.
If authentication is enabled, you can use the get_keys method to retrieve the primary and secondary authentication keys:
primary, secondary = service.get_keys()
print(primary)
az ml service get-keys --name myServiceName --resource-group myWSResourceGroup --workspace-name myWorkspaceName
If you need to regenerate a key, use the regen-key command.
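As a sketch of key-based authentication, a client passes the key as a bearer credential in the Authorization header when calling the scoring endpoint. The URI and key below are hypothetical placeholders; substitute the values returned by service.scoring_uri and service.get_keys() for your deployment:

```python
import json
import urllib.request

# Hypothetical values -- replace with service.scoring_uri and the key
# returned by service.get_keys() or `az ml service get-keys`.
scoring_uri = 'http://myservice.azurecontainer.io/score'
primary_key = 'myPrimaryKey'

payload = json.dumps({'data': [[1.0, 2.0, 3.0, 4.0]]}).encode('utf-8')
request = urllib.request.Request(
    scoring_uri,
    data=payload,
    headers={
        'Content-Type': 'application/json',
        # Key-based auth: pass the primary or secondary key as a bearer token.
        'Authorization': f'Bearer {primary_key}',
    },
)
# Uncomment to call the live endpoint:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode('utf-8'))
```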
Authentication with tokens
When you enable token authentication for a web service, a user must provide an Azure Machine Learning JWT token to the web service to access it.
- Token authentication is disabled by default when you are deploying to Azure Kubernetes Service.
- Token authentication is not supported when you are deploying to Azure Container Instances.
If token authentication is enabled, you can use the get_token method to retrieve a bearer token and that token's expiration time:
token, refresh_by = service.get_token()
print(token)
az ml service get-access-token --name myServiceName --resource-group myWSResourceGroup --workspace-name myWorkspaceName
Currently the only way to retrieve the token is by using the Azure Machine Learning SDK or the Azure CLI machine learning extension.
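Because tokens expire, a client should check the refresh_by time returned by get_token and fetch a fresh token once it has passed, rather than retrying with a stale one. A minimal sketch, using placeholder values in place of a real service.get_token() call:

```python
import time

# Hypothetical values -- in practice these come from service.get_token()
# (SDK) or `az ml service get-access-token` (CLI).
token = 'eyJhbGciOi.sample.jwt'
refresh_by = time.time() + 3600  # epoch seconds after which a new token is needed

def auth_header(current_token, refresh_time):
    """Build the Authorization header and flag whether a refresh is due.

    When needs_refresh is True, call get_token() again before the next
    request instead of sending the expired token.
    """
    needs_refresh = time.time() >= refresh_time
    return {'Authorization': f'Bearer {current_token}'}, needs_refresh

headers, needs_refresh = auth_header(token, refresh_by)
```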
Troubleshooting tips
Most issues with consuming endpoints can be resolved by following the steps below.
Recommended Steps
- See Consume an Azure Machine Learning model deployed as a web service. Deploying an Azure Machine Learning model as a web service creates a REST API endpoint. You can send data to this endpoint and receive the prediction returned by the model. The linked article will show you how to create clients for the web service by using C#, Go, Java, and Python.
- See Consume the service from Power BI. After the web service is deployed, it’s consumable from Power BI dataflows. Learn how to consume an Azure Machine Learning web service from Power BI.
- How to update a deployed web service
- See Advanced Entry Script Authoring. This article shows how to write entry scripts for specialized use cases:
- The InferenceSchema Python package provides a uniform schema for common machine learning applications, as well as a set of decorators that help build web-based ML prediction applications.
- Troubleshoot a failed deployment
- Debug Locally
- HTTP status code 502
- HTTP status code 503
- HTTP status code 504
- Container cannot be scheduled error. When deploying a service to an Azure Kubernetes Service compute target, Azure Machine Learning attempts to schedule the service with the requested amount of resources. If no nodes with an appropriate amount of resources are available in the cluster after 5 minutes, the deployment fails.
- Service launch fails. As part of the container start-up process, the system invokes the init() function in your scoring script. If init() raises an uncaught exception, you might see a CrashLoopBackOff error in the error message.
- Function fails: get_model_path(). The init() function in the scoring script typically calls Model.get_model_path() to locate a model file or a folder of model files in the container. If the model file or folder cannot be found, the function fails.
- Function fails: run(input_data). If the service deploys successfully but crashes when you post data to the scoring endpoint, add an error-catching statement to your run(input_data) function so it returns a helpful error message.
- Advanced Debugging. You may need to interactively debug the Python code in your model deployment. Using Visual Studio Code and debugpy, you can attach to the code running inside the Docker container.
- Web service failures in Azure Kubernetes Service. Many web service failures in Azure Kubernetes Service can be debugged by connecting to the cluster with kubectl.
- Collect and evaluate model data
- Monitor and collect data from ML web service endpoints
- Limitations when deploying a model to Azure Container Instances.
- Secure an Azure Machine Learning Inferencing environment with virtual networks
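The init() and run(input_data) tips above can be sketched as a minimal entry script. The model here is a trivial placeholder for illustration; a real script would load your own model (for example, via Model.get_model_path()):

```python
import json
import traceback

model = None

def init():
    """Called once when the container starts; load the model here.

    Uncaught exceptions raised in init() surface as CrashLoopBackOff
    errors, so keep the logic minimal and log failures clearly.
    """
    global model
    # Placeholder: a real script would call Model.get_model_path('myModel')
    # and deserialize the model file found there.
    model = lambda rows: [sum(row) for row in rows]

def run(input_data):
    """Called once per scoring request.

    Wrapping the body in try/except means the caller receives a JSON
    error message instead of an opaque 5xx response when scoring fails.
    """
    try:
        data = json.loads(input_data)['data']
        return {'result': model(data)}
    except Exception:
        # Returning the traceback helps debug failed scoring requests.
        return {'error': traceback.format_exc()}
```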
Recommended Documents
Python SDK
Enterprise Readiness and Security