Skip to content

Commit 6622a6c

Browse files
authored
Merge pull request Azure#1 from Azure/master
merge latest changes from Azure/MLNB repo
2 parents 4fe8c17 + 6b19e2f commit 6622a6c

File tree

380 files changed

+22738
-36148
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

380 files changed

+22738
-36148
lines changed
Lines changed: 30 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,30 @@
1+
---
2+
name: Bug report
3+
about: Create a report to help us improve
4+
title: "[Notebook issue]"
5+
labels: ''
6+
assignees: ''
7+
8+
---
9+
10+
**Describe the bug**
11+
A clear and concise description of what the bug is.
12+
13+
Provide the following if applicable:
14+
+ Your Python & SDK version
15+
+ Python Scripts or the full notebook name
16+
+ Pipeline definition
17+
+ Environment definition
18+
+ Example data
19+
+ Any log files.
20+
+ Run and Workspace Id
21+
22+
**To Reproduce**
23+
Steps to reproduce the behavior:
24+
1.
25+
26+
**Expected behavior**
27+
A clear and concise description of what you expected to happen.
28+
29+
**Additional context**
30+
Add any other context about the problem here.
Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
---
2+
name: Notebook issue
3+
about: Describe your notebook issue
4+
title: "[Notebook] DESCRIPTIVE TITLE"
5+
labels: notebook
6+
assignees: ''
7+
8+
---
9+
10+
### DESCRIPTION: Describe clearly + concisely
11+
12+
13+
.
14+
### REPRODUCIBLE: Steps
15+
16+
17+
.
18+
### EXPECTATION: Clear description
19+
20+
21+
.
22+
### CONFIG/ENVIRONMENT:
23+
```Provide where applicable
24+
25+
## Your Python & SDK version:
26+
27+
## Environment definition:
28+
29+
## Notebook name or Python scripts:
30+
31+
## Run and Workspace Id:
32+
33+
## Pipeline definition:
34+
35+
## Example data:
36+
37+
## Any log files:
38+
39+
40+
41+
42+
43+
```

README.md

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,8 @@
22

33
This repository contains example notebooks demonstrating the [Azure Machine Learning](https://azure.microsoft.com/en-us/services/machine-learning-service/) Python SDK which allows you to build, train, deploy and manage machine learning solutions using Azure. The AML SDK allows you the choice of using local or cloud compute resources, while managing and maintaining the complete data science workflow from the cloud.
44

5-
![Azure ML workflow](https://raw.githubusercontent.com/MicrosoftDocs/azure-docs/master/articles/machine-learning/service/media/overview-what-is-azure-ml/aml.png)
5+
![Azure ML Workflow](https://raw.githubusercontent.com/MicrosoftDocs/azure-docs/master/articles/machine-learning/service/media/concept-azure-machine-learning-architecture/workflow.png)
6+
67

78
## Quick installation
89
```sh
@@ -38,6 +39,7 @@ The [How to use Azure ML](./how-to-use-azureml) folder contains specific example
3839
- [Machine Learning Pipelines](./how-to-use-azureml/machine-learning-pipelines) - Examples showing how to create and use reusable pipelines for training and batch scoring
3940
- [Deployment](./how-to-use-azureml/deployment) - Examples showing how to deploy and manage machine learning models and solutions
4041
- [Azure Databricks](./how-to-use-azureml/azure-databricks) - Examples showing how to use Azure ML with Azure Databricks
42+
- [Monitor Models](./how-to-use-azureml/monitor-models) - Examples showing how to enable model monitoring services such as DataDrift
4143

4244
---
4345
## Documentation
@@ -48,10 +50,14 @@ The [How to use Azure ML](./how-to-use-azureml) folder contains specific example
4850

4951
---
5052

53+
54+
## Community Repository
55+
Visit this [community repository](https://github.com/microsoft/MLOps/tree/master/examples) to find useful end-to-end sample notebooks. Also, please follow these [contribution guidelines](https://github.com/microsoft/MLOps/blob/master/contributing.md) when contributing to this repository.
56+
5157
## Projects using Azure Machine Learning
5258

5359
Visit following repos to see projects contributed by Azure ML users:
54-
60+
- [AMLSamples](https://github.com/Azure/AMLSamples) Number of end-to-end examples, including face recognition, predictive maintenance, customer churn and sentiment analysis.
5561
- [Fine tune natural language processing models using Azure Machine Learning service](https://github.com/Microsoft/AzureML-BERT)
5662
- [Fashion MNIST with Azure ML SDK](https://github.com/amynic/azureml-sdk-fashion)
5763

configuration.ipynb

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -103,7 +103,7 @@
103103
"source": [
104104
"import azureml.core\n",
105105
"\n",
106-
"print(\"This notebook was created using version 1.0.48\r\n of the Azure ML SDK\")\n",
106+
"print(\"This notebook was created using version 1.0.62 of the Azure ML SDK\")\n",
107107
"print(\"You are currently using version\", azureml.core.VERSION, \"of the Azure ML SDK\")"
108108
]
109109
},
File renamed without changes.

how-to-use-azureml/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ As a pre-requisite, run the [configuration Notebook](../configuration.ipynb) not
88
* [train-on-local](./training/train-on-local): Learn how to submit a run to local computer and use Azure ML managed run configuration.
99
* [train-on-amlcompute](./training/train-on-amlcompute): Use a 1-n node Azure ML managed compute cluster for remote runs on Azure CPU or GPU infrastructure.
1010
* [train-on-remote-vm](./training/train-on-remote-vm): Use Data Science Virtual Machine as a target for remote runs.
11-
* [logging-api](./training/logging-api): Learn about the details of logging metrics to run history.
11+
* [logging-api](./track-and-monitor-experiments/logging-api): Learn about the details of logging metrics to run history.
1212
* [register-model-create-image-deploy-service](./deployment/register-model-create-image-deploy-service): Learn about the details of model management.
1313
* [production-deploy-to-aks](./deployment/production-deploy-to-aks) Deploy a model to production at scale on Azure Kubernetes Service.
1414
* [enable-data-collection-for-models-in-aks](./deployment/enable-data-collection-for-models-in-aks) Learn about data collection APIs for deployed model.

how-to-use-azureml/automated-machine-learning/README.md

Lines changed: 20 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -155,11 +155,11 @@ jupyter notebook
155155
- [auto-ml-subsampling-local.ipynb](subsampling/auto-ml-subsampling-local.ipynb)
156156
- How to enable subsampling
157157

158-
- [auto-ml-dataprep.ipynb](dataprep/auto-ml-dataprep.ipynb)
159-
- Using DataPrep for reading data
158+
- [auto-ml-dataset.ipynb](dataprep/auto-ml-dataset.ipynb)
159+
- Using Dataset for reading data
160160

161-
- [auto-ml-dataprep-remote-execution.ipynb](dataprep-remote-execution/auto-ml-dataprep-remote-execution.ipynb)
162-
- Using DataPrep for reading data with remote execution
161+
- [auto-ml-dataset-remote-execution.ipynb](dataprep-remote-execution/auto-ml-dataset-remote-execution.ipynb)
162+
- Using Dataset for reading data with remote execution
163163

164164
- [auto-ml-classification-with-whitelisting.ipynb](classification-with-whitelisting/auto-ml-classification-with-whitelisting.ipynb)
165165
- Dataset: scikit learn's [digit dataset](http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_digits.html#sklearn.datasets.load_digits)
@@ -175,10 +175,19 @@ jupyter notebook
175175
- Example of training an automated ML forecasting model on multiple time-series
176176

177177
- [auto-ml-classification-with-onnx.ipynb](classification-with-onnx/auto-ml-classification-with-onnx.ipynb)
178-
- Dataset: scikit learn's [digit dataset](http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_digits.html#sklearn.datasets.load_digits)
178+
- Dataset: scikit learn's [iris dataset](http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_iris.html)
179179
- Simple example of using automated ML for classification with ONNX models
180180
- Uses local compute for training
181181

182+
- [auto-ml-remote-amlcompute-with-onnx.ipynb](remote-amlcompute-with-onnx/auto-ml-remote-amlcompute-with-onnx.ipynb)
183+
- Dataset: scikit learn's [iris dataset](http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_iris.html)
184+
- Example of using automated ML for classification using remote AmlCompute for training
185+
- Train the models with ONNX compatible config on
186+
- Parallel execution of iterations
187+
- Async tracking of progress
188+
- Cancelling individual iterations or entire run
189+
- Retrieving the ONNX models and do the inference with them
190+
182191
- [auto-ml-bank-marketing-subscribers-with-deployment.ipynb](bank-marketing-subscribers-with-deployment/auto-ml-bank-marketing-with-deployment.ipynb)
183192
- Dataset: UCI's [bank marketing dataset](https://www.kaggle.com/janiobachmann/bank-marketing-dataset)
184193
- Simple example of using automated ML for classification to predict term deposit subscriptions for a bank
@@ -220,7 +229,7 @@ The main code of the file must be indented so that it is under this condition.
220229
2. Check that you have conda 64-bit installed rather than 32-bit. You can check this with the command `conda info`. The `platform` should be `win-64` for Windows or `osx-64` for Mac.
221230
3. Check that you have conda 4.4.10 or later. You can check the version with the command `conda -V`. If you have a previous version installed, you can update it using the command: `conda update conda`.
222231
4. On Linux, if the error is `gcc: error trying to exec 'cc1plus': execvp: No such file or directory`, install build essentials using the command `sudo apt-get install build-essential`.
223-
5. Pass a new name as the first parameter to automl_setup so that it creates a new conda environment. You can view existing conda environments using `conda env list` and remove them with `conda env remove -n <environmentname>`.
232+
5. Pass a new name as the first parameter to automl_setup so that it creates a new conda environment. You can view existing conda environments using `conda env list` and remove them with `conda env remove -n <environmentname>`.
224233

225234
## automl_setup_linux.sh fails
226235
If automl_setup_linux.sh fails on Ubuntu Linux with the error: `unable to execute 'gcc': No such file or directory`
@@ -255,13 +264,13 @@ Some Windows environments see an error loading numpy with the latest Python vers
255264
Check the tensorflow version in the automated ml conda environment. Supported versions are < 1.13. Uninstall tensorflow from the environment if version is >= 1.13
256265
You may check the version of tensorflow and uninstall as follows
257266
1) start a command shell, activate conda environment where automated ml packages are installed
258-
2) enter `pip freeze` and look for `tensorflow` , if found, the version listed should be < 1.13
259-
3) If the listed version is a not a supported version, `pip uninstall tensorflow` in the command shell and enter y for confirmation.
267+
2) enter `pip freeze` and look for `tensorflow` , if found, the version listed should be < 1.13
268+
3) If the listed version is a not a supported version, `pip uninstall tensorflow` in the command shell and enter y for confirmation.
260269

261-
## Remote run: DsvmCompute.create fails
270+
## Remote run: DsvmCompute.create fails
262271
There are several reasons why the DsvmCompute.create can fail. The reason is usually in the error message but you have to look at the end of the error message for the detailed reason. Some common reasons are:
263272
1) `Compute name is invalid, it should start with a letter, be between 2 and 16 character, and only include letters (a-zA-Z), numbers (0-9) and \'-\'.` Note that underscore is not allowed in the name.
264-
2) `The requested VM size xxxxx is not available in the current region.` You can select a different region or vm_size.
273+
2) `The requested VM size xxxxx is not available in the current region.` You can select a different region or vm_size.
265274

266275
## Remote run: Unable to establish SSH connection
267276
Automated ML uses the SSH protocol to communicate with remote DSVMs. This defaults to port 22. Possible causes for this error are:
@@ -287,4 +296,4 @@ To resolve this issue, allocate a DSVM with more memory or reduce the value spec
287296

288297
## Remote run: Iterations show as "Not Responding" in the RunDetails widget.
289298
This can be caused by too many concurrent iterations for a remote DSVM. Each concurrent iteration usually takes 100% of a core when it is running. Some iterations can use multiple cores. So, the max_concurrent_iterations setting should always be less than the number of cores of the DSVM.
290-
To resolve this issue, try reducing the value specified for the max_concurrent_iterations setting.
299+
To resolve this issue, try reducing the value specified for the max_concurrent_iterations setting.

how-to-use-azureml/automated-machine-learning/automl_env.yml

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,14 @@ dependencies:
1313
- scikit-learn>=0.19.0,<=0.20.3
1414
- pandas>=0.22.0,<=0.23.4
1515
- py-xgboost<=0.80
16+
- pyarrow>=0.11.0
1617

1718
- pip:
1819
# Required packages for AzureML execution, history, and data preparation.
19-
- azureml-sdk[automl,explain]
20+
- azureml-defaults
21+
- azureml-train-automl
2022
- azureml-widgets
23+
- azureml-explain-model
24+
- azureml-contrib-explain-model
2125
- pandas_ml
2226

how-to-use-azureml/automated-machine-learning/automl_env_mac.yml

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,7 @@ name: azure_automl
22
dependencies:
33
# The python interpreter version.
44
# Currently Azure ML only supports 3.5.2 and later.
5+
- pip
56
- nomkl
67
- python>=3.5.2,<3.6.8
78
- nb_conda
@@ -13,10 +14,14 @@ dependencies:
1314
- scikit-learn>=0.19.0,<=0.20.3
1415
- pandas>=0.22.0,<0.23.0
1516
- py-xgboost<=0.80
17+
- pyarrow>=0.11.0
1618

1719
- pip:
1820
# Required packages for AzureML execution, history, and data preparation.
19-
- azureml-sdk[automl,explain]
21+
- azureml-defaults
22+
- azureml-train-automl
2023
- azureml-widgets
24+
- azureml-explain-model
25+
- azureml-contrib-explain-model
2126
- pandas_ml
2227

0 commit comments

Comments
 (0)