You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/code_description.md
+8-2Lines changed: 8 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -20,15 +20,21 @@
20
20
21
21
### ML Services
22
22
23
-
-`ml_service/pipelines/build_train_pipeline.py` : builds and publishes an ML training pipeline.
24
-
-`ml_service/pipelines/run_train_pipeline.py` : invokes a published ML training pipeline via REST API.
23
+
-`ml_service/pipelines/build_train_pipeline.py` : builds and publishes an ML training pipeline. It uses Python on ML Compute.
24
+
-`ml_service/pipelines/build_train_pipeline_with_r.py` : builds and publishes an ML training pipeline. It uses R on ML Compute.
25
+
-`ml_service/pipelines/build_train_pipeline_with_r_on_dbricks.py` : builds and publishes an ML training pipeline. It uses R on Databricks Compute.
26
+
-`ml_service/pipelines/run_train_pipeline.py` : invokes a published ML training pipeline (Python on ML Compute) via REST API.
25
27
-`ml_service/util` : contains common utility functions used to build and publish an ML training pipeline.
26
28
27
29
### Code
28
30
29
31
-`code/training/train.py` : a training step of an ML training pipeline.
30
32
-`code/evaluate/evaluate_model.py` : an evaluating step of an ML training pipeline which registers a new trained model if evaluation shows the new model is more performant than the previous one.
31
33
-`code/evaluate/register_model.py` : (LEGACY) registers a new trained model if evaluation shows the new model is more performant than the previous one.
34
+
-`code/training/R/r_train.r` : training a model with R basing on a sample dataset (weight_data.csv).
35
+
-`code/training/R/train_with_r.py` : a python wrapper (ML Pipeline Step) invoking R training script on ML Compute
36
+
-`code/training/R/train_with_r_on_databricks.py` : a python wrapper (ML Pipeline Step) invoking R training script on Databricks Compute
37
+
-`code/training/R/weight_data.csv` : a sample dataset used by R script (r_train.r) to train a model
32
38
33
39
### Scoring
34
40
- code/scoring/score.py : a scoring script which is about to be packed into a Docker Image along with a model while being deployed to QA/Prod environment.
@@ -108,6 +110,8 @@ and checkout a published training pipeline in the **mlops-AML-WS** workspace in
108
110
109
111
Great, you now have the build pipeline setup, you can either manually trigger it or it gets automatically triggered everytime there is a change in the master branch. The pipeline performs linting, unit testing, builds and publishes an **ML Training Pipeline** in an **ML Workspace**
110
112
113
+
**Note:** The building pipeline contains disabled steps to build and publish ML pipelines using R to train a model. Enable these steps if you want to play with this approach. For the pipeline training a model with R on Databricks you have to manually create a Databricks cluster and attach it to the ML Workspace as a compute (Values DB_CLUSTER_ID and DATABRICKS_COMPUTE_NAME variables shoud be specified).
114
+
111
115
### 7. Train the Model
112
116
113
117
The next step is to invoke the training pipeline created in the previous step. It can be done with a **Release Pipeline**. Click on the Pipelines/Releases menu, and then **New pipeline**, and then click on "Empty Job" on the "Select a template" window that pops to the right:
0 commit comments