Skip to content

Commit

Permalink
Update e2e files for new project (#99)
Browse files Browse the repository at this point in the history
  • Loading branch information
safoinme committed Apr 25, 2024
1 parent 9619569 commit 1972552
Show file tree
Hide file tree
Showing 42 changed files with 2,462 additions and 0 deletions.
Binary file added e2e/.assets/00_pipelines_composition.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
2 changes: 2 additions & 0 deletions e2e/.dockerignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,2 @@
.venv*
.requirements*
15 changes: 15 additions & 0 deletions e2e/LICENSE
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
Apache Software License 2.0

Copyright (c) ZenML GmbH 2024. All rights reserved.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
24 changes: 24 additions & 0 deletions e2e/Makefile
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
stack_name ?= e2e_template_stack
setup:
pip install -r requirements.txt
zenml integration install aws sklearn mlflow slack evidently kubeflow kubernetes -y

install-stack-local:
@echo "Specify stack name [$(stack_name)]: " && read input && [ -n "$$input" ] && stack_name="$$input" || stack_name="$(stack_name)" && \
zenml experiment-tracker register -f mlflow mlflow_local_$${stack_name} && \
zenml model-registry register -f mlflow mlflow_local_$${stack_name} && \
zenml model-deployer register -f mlflow mlflow_local_$${stack_name} && \
zenml data-validator register -f evidently evidently_$${stack_name} && \
zenml stack register -a default -o default -r mlflow_local_$${stack_name} \
-d mlflow_local_$${stack_name} -e mlflow_local_$${stack_name} -dv \
evidently_$${stack_name} $${stack_name} && \
zenml stack set $${stack_name} && \
zenml stack up


install-local-stack:
zenml experiment-tracker register -f mlflow local_mlflow_tracker && \
zenml data-validator register -f evidently evidently_data_validator && \
zenml model-deployer register bentoml_deployer --flavor=bentoml && \
zenml stack register -a default -o default -e mlflow_local -dv evidently_local -d bentoml_deployer local_stack && \
zenml stack set local_stack
233 changes: 233 additions & 0 deletions e2e/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,233 @@
# ZenML E2E project

This is a comprehensive supervised ML project built with the
ZenML framework and its integration. The project trains one or more
scikit-learn classification models to make predictions on the tabular
classification datasets provided by the scikit-learn library. The project was
generated from the [E2E Batch ZenML project template](https://github.com/zenml-io/template-e2e-batch)
with the following properties:
- Project name: ZenML E2E project
- Technical Name: e2e_use_case
- Version: `0.0.1`
- Licensed with apache to ZenML GmbH<>
- Deployment environment: `staging`

Settings of your project are:
- Hyperparameters and model architecture tuning using configuration from `config.py`
- Trained model promotion to `staging` based on accuracy metric vs currently deployed model
- Data drift checks based on Evidently report
- Notifications about failures enabled

> [!NOTE]
> If you are coming from [our production guide](https://docs.zenml.io/user-guide/production-guide),
> you can apply the same principles that you have seen in the guide to this project.
> You can [connect it to remote storage](https://docs.zenml.io/user-guide/production-guide/remote-storage),
> [run it with a cloud orchestrator](https://docs.zenml.io/user-guide/production-guide/cloud-orchestration),
> [attach a git repository to it](https://docs.zenml.io/user-guide/production-guide/connect-code-repository),
> and much more. If you are looking to try these steps with a simpler example,
> feel free to take a look at [our starter template](https://github.com/zenml-io/template-starter)
> as well.
## 👋 Introduction

Welcome to your newly generated "ZenML E2E project" project! This is
a great way to get hands-on with ZenML using production-like template.
The project contains a collection of standard and custom ZenML steps,
pipelines and other artifacts and useful resources that can serve as a
solid starting point for your smooth journey with ZenML.

What to do first? You can start by giving the project a quick run. The
project is ready to be used and can run as-is without any further code
changes! You can try it right away by installing ZenML, the needed
ZenML integration and then calling the CLI included in the project. We also
recommend that you start the ZenML UI locally to get a better sense of what
is going on under the hood:

```bash
# Set up a Python virtual environment, if you haven't already
python3 -m venv .venv
source .venv/bin/activate
# Install requirements & integrations
make setup
# Optionally, provision default local stack
make install-stack-local
# Start the ZenML UI locally (recommended, but optional);
# the default username is "admin" with an empty password
zenml up
# Run the pipeline included in the project
python run.py
```

When the pipelines are done running, you can check out the results in the ZenML
UI by following the link printed in the terminal (or you can go straight to
the [ZenML UI pipelines run page](http://127.0.0.1:8237/workspaces/default/all-runs?page=1).

Next, you should:

* look at the CLI help to see what you can do with the project:
```bash
python run.py --help
```
* go back and [try out different parameters](https://github.com/zenml-io/template-e2e-batch#-template-parameters)
for your generated project. For example, you could disable hyperparameters
tuning and use your favorite model architecture or promote every trained model,
if you haven't already!
* take a look at [the project structure](#-project-structure) and the code
itself. The code is heavily commented and should be easy to follow.
* read the [ZenML documentation](https://docs.zenml.io) to learn more about
various ZenML concepts referenced in the code and to get a better sense of
what you can do with ZenML.
* start building your own ZenML project by modifying this code

## 📦 What's in the box?

The ZenML E2E project project demonstrates how the most important steps of
the ML Production Lifecycle can be implemented in a reusable way remaining
agnostic to the underlying infrastructure, and how to integrate them together
into pipelines serving Training and Batch Inference purposes.

This template uses
[the Breast Cancer Dataset](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_breast_cancer.html)
to demonstrate how to perform major critical steps for Continuous Training (CT)
and Continuous Delivery (CD).

It consists of three pipelines with the following high-level setup:
<p align="center">
<img height=800 src=".assets/00_pipelines_composition.png">
</p>

All pipelines are leveraging the Model Control Plane to bring all parts together - the training pipeline creates and promotes a new Model Control Plane version with a trained model object in it, deployment pipeline uses the inference Model Control Plane version (the one promoted during training) to create a deployment service and inference pipeline using deployment service from the inference Model Control Plane version and store back new set of predictions as a versioned data artifact for future use. This makes those pipelines closely connected while ensuring that only quality-assured Model Control Plane versions are used to produce predictions delivered to stakeholders.
* [CT] Training
* Load, split, and preprocess the training dataset
* Search for an optimal model object architecture and tune its hyperparameters
* Train the model object and evaluate its performance on the holdout set
* Compare a recently trained model object with one promoted earlier
* If a recently trained model object performs better - stage it as a new inference model object in model registry
* On success of the current model object - stage newly created Model Control Plane version as the one used for inference
* [CD] Deployment
* Deploy a new prediction service based on the model object connected to the inference Model Control Plane version.
* [CD] Batch Inference
* Load the inference dataset and preprocess it reusing object fitted during training
* Perform data drift analysis reusing training dataset of the inference Model Control Plane version as a reference
* Run predictions using a model object from the inference Model Control Plane version
* Store predictions as an versioned artifact and link it to the inference Model Control Plane version

In [the repository documentation](https://github.com/zenml-io/template-e2e-batch#-how-this-template-is-implemented),
you can find more details about every step of this template.

The project code is meant to be used as a template for your projects. For
this reason, you will find several places in the code specifically marked
to indicate where you can add your code:

```python
### ADD YOUR OWN CODE HERE - THIS IS JUST AN EXAMPLE ###
...
### YOUR CODE ENDS HERE ###
```

## 📜 Project Structure

The project loosely follows [the recommended ZenML project structure](https://docs.zenml.io/user-guide/starter-guide/follow-best-practices):

```
.
├── configs # pipelines configuration files
│ ├── deployer_config.yaml # the configuration of the deployment pipeline
│ ├── inference_config.yaml # the configuration of the batch inference pipeline
│ └── train_config.yaml # the configuration of the training pipeline
├── pipelines # `zenml.pipeline` implementations
│ ├── batch_inference.py # [CD] Batch Inference pipeline
│ ├── deployment.py # [CD] Deployment pipeline
│ └── training.py # [CT] Training Pipeline
├── steps # logically grouped `zenml.steps` implementations
│ ├── alerts # alert developer on pipeline status
│ ├── deployment # deploy trained model objects
│ ├── data_quality # quality gates built on top of drift report
│ ├── etl # ETL logic for dataset
│ ├── hp_tuning # tune hyperparameters and model architectures
│ ├── inference # inference on top of the model from the registry
│ ├── promotion # find if a newly trained model will be new inference
│ └── training # train and evaluate model
├── utils # helper functions
├── .dockerignore
├── Makefile # helper scripts for quick start with integrations
├── README.md # this file
├── requirements.txt # extra Python dependencies
└── run.py # CLI tool to run pipelines on ZenML Stack
```



## Guide to project:

1- Create a virtual environment and activate it:
```bash
python3 -m venv .venv
source .venv/bin/activate
```

2- Install the requirements and integrations:
```bash
pip install zenml
```

3- Register the local stack components and local stack:
```bash
zenml experiment-tracker register -f mlflow local_mlflow_tracker && \
zenml data-validator register -f evidently evidently_data_validator && \
zenml model-deployer register bentoml_deployer --flavor=bentoml && \
zenml stack register -a default -o default -e mlflow_local -dv evidently_local -d bentoml_deployer local_stack && \
zenml stack set local_stack

# or use the Makefile
make install-stack-local
```

4- Let's run the pipeline:
```bash
python run.py --training
```

5- Register a GCP Service Connector using service account credentials:
```bash
zenml service-connector register gcp_service_connector --type gcp --auth-method service-account --project_id=zenml-core [email protected]
```

6- Register gcp artifact store:
```bash
zenml artifact-store register gcp_artifact_store --flavor gcp --path=gs://zenml-internal-artifact-store --connector=gcp_service_connector
```

7- We need a remote experiment tracker if we want to use a remote artifact store:
```bash
# To do this in a proper way, we need first to register a secret that would hold the credentials for the remote experiment tracker
zenml secret create mlflow_secret --username=admin --password=nBwwc5yrw8

# Once the secret is created, we can register the remote experiment tracker
zenml experiment-tracker register mlflow_experiment_tracker --flavor=mlflow --tracking_uri=https://d13d987c-mlflow.cloudinfra.zenml.io --tracking_username={{mlflow_secret.username}} --tracking_password={{mlflow_secret.password}}
```

8- Now we can either update the local stack to use the remote artifact store or create a new stack that uses the remote artifact store:
```bash
zenml stack update local_stack -a gcp_artifact_store -e mlflow_experiment_tracker
```

9- Let's run the pipeline:
```bash
python run.py --training
```

10- Let's move this to the cloud, and use vertex AI as an orchestrator:
```bash
zenml orchestrator register vertex_orchestrator \
--flavor=vertex \
--location=europe-west3 \
--synchronous=true \
--workload_service_account=vertex-ai-sa@zenml-core.iam.gserviceaccount.com
```

11- Let's create a new stack that uses the remote artifact store and the vertex orchestrator:
```bash
zenml stack create -a gcp_artifact_store -e mlflow_experiment_tracker -o vertex_orchestrator -d bentoml_deployer -dv evidently_local vertex_stack
```

43 changes: 43 additions & 0 deletions e2e/configs/deployer_config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,43 @@
# Apache Software License 2.0
#
# Copyright (c) ZenML GmbH 2024. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#

# environment configuration
settings:
docker:
required_integrations:
- aws
- evidently
- mlflow
- sklearn
- slack
- bentoml

# configuration of steps
steps:
notify_on_success:
parameters:
notify_on_success: False

# configuration of the Model Control Plane
model:
name: e2e_use_case
version: staging

# pipeline level extra configurations
extra:
notify_on_failure: True

42 changes: 42 additions & 0 deletions e2e/configs/inference_config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
# Apache Software License 2.0
#
# Copyright (c) ZenML GmbH 2024. All rights reserved.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#

# environment configuration
settings:
docker:
required_integrations:
- gcp
- evidently
- mlflow
- sklearn
- slack

# configuration of steps
steps:
notify_on_success:
parameters:
notify_on_success: False

# configuration of the Model Control Plane
model:
name: e2e_use_case
version: staging

# pipeline level extra configurations
extra:
notify_on_failure: True

Loading

0 comments on commit 1972552

Please sign in to comment.