<!--
Copyright (c) 2021 - present / Neuralmagic, Inc. All Rights Reserved.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

   http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->

# SparseML Keras Integration

This directory combines the SparseML recipe-driven approach with standard Keras training flows for models within
[Keras Applications](https://keras.io/api/applications/) and SparseML.
They are intended as example flows to show how to integrate sparsification techniques with a custom Keras training flow.
The techniques include, but are not limited to:
- Pruning
- Quantization
- Pruning + Quantization
- Sparse Transfer Learning

## Highlights

- Coming soon!

## Tutorials

- [Classification](https://github.com/neuralmagic/sparseml/blob/main/integrations/keras/notebooks/classification.ipynb)

## Installation

To begin, run `pip install sparseml[tf_keras]`

## Quick Tour

### Recipes

Recipes encode the instructions and hyperparameters for sparsifying a model using modifiers to the training process.
The modifiers can range from pruning and quantization to learning rate and weight decay.
When appropriately combined, it becomes possible to create highly sparse and accurate models.

This integration contains `--recipe` arguments appended to the Python scripts and `recipe` variables in the appropriate notebooks.
Popular recipes used with this argument are found in the [`recipes` folder](./recipes).

### SparseZoo

Pre-sparsified models and recipes can be downloaded through the [SparseZoo](https://github.com/neuralmagic/sparsezoo).

Complete lists are available online for all [models](https://sparsezoo.neuralmagic.com/tables/models/cv/classification?repo=sparseml&framework=keras) and 
[recipes](https://sparsezoo.neuralmagic.com/tables/recipes/cv/classification?repo=sparseml&framework=keras) compatible with this integration as well.

Sample code for retrieving a model from the SparseZoo:
```python
from sparsezoo import Model

model = Model("zoo:cv/classification/resnet_v1-50/keras/sparseml/imagenet/pruned-moderate")
print(model)
```

Sample code for retrieving a recipe from the SparseZoo:
```python
from sparsezoo import Model

model = Model("zoo:cv/classification/resnet_v1-50/keras/sparseml/imagenet/pruned-conservative")
recipe = model.recipes.default
print(recipe)
```

### Structure

The following table lays out the root-level files and folders along with a description for each.

| Folder/File Name     | Description                                                                                                           |
|----------------------|-----------------------------------------------------------------------------------------------------------------------|
| notebooks            | Jupyter notebooks to walk through sparsifiation within standard Keras training flows                                  |
| recipes              | Typical recipes for sparsifying Keras models along with any downloaded recipes from the SparseZoo.                    |
| tutorials            | Tutorial walkthroughs for how to sparsify Keras models using recipes.                                                 |
| classification.py    | Utility training script for sparsifying classification models in Keras (loads SparseML models).                       |
| prune_resnet20.py    | Simple example walking through pruning a ResNet-20 model on the Cifar dataset.                                        |
| README.md            | Readme file.                                                                                                          |

### Exporting for Inference

All scripts and notebooks end with an export to the [ONNX](https://onnx.ai/) file format.
The export process is modified such that the quantized and pruned models are corrected and folded properly.

The DeepSparse Engine accepts ONNX formats and is engineered to significantly speed up inference on CPUs for the sparsified models from this integration.
Examples for loading, benchmarking, and deploying can be found in the [DeepSparse repository here](https://github.com/neuralmagic/deepsparse/).
