In our last update, we stood up Spring Boot microservices that our ESP32-S3 microcontrollers reach out to for model updates before downloading the new model. This sets us up nicely for flexible deploys of PyTorch models, and our next step is to implement the model runtime infrastructure to execute the models themselves!

Model Runtime Infrastructure

We cross-compiled the executorch runtime for the ESP32s and set up an executor runner that allocates the proper memory, creates inputs for the deployed model, and executes the model on these inputs.

Demo

We did a full demo of this project so far in the video below. This includes a demonstration of the previous two updates on this devlogs site, as well as the model runtime infrastructure:

Next Steps

Now we have:

Pipelines that export a model to an edge-compatible format,
ESP32-S3 microcontrollers downloading new model .pte files,
The ESP32-S3 microcontrollers running a deployed model with some predefined inputs.

Our next steps are integrating these three steps and clean up work. There will be updates to come!