Building Serverless Event Driven Apps with Knative and Python
6 min read
Knative is an extension to Kubernetes that provides a set of tools to deploy serverless workload.
Knative is built by the following major components:
- Build — Source-to-container build orchestration.
- Eventing — Management and delivery of events.
- Serving — Request-driven compute that can scale to zero.
In this article, we will focus on the Eventing component. It provides infrastructure for serverless event driven architecture which runs your container when an event is fired. Knative Eventing supports different event sources (such as GitHub events, Pub/Sub, etc) and you can even write your own custom event source.
We chose Knative instead of other traditional serverless platforms (Functions, Lambda, etc) for the following reasons:
- It is powered by containers, meaning you can write code however you like, in every language or framework, with every module you need as long as you can make a container out of it. Usually, other serverless platforms limit you to specific runtime environments which they support.
- Resources are fully customizable (CPU, GPU, networking, etc…), as long as your Kubernetes cluster is equipped with the relevant resources you are good to go. Just like every Kubernetes deployment, you have to state what resources you need. Other serverless solutions limit the resources, these limitations are often a no-go for us.
- No timeout limit so you can deploy long-running processes (data analysis, image processing and what not). This is impossible to achieve with other serverless platforms as they usually have a timeout limit.
- The Eventing component uses cloudevents as the spec for publishing the events. Basically, it helps us writing an event source agnostic code. We just have to understand the payload of the event but do not have to integrate directly with the event source or parse the event differently.
However, it is not all roses and butterflies and we have to also mention the downside. Knative is still in a very early stage (as of today the latest version is 0.6), meaning the API may change along the way and upgrades may break your deployments. Compared to other serverless solution it is more cumbersome to write all these Kubernetes YAMLs and install everything.
This is why we isolate Knative to its own cluster and do not mix our day-to-day cluster with the Knative one. We connect relevant services with VPC and load balancers.
Knative Eventing and Python
Our primary use case for using Knative is to run long CPU-bound data analysis processes when new data arrives in the system. As most data analysis tasks, they were written in Python by our data science team. Unfortunately, Knative Eventing examples are in Go and the only library available for easily bootstrapping a Knative Eventing project is again Go. We understood that we have to create a similar open-source library for Python so others may utilize it as well. After a bit of research and digging into the Go library code, we realized that all we need to implement is an HTTP server which listens to POST requests and parse the cloudevents payload. Behind the scenes, Knative Eventing utilizes Knative Serving and upon an event is fired a POST request to the Knative Eventing container. We are glad to share our open source project, python-kncloudevents, for easily integrating Python and Knative Eventing.
The usage of python-kncloudevents is very simple, All you need to do is to write a function that receives cloudevents as a parameter. To access the event data you can just call the Data function of the cloudevent. Now to run your function when the event is fired all you need to do is to create CloudeventsServer instance and run start_receiver function that accepts your function.
Knative is great for building serverless workload that needs flexibility in resources, runtime and duration. Knative allows building event driven architecture agnostic to your dependencies and environment.
If you need to run event based processes which demand a customized environment, Knative is the right choice for you. If you want to build quick and simple functions, the managed serverless platforms like Functions or Lambda might fit you as well.