Skip to content

Key Concepts

If you enjoy learning by following examples, we recommend diving straight into the tutorials section. All of the tutorials and demos can be found in our Github repo.

This section introduces some of the key concepts in Plugboard and how to apply them when building your own models.

Terminology

Components

The basic building blocks of Plugboard models are Component objects. Typical uses of components are:

  • Loading data into and saving data out of models;
  • Preparing data and/or running calculations;
  • Modelling a particular physical entity, e.g. within a model of a factory production line, you might define separate components representing the conveyor belts, sensors and workstations;
  • Making calls to external systems, e.g. fetching data from the internet, calling an LLM, spinning up a subprocess to run some third-party simulation software.

When implementing your own components, you will need to:

  • Subclass the base Component;
  • Specify its inputs and ouputs using an IOController;
  • Define a step() method the executes the main logic of your component for a single step; and
  • Optionally define an init() method to do any required preparatory steps before the model in run.
  • In the case of event based models, define custom Event subclasses and corresponding event handler methods decorated with Event.handler.

Connectors

Data flows between components via Connectors. You will need to use these to tell Plugboard how data should flow between your components. Your overall model might be very simple, for example:

However Plugboard supports much more complex model structures, including looping and branching:

For models with explicitly declared input and output fields, connectors for each input-output pair must be defined explicitly using one of the Connector implementations. Connectors required for any events used in the model will be created for you automatically.

Processes

Components and connectors are collected together under a Process. This top-level class takes care of starting the model and running it until completion. Model execution is broken down into discrete steps. Running a model means executing all steps for each component until all of the available data has flowed through the model. The step() method advances the model forward by a single step. The run() method will repeatedly call step() until completion.

Plugboard supports both bounded and unbounded data streams. That is to say, you can either run a model with a fixed size input data set until completion, or run a model indefinitely which will continuously process new inputs as they arrive until a shutdown signal is received.

Plugboard uses Python's asynchronous concurrency to schedule execution of each of the components. Don't worry if asynchronous Python is unfamiliar to you: Plugboard takes care of all the details, so that you can focus on the logic of your model.

These Process classes are currently available which can be extended with custom implementations:

  • LocalProcess runs models in a single Python process on your computer. This is useful for initial development, and is often sufficient for models are not computationally demanding.
  • RayProcess allows you to execute components on different Python processes using the Ray Framework. This supports parallel computation on a single machine and scales out to large-scale compute clusters.

Running models

Most models start out life in a Jupyter notebook or Python script. Later on in development you may choose to convert the Plugboard process definition to a YAML file. This allows you to:

  • Separate your model code from its configuration;
  • Run the model via Plugboard's CLI - see plugboard process run --help for details;
  • Transfer your model to a cluster to take advantage of larger-scale computational resources.