Generating multimedia from Python

Using the right libraries, it is not very hard to generate multimedia from Python.

Pycairo is a Python library that provides bindings for cairo, a 2D graphics library which supports many output devices.

Getting started

Installation is as simple as pip install pycairo.

For drawing, we first have to create a surface. The most useful surfaces are

After drawing, surface.finish() should be called. Alternatively, surfaces can be used as context managers:

with cairo.SVGSurface("example.svg", 200, 200) as surface:
    # draw stuff here

Before we can draw, we have to create a context from the surface:

ctx = cairo.Context(surface)

Drawing examples

Pycairo is stateful and has the concepts of paths and subpaths. I will not try to accurately describe these, but rather show some examples.

Setting the background color to white:

ctx.set_source_rgb(1, 1, 1)
ctx.paint()

Drawing a red line:

ctx.set_source_rgb(1, 0, 0)
ctx.set_line_width(5)
ctx.move_to(10, 50)
ctx.line_to(256, 320)
ctx.stroke()

Drawing a yellow polygon:

ctx.set_source_rgb(1, 1, 0)
ctx.move_to(400, 100)
ctx.line_to(500, 50)
ctx.line_to(600, 100)
ctx.line_to(550, 200)
ctx.close_path()
ctx.fill()

Drawing a green rectangle:

ctx.set_source_rgb(0, 1, 1)
ctx.rectangle(50, 100, 150, 100)
ctx.fill()

Drawing a blue circle:

ctx.set_source_rgb(0, 0, 1)
ctx.arc(300, 150, 50, 0, 2 * 3.1416)
ctx.fill()

Drawing text:

ctx.set_source_rgb(0, 0, 0)
ctx.select_font_face("Sans",
    cairo.FONT_SLANT_NORMAL, cairo.FONT_WEIGHT_BOLD)
ctx.set_font_size(32)
ctx.move_to(100, 300)
ctx.show_text("Hello, Cairo!")
ctx.stroke()

Note the following:

It is also possible to manipulate the image data of an ImageSurface directly.

from PIL import Image
import cairo

w, h = 256, 256
with cairo.ImageSurface(cairo.FORMAT_ARGB32, w, h) as surface:
    data = surface.get_data()
    for y in range(0, h):
        for x in range(0, w):
            index = y * surface.get_stride() + x * 4
            c = 0 if (x-128)**2 + (y-128)**2 < 10000 else 255
            data[index] = c        # R
            data[index + 1] = c    # G
            data[index + 2] = c    # B
            data[index + 3] = 255  # A

    img = Image.frombytes("RGBA", (w, h), surface.get_data())

img.show()
img.close()

Exporting the data

Exporting an image

Some surfaces (PDFSurface, PSSurface, SVGSurface) are explicitly created to write to a file. Any variable surface that is an instance of the Surface class can also export the surface to a PNG by doing surface.write_to_png("image.png").

Exporting to a SVG file:

import cairo

with cairo.SVGSurface("output.svg", 128, 128) as surface:
    ctx = cairo.Context(surface)

    # draw a blue circle
    ctx.set_source_rgb(0, 0, 1)
    ctx.arc(128, 128, 100, 0, 2 * 3.1416)
    ctx.fill()

Using an ImageSurface and exporting to a PNG:

import cairo

w, h = 256, 256
with cairo.ImageSurface(cairo.FORMAT_ARGB32, w, h) as surface:
    ctx = cairo.Context(surface)

    # draw a blue circle
    ctx.set_source_rgb(0, 0, 1)
    ctx.arc(128, 128, 100, 0, 2 * 3.1416)
    ctx.fill()

	ctx.write_to_png("output.png")

If we want to show the image from Python, we have to use an external library. For example, using Pillow (pip install pillow), we can do

from PIL import Image
import cairo

w, h = 256, 256
with cairo.ImageSurface(cairo.FORMAT_ARGB32, w, h) as surface:
    ctx = cairo.Context(surface)

    # draw a blue circle
    ctx.set_source_rgb(0, 0, 1)
    ctx.arc(128, 128, 100, 0, 2 * 3.1416)
    ctx.fill()

    img = Image.frombytes("RGBA", (w, h), surface.get_data())

img.show()
img.close()

Exporting a video

For this, we can use opencv-python, which is a library that provides Python bindings for OpenCV. It expects frame as a numpy array, so we’ll need that library as well:

pip install opencv-python numpy

Now, to output video, we create a VideoWriter:

fourcc = cv2.VideoWriter_fourcc(*'mp4v')
videowriter = cv2.VideoWriter(
	'output_video.mp4', fourcc, fps, (w, h))

For context: A “fourcc” is an abbreviation for a “four character code” that identifies a codec.

Now, we can write a frame with

buf = surface.get_data()
img = np.frombuffer(buf, dtype=np.uint8).reshape((h, w, 4))
img = cv2.cvtColor(img, cv2.COLOR_BGRA2BGR)

Once we are done, we call videowriter.release() to release the video writer, and our output video is ready.

Complete example:

import cairo
import cv2
import numpy as np

w, h = 640, 480
fps = 60
num_frames = 300

fourcc = cv2.VideoWriter_fourcc(*"mp4v")
videowriter = cv2.VideoWriter(
	"output.mp4", fourcc, fps, (w, h))

for frame in range(num_frames):
    surface = cairo.ImageSurface(cairo.FORMAT_ARGB32, w, h)
    ctx = cairo.Context(surface)
    
    # white background
    ctx.set_source_rgb(1, 1, 1)
    ctx.paint()

    # moving red line
    ctx.set_source_rgb(1, 0, 0)
    ctx.set_line_width(5)
    ctx.move_to(50 + frame * 2 , 50)
    ctx.line_to(200 + frame * 2, 50)
    ctx.stroke()

    # convert Cairo surface to OpenCV format
    buf = surface.get_data()
    img = np.frombuffer(buf, dtype=np.uint8).reshape((h, w, 4))
    img = cv2.cvtColor(img, cv2.COLOR_BGRA2BGR)

    videowriter.write(img)

videowriter.release()