Introduction to Video Processing
Video processing refers to the manipulation and analysis of video data. It is a critical part of computer vision, enabling applications such as automated surveillance, video-based recognition, and video stabilization. Python offers powerful libraries, most notably OpenCV, that make these tasks efficient and approachable.
In this post, we will explore how to handle videos using Python and OpenCV, covering fundamental techniques like reading a video file, applying filters, detecting edges, and saving the processed video back to disk.
Setting Up Your Environment
Before we dive into coding, ensure you have a Python environment set up. You can install OpenCV via pip if you haven't already:
pip install opencv-python
You might also want to install NumPy (useful for numerical operations):
pip install numpy
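To confirm that both packages installed correctly, you can run a quick version check (a minimal sanity check, nothing more):

import cv2
import numpy as np

# Print the installed versions to confirm the environment is ready
print('OpenCV version:', cv2.__version__)
print('NumPy version:', np.__version__)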
Reading a Video File
The first step in video processing is to read a video file. Let’s create a simple script that opens a video file and displays it frame by frame.
import cv2

# Load the video
video_path = 'input_video.mp4'  # Change this path to your video file
cap = cv2.VideoCapture(video_path)

while cap.isOpened():
    ret, frame = cap.read()  # Read a frame
    if not ret:
        break  # If no frame is returned, we end the loop

    cv2.imshow('Video Frame', frame)  # Display the frame

    # Break the loop on 'q' key press
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()  # Release the video capture
cv2.destroyAllWindows()  # Close all OpenCV windows
Explanation
- We import the OpenCV library.
- Open the desired video using cv2.VideoCapture().
- Loop through the video frames while the capture is open (cap.isOpened()).
- Use cap.read() to read each frame; if there’s no frame left, we break out of the loop.
- Show the frame using cv2.imshow().
- Finally, wait for a key press to exit the loop and release resources.
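The same loop also works for a live camera feed. Here is a minimal sketch, assuming a default webcam is available at device index 0 (adjust the index if you have more than one camera attached):

import cv2

# Device index 0 usually maps to the default webcam (an assumption;
# change the index if you have multiple cameras)
cap = cv2.VideoCapture(0)

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    cv2.imshow('Webcam Frame', frame)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()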
Processing Video Frames
Now that we can load and display a video, let’s process each frame with a common operation: edge detection. We will apply the Canny edge detection algorithm, which highlights the edges within each frame.
import cv2

video_path = 'input_video.mp4'
cap = cv2.VideoCapture(video_path)

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    # Convert the frame to grayscale
    gray_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    # Apply Canny Edge Detection
    edges = cv2.Canny(gray_frame, 100, 200)

    # Display original and processed frames
    cv2.imshow('Original Frame', frame)
    cv2.imshow('Edges', edges)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()
Explanation
- We read each frame just like before.
- Convert the color frame to grayscale using cv2.cvtColor().
- Apply Canny edge detection with cv2.Canny(gray_frame, 100, 200). The two threshold arguments define the lower and upper bounds for edge detection.
- Display the original and processed frames in separate windows (a way to combine them into a single side-by-side view is sketched below).
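If you prefer one window instead of two, one option is to convert the edge map back to BGR and stack it next to the original frame with NumPy. This is a small sketch assuming NumPy is installed, not the only way to do it:

import cv2
import numpy as np

cap = cv2.VideoCapture('input_video.mp4')

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    gray_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray_frame, 100, 200)

    # The edge map is single-channel, so convert it back to BGR before
    # stacking it horizontally next to the original frame
    edges_bgr = cv2.cvtColor(edges, cv2.COLOR_GRAY2BGR)
    combined = np.hstack((frame, edges_bgr))

    cv2.imshow('Original | Edges', combined)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()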
Saving Processed Video
Once we process video frames, we may want to save them into a new video file. Let’s modify our code to save the edge-detected frames.
import cv2

video_path = 'input_video.mp4'
output_path = 'output_video.avi'
cap = cv2.VideoCapture(video_path)

# Get width, height, and frames per second of the input video
width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
fps = int(cap.get(cv2.CAP_PROP_FPS))

# Define the codec and create VideoWriter object
fourcc = cv2.VideoWriter_fourcc(*'XVID')
out = cv2.VideoWriter(output_path, fourcc, fps, (width, height))

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    gray_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray_frame, 100, 200)

    # Write the frame to the new video (converted back to BGR)
    out.write(cv2.cvtColor(edges, cv2.COLOR_GRAY2BGR))

    cv2.imshow('Edges', edges)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
out.release()  # Release the VideoWriter
cv2.destroyAllWindows()
Explanation
- We create a VideoWriter object using cv2.VideoWriter(). This requires the output file name, a codec (from cv2.VideoWriter_fourcc()), the frame rate, and the frame size (see the note on codecs below).
- Each processed frame is written to the output video with out.write(). Since the output video expects frames in BGR format, we convert the grayscale edge map back to BGR for compatibility.
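The XVID codec pairs with the .avi container used above. If you would rather write an MP4 file, one commonly used combination is the 'mp4v' FourCC with an .mp4 extension. Whether it works depends on the codecs available in your OpenCV build, so treat this as a sketch rather than a guarantee; only the writer setup changes, the processing loop stays the same:

import cv2

cap = cv2.VideoCapture('input_video.mp4')

width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))
fps = int(cap.get(cv2.CAP_PROP_FPS))

# 'mp4v' is a widely used FourCC for .mp4 output, but codec
# availability varies by platform and OpenCV build
fourcc = cv2.VideoWriter_fourcc(*'mp4v')
out = cv2.VideoWriter('output_video.mp4', fourcc, fps, (width, height))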
Applying Filters
OpenCV provides various filters to enhance image quality. Let’s explore the Gaussian Blur filter for smoothing:
import cv2

video_path = 'input_video.mp4'
cap = cv2.VideoCapture(video_path)

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    # Apply Gaussian Blur
    blurred_frame = cv2.GaussianBlur(frame, (15, 15), 0)

    cv2.imshow('Blurred Frame', blurred_frame)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()
Explanation
- The cv2.GaussianBlur() function applies a Gaussian filter to smooth the frame, reducing noise and detail.
- The kernel size (15, 15) defines the size of the filter, and the 0 tells OpenCV to compute the standard deviation in the X and Y directions from the kernel size.
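Blurring is often used as a pre-processing step rather than an end in itself. The sketch below smooths each frame before running the same Canny detector from earlier, which tends to suppress spurious edges caused by noise; the (5, 5) kernel is just an illustrative choice:

import cv2

cap = cv2.VideoCapture('input_video.mp4')

while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        break

    gray_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)

    # Smooth before edge detection; a small kernel keeps more detail
    # than the (15, 15) kernel used above
    blurred = cv2.GaussianBlur(gray_frame, (5, 5), 0)
    edges = cv2.Canny(blurred, 100, 200)

    cv2.imshow('Edges after Blur', edges)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()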
Conclusion
Understanding video processing fundamentals is the first step toward building sophisticated computer vision applications. By leveraging Python with OpenCV, you’re equipped to read, process, and save videos efficiently. From edge detection to applying filters, the possibilities are endless! Feel free to further explore OpenCV's extensive features to enhance your video processing skills.