The Sliding Window
A convolution kernel (or filter) is a small matrix — typically 3×3 or 5×5 — that is slid across every pixel of an image. At each position, the kernel values are multiplied element-wise with the underlying pixel values and summed to produce a single output pixel.
