Feature Matching¶

Overview¶

Feature matching is the process of finding and connecting identical feature points across two images. It is used in object recognition, image stitching, 3D reconstruction, object tracking, and more. In this lesson, we will learn about BFMatcher, FLANN, distance metrics, Lowe's ratio test, Homography, and RANSAC.

1. Feature Matching Fundamentals¶

Matching Process¶

+---------------------------------------------------------------------+
|                     Feature Matching Pipeline                        |
+---------------------------------------------------------------------+
|                                                                      |
|   Image 1                        Image 2                             |
|   +---------+                     +---------+                        |
|   | *  *    |                     |   *  *  |                        |
|   |    *  * |                     | *    *  |                        |
|   |  *      |                     |   *     |                        |
|   +---------+                     +---------+                        |
|       |                               |                              |
|       v                               v                              |
|  +----------+                   +----------+                         |
|  | Feature  |                   | Feature  |                         |
|  | Detection|                   | Detection|                         |
|  +----+-----+                   +----+-----+                         |
|       |                               |                              |
|       v                               v                              |
|  +----------+                   +----------+                         |
|  |Descriptor|                   |Descriptor|                         |
|  | Compute  |                   | Compute  |                         |
|  +----+-----+                   +----+-----+                         |
|       |                               |                              |
|       +----------+-------------------+                               |
|                  v                                                   |
|           +--------------+                                           |
|           |   Matching   |                                           |
|           | (BFMatcher   |                                           |
|           |  or FLANN)   |                                           |
|           +------+-------+                                           |
|                  v                                                   |
|           +--------------+                                           |
|           | Filtering    |                                           |
|           | (Ratio Test, |                                           |
|           |  RANSAC)     |                                           |
|           +--------------+                                           |
|                                                                      |
+---------------------------------------------------------------------+

DMatch Structure¶

import cv2

# DMatch attributes
# match.queryIdx  : Descriptor index in query (first) image
# match.trainIdx  : Descriptor index in train (second) image
# match.imgIdx    : Index of train image (when matching multiple images)
# match.distance  : Distance between descriptors (similarity)

2. BFMatcher¶

Concept¶

BFMatcher (Brute-Force Matcher):
Computes distances between all descriptor pairs to find minimum distance

Advantages:
- Simple implementation
- Always guarantees optimal match

Disadvantages:
- O(N * M) complexity (N, M: number of descriptors)
- Slow for large feature sets

                Query Descriptors
                d1   d2   d3   d4
            +----+----+----+----+
Train   d1' | 10 | 25 | 15 | 30 |
Desc    d2' | 20 |  5 | 35 | 12 |  <- Each cell: distance
        d3' | 30 | 18 |  8 | 22 |
            +----+----+----+----+

Match: d1<->d1'(10), d2<->d2'(5), d3<->d3'(8), d4<->d2'(12)

cv2.BFMatcher¶

import cv2
import numpy as np

def bf_matching_demo(img1_path, img2_path):
    """Basic BFMatcher usage"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    # ORB detector (binary descriptors)
    orb = cv2.ORB_create()
    kp1, des1 = orb.detectAndCompute(img1, None)
    kp2, des2 = orb.detectAndCompute(img2, None)

    # Create BFMatcher (Hamming distance for binary)
    bf = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)

    # Match
    matches = bf.match(des1, des2)

    # Sort by distance
    matches = sorted(matches, key=lambda x: x.distance)

    # Draw top 30 matches
    result = cv2.drawMatches(
        img1, kp1, img2, kp2,
        matches[:30], None,
        flags=cv2.DRAW_MATCHES_FLAGS_NOT_DRAW_SINGLE_POINTS
    )

    print(f"Total matches: {len(matches)}")
    print(f"Min distance: {matches[0].distance:.2f}")
    print(f"Max distance: {matches[-1].distance:.2f}")

    cv2.imshow('BF Matches', result)
    cv2.waitKey(0)

    return matches

matches = bf_matching_demo('query.jpg', 'train.jpg')

crossCheck Option¶

import cv2

def bf_crosscheck_comparison(img1_path, img2_path):
    """Compare crossCheck option"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    orb = cv2.ORB_create()
    kp1, des1 = orb.detectAndCompute(img1, None)
    kp2, des2 = orb.detectAndCompute(img2, None)

    # crossCheck=False
    bf_no_cross = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=False)
    matches_no_cross = bf_no_cross.match(des1, des2)

    # crossCheck=True
    # Both A->B and B->A must match
    bf_cross = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches_cross = bf_cross.match(des1, des2)

    print(f"crossCheck=False: {len(matches_no_cross)} matches")
    print(f"crossCheck=True:  {len(matches_cross)} matches")

    # crossCheck=True provides more reliable matches

bf_crosscheck_comparison('query.jpg', 'train.jpg')

knnMatch¶

import cv2
import numpy as np

def bf_knn_matching(img1_path, img2_path, k=2):
    """k-nearest neighbors matching"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    # SIFT detector (float descriptors)
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)

    # BFMatcher (L2 distance for float)
    bf = cv2.BFMatcher(cv2.NORM_L2)

    # Return k nearest neighbors
    matches = bf.knnMatch(des1, des2, k=k)

    # k matches per query descriptor
    print(f"Query descriptor count: {len(des1)}")
    print(f"{k} matches per query")

    # Check first query's matches
    if len(matches) > 0:
        print(f"\nFirst query's matches:")
        for i, m in enumerate(matches[0]):
            print(f"  Match {i+1}: trainIdx={m.trainIdx}, distance={m.distance:.2f}")

    return matches

matches = bf_knn_matching('query.jpg', 'train.jpg', k=2)

3. FLANN-based Matcher¶

Concept¶

FLANN (Fast Library for Approximate Nearest Neighbors):
Library for approximate nearest neighbor search

Characteristics:
- Faster than BFMatcher (for large datasets)
- Approximate algorithm (not 100% accurate)
- Uses KD-Tree, K-Means Tree, etc.

Index Types:
1. FLANN_INDEX_KDTREE (0): For float descriptors
2. FLANN_INDEX_LSH (6): For binary descriptors

FLANN Usage¶

import cv2
import numpy as np

def flann_matching_sift(img1_path, img2_path):
    """FLANN matching (SIFT - float descriptors)"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)

    # FLANN parameters (KD-Tree)
    FLANN_INDEX_KDTREE = 1
    index_params = dict(
        algorithm=FLANN_INDEX_KDTREE,
        trees=5
    )
    search_params = dict(
        checks=50  # Search iterations (higher = more accurate, slower)
    )

    # Create FLANN matcher
    flann = cv2.FlannBasedMatcher(index_params, search_params)

    # k-nearest neighbors matching
    matches = flann.knnMatch(des1, des2, k=2)

    # Lowe's ratio test
    good_matches = []
    for m, n in matches:
        if m.distance < 0.7 * n.distance:
            good_matches.append(m)

    print(f"Total matches: {len(matches)}")
    print(f"Good matches: {len(good_matches)}")

    # Draw results
    result = cv2.drawMatches(
        img1, kp1, img2, kp2,
        good_matches, None,
        flags=cv2.DRAW_MATCHES_FLAGS_NOT_DRAW_SINGLE_POINTS
    )

    cv2.imshow('FLANN Matches', result)
    cv2.waitKey(0)

    return good_matches, kp1, kp2

matches, kp1, kp2 = flann_matching_sift('query.jpg', 'train.jpg')

FLANN for ORB (Binary)¶

import cv2
import numpy as np

def flann_matching_orb(img1_path, img2_path):
    """FLANN matching (ORB - binary descriptors)"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    orb = cv2.ORB_create()
    kp1, des1 = orb.detectAndCompute(img1, None)
    kp2, des2 = orb.detectAndCompute(img2, None)

    # FLANN parameters (LSH for binary)
    FLANN_INDEX_LSH = 6
    index_params = dict(
        algorithm=FLANN_INDEX_LSH,
        table_number=6,        # Number of hash tables
        key_size=12,           # Key size
        multi_probe_level=1    # Multi-probe level
    )
    search_params = dict(checks=50)

    flann = cv2.FlannBasedMatcher(index_params, search_params)

    # Convert descriptors to float32 (FLANN requirement)
    des1 = des1.astype(np.float32)
    des2 = des2.astype(np.float32)

    matches = flann.knnMatch(des1, des2, k=2)

    # Ratio test
    good_matches = []
    for pair in matches:
        if len(pair) == 2:
            m, n = pair
            if m.distance < 0.75 * n.distance:
                good_matches.append(m)

    result = cv2.drawMatches(
        img1, kp1, img2, kp2,
        good_matches, None,
        flags=cv2.DRAW_MATCHES_FLAGS_NOT_DRAW_SINGLE_POINTS
    )

    cv2.imshow('FLANN ORB Matches', result)
    cv2.waitKey(0)

    return good_matches

flann_matching_orb('query.jpg', 'train.jpg')

4. Distance Metrics¶

Distance Types¶

+--------------------------------------------------------------------+
|                        Distance Metric Comparison                   |
+--------------------------------------------------------------------+
|                                                                     |
|  cv2.NORM_L1 (Manhattan Distance)                                  |
|  d = sum|a_i - b_i|                                                 |
|  -> Rarely used                                                     |
|                                                                     |
|  cv2.NORM_L2 (Euclidean Distance)                                  |
|  d = sqrt(sum(a_i - b_i)^2)                                        |
|  -> For SIFT, SURF, etc. float descriptors                         |
|                                                                     |
|  cv2.NORM_HAMMING                                                   |
|  d = sum(a_i XOR b_i)                                              |
|  -> For ORB, BRIEF, etc. binary descriptors (256 bits)             |
|                                                                     |
|  cv2.NORM_HAMMING2                                                  |
|  -> For ORB (WTA_K=3,4)                                            |
|                                                                     |
+--------------------------------------------------------------------+

Recommended Metrics per Descriptor¶

import cv2

# Recommended distance metric per descriptor type
descriptor_distance = {
    'SIFT': cv2.NORM_L2,
    'SURF': cv2.NORM_L2,
    'KAZE': cv2.NORM_L2,
    'ORB': cv2.NORM_HAMMING,
    'BRISK': cv2.NORM_HAMMING,
    'AKAZE': cv2.NORM_HAMMING,  # Binary mode
    'BRIEF': cv2.NORM_HAMMING,
    'FREAK': cv2.NORM_HAMMING,
}

def get_matcher(descriptor_type):
    """Return matcher for descriptor type"""
    norm_type = descriptor_distance.get(descriptor_type, cv2.NORM_L2)
    return cv2.BFMatcher(norm_type, crossCheck=True)

5. Match Filtering¶

Lowe's Ratio Test¶

Lowe's Ratio Test:
Filter by ratio of distances between nearest and second-nearest neighbor

Principle:
Good match -> Nearest neighbor is clearly closer (small ratio)
Bad match -> Multiple candidates at similar distances (large ratio)

distance(best) / distance(second_best) < threshold

Recommended threshold: 0.7 ~ 0.8

import cv2
import numpy as np

def lowe_ratio_test(img1_path, img2_path, ratio_thresh=0.75):
    """Apply Lowe's ratio test"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)

    bf = cv2.BFMatcher()
    matches = bf.knnMatch(des1, des2, k=2)

    # Ratio test
    good_matches = []
    for m, n in matches:
        ratio = m.distance / n.distance
        if ratio < ratio_thresh:
            good_matches.append(m)

    print(f"Total matches: {len(matches)}")
    print(f"Ratio test passed: {len(good_matches)}")
    print(f"Filter ratio: {len(good_matches)/len(matches)*100:.1f}%")

    # Match quality analysis
    if good_matches:
        distances = [m.distance for m in good_matches]
        print(f"Average distance: {np.mean(distances):.2f}")
        print(f"Distance std dev: {np.std(distances):.2f}")

    return good_matches, kp1, kp2

matches, kp1, kp2 = lowe_ratio_test('query.jpg', 'train.jpg')

Distance-based Filtering¶

import cv2
import numpy as np

def distance_based_filtering(matches, threshold_factor=2.0):
    """Distance-based match filtering"""
    if not matches:
        return []

    distances = [m.distance for m in matches]
    mean_dist = np.mean(distances)
    std_dist = np.std(distances)

    # Keep only those below mean + k*std
    threshold = mean_dist + threshold_factor * std_dist

    good_matches = [m for m in matches if m.distance < threshold]

    print(f"Distance mean: {mean_dist:.2f}")
    print(f"Distance std dev: {std_dist:.2f}")
    print(f"Threshold: {threshold:.2f}")
    print(f"Filtering result: {len(matches)} -> {len(good_matches)}")

    return good_matches

Symmetric Matching¶

import cv2

def symmetric_matching(des1, des2, norm_type=cv2.NORM_L2):
    """Symmetric matching (verify both A->B and B->A)"""
    bf = cv2.BFMatcher(norm_type)

    # A -> B matching
    matches_ab = bf.knnMatch(des1, des2, k=1)

    # B -> A matching
    matches_ba = bf.knnMatch(des2, des1, k=1)

    # Select only bidirectionally consistent matches
    symmetric = []
    for m_ab in matches_ab:
        if len(m_ab) == 0:
            continue

        query_idx = m_ab[0].queryIdx
        train_idx = m_ab[0].trainIdx

        # Check reverse direction in B->A
        for m_ba in matches_ba:
            if len(m_ba) == 0:
                continue

            if m_ba[0].queryIdx == train_idx and m_ba[0].trainIdx == query_idx:
                symmetric.append(m_ab[0])
                break

    return symmetric

6. Homography and RANSAC¶

Homography Concept¶

Homography:
3x3 matrix representing perspective transformation between planes

+     +   +           + +   +
| x'  |   | h11 h12 h13 | | x |
| y'  | = | h21 h22 h23 | | y |
|  1  |   | h31 h32 h33 | | 1 |
+     +   +           + +   +

x' = (h11*x + h12*y + h13) / (h31*x + h32*y + h33)
y' = (h21*x + h22*y + h23) / (h31*x + h32*y + h33)

Applications:
- Object position estimation
- Image registration
- Panorama stitching
- AR marker detection

cv2.findHomography()¶

import cv2
import numpy as np

def find_object_homography(img1_path, img2_path, min_matches=10):
    """Find object using homography"""
    img1 = cv2.imread(img1_path)  # Query (object to find)
    img2 = cv2.imread(img2_path)  # Target (scene)

    gray1 = cv2.cvtColor(img1, cv2.COLOR_BGR2GRAY)
    gray2 = cv2.cvtColor(img2, cv2.COLOR_BGR2GRAY)

    # SIFT features and matching
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(gray1, None)
    kp2, des2 = sift.detectAndCompute(gray2, None)

    bf = cv2.BFMatcher()
    matches = bf.knnMatch(des1, des2, k=2)

    # Ratio test
    good_matches = []
    for m, n in matches:
        if m.distance < 0.75 * n.distance:
            good_matches.append(m)

    print(f"Good matches: {len(good_matches)}")

    if len(good_matches) >= min_matches:
        # Extract matched point coordinates
        src_pts = np.float32([kp1[m.queryIdx].pt for m in good_matches]).reshape(-1, 1, 2)
        dst_pts = np.float32([kp2[m.trainIdx].pt for m in good_matches]).reshape(-1, 1, 2)

        # Compute homography (RANSAC)
        H, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 5.0)

        if H is not None:
            # Transform query image corners
            h, w = gray1.shape
            corners = np.float32([
                [0, 0],
                [w, 0],
                [w, h],
                [0, h]
            ]).reshape(-1, 1, 2)

            transformed_corners = cv2.perspectiveTransform(corners, H)

            # Draw object location on target image
            result = img2.copy()
            cv2.polylines(
                result,
                [np.int32(transformed_corners)],
                True,
                (0, 255, 0),
                3,
                cv2.LINE_AA
            )

            # Match visualization
            matches_mask = mask.ravel().tolist()
            draw_params = dict(
                matchColor=(0, 255, 0),
                singlePointColor=None,
                matchesMask=matches_mask,
                flags=2
            )

            match_img = cv2.drawMatches(
                img1, kp1, img2, kp2,
                good_matches, None, **draw_params
            )

            cv2.imshow('Object Detection', result)
            cv2.imshow('Matches', match_img)
            cv2.waitKey(0)

            # Inlier ratio
            inliers = np.sum(mask)
            print(f"Inliers: {inliers}/{len(good_matches)}")
            print(f"Inlier ratio: {inliers/len(good_matches)*100:.1f}%")

            return H, transformed_corners
    else:
        print(f"Insufficient matches: {len(good_matches)} < {min_matches}")
        return None, None

H, corners = find_object_homography('book_cover.jpg', 'scene.jpg')

Understanding RANSAC¶

RANSAC (RANdom SAmple Consensus):
Model estimation from data with outliers

Algorithm:
1. Randomly select minimum samples (homography: 4 points)
2. Compute model
3. Compute error for all points
4. Count points within threshold (inliers)
5. Repeat and select model with most inliers
6. Recompute model with inliers (optional)

+----------------------------------------+
|  *  *  *  *  *                         |
|     *  *  *        <- Inliers (near line) |
|        *  *  *                         |
|  x                                     |
|           x        <- Outliers         |
|     x          x                       |
+----------------------------------------+

findHomography parameters:
- cv2.RANSAC: Use RANSAC
- ransacReprojThreshold: Inlier threshold (pixels)

import cv2
import numpy as np

def homography_methods_comparison(src_pts, dst_pts):
    """Compare various homography computation methods"""

    methods = [
        (0, 'Regular (LS)'),
        (cv2.RANSAC, 'RANSAC'),
        (cv2.LMEDS, 'Least-Median'),
        (cv2.RHO, 'PROSAC'),
    ]

    for method, name in methods:
        try:
            H, mask = cv2.findHomography(
                src_pts, dst_pts,
                method,
                ransacReprojThreshold=5.0
            )

            if H is not None and mask is not None:
                inliers = np.sum(mask)
                print(f"{name}: {inliers}/{len(src_pts)} inliers")
            else:
                print(f"{name}: Failed")
        except Exception as e:
            print(f"{name}: Error - {e}")

7. Image Stitching Basics¶

Simple Panorama¶

import cv2
import numpy as np

def simple_panorama(img1_path, img2_path):
    """Simple panorama stitching"""
    img1 = cv2.imread(img1_path)  # Left image
    img2 = cv2.imread(img2_path)  # Right image

    gray1 = cv2.cvtColor(img1, cv2.COLOR_BGR2GRAY)
    gray2 = cv2.cvtColor(img2, cv2.COLOR_BGR2GRAY)

    # Feature detection and matching
    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(gray1, None)
    kp2, des2 = sift.detectAndCompute(gray2, None)

    bf = cv2.BFMatcher()
    matches = bf.knnMatch(des1, des2, k=2)

    # Ratio test
    good = []
    for m, n in matches:
        if m.distance < 0.75 * n.distance:
            good.append(m)

    print(f"Matches: {len(good)}")

    if len(good) < 4:
        print("Not enough matches.")
        return None

    # Compute homography
    src_pts = np.float32([kp1[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
    dst_pts = np.float32([kp2[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)

    H, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 5.0)

    if H is None:
        print("Homography computation failed")
        return None

    # Image warping
    h1, w1 = img1.shape[:2]
    h2, w2 = img2.shape[:2]

    # Compute result image size
    corners1 = np.float32([[0, 0], [w1, 0], [w1, h1], [0, h1]]).reshape(-1, 1, 2)
    corners1_transformed = cv2.perspectiveTransform(corners1, H)

    corners2 = np.float32([[0, 0], [w2, 0], [w2, h2], [0, h2]]).reshape(-1, 1, 2)

    all_corners = np.concatenate([corners1_transformed, corners2], axis=0)

    x_min, y_min = np.int32(all_corners.min(axis=0).ravel())
    x_max, y_max = np.int32(all_corners.max(axis=0).ravel())

    # Translation transform
    translation = np.array([
        [1, 0, -x_min],
        [0, 1, -y_min],
        [0, 0, 1]
    ], dtype=np.float32)

    # Warp image 1
    result_width = x_max - x_min
    result_height = y_max - y_min

    warped1 = cv2.warpPerspective(
        img1,
        translation @ H,
        (result_width, result_height)
    )

    # Copy image 2
    warped1[-y_min:-y_min+h2, -x_min:-x_min+w2] = img2

    cv2.imshow('Panorama', warped1)
    cv2.waitKey(0)

    return warped1

panorama = simple_panorama('left.jpg', 'right.jpg')

Using OpenCV Stitcher¶

import cv2
import numpy as np

def opencv_stitcher(image_paths):
    """Use OpenCV Stitcher class"""
    images = []
    for path in image_paths:
        img = cv2.imread(path)
        if img is not None:
            images.append(img)

    if len(images) < 2:
        print("At least 2 images required.")
        return None

    # Create Stitcher
    stitcher = cv2.Stitcher_create(cv2.Stitcher_PANORAMA)
    # Or: cv2.Stitcher_SCANS (for document scans)

    # Perform stitching
    status, result = stitcher.stitch(images)

    if status == cv2.Stitcher_OK:
        print("Stitching successful!")
        cv2.imshow('Stitched', result)
        cv2.waitKey(0)
        return result
    elif status == cv2.Stitcher_ERR_NEED_MORE_IMGS:
        print("Need more images.")
    elif status == cv2.Stitcher_ERR_HOMOGRAPHY_EST_FAIL:
        print("Homography estimation failed")
    elif status == cv2.Stitcher_ERR_CAMERA_PARAMS_ADJUST_FAIL:
        print("Camera parameters adjustment failed")

    return None

# Usage example
image_files = ['pano1.jpg', 'pano2.jpg', 'pano3.jpg']
result = opencv_stitcher(image_files)

8. Practice Problems¶

Problem 1: Find Optimal Matching Parameters¶

Test various ratio threshold values to find the optimal value.

Solution Code

import cv2
import numpy as np
import matplotlib.pyplot as plt

def find_optimal_ratio(img1_path, img2_path):
    """Find optimal ratio threshold"""
    img1 = cv2.imread(img1_path, cv2.IMREAD_GRAYSCALE)
    img2 = cv2.imread(img2_path, cv2.IMREAD_GRAYSCALE)

    sift = cv2.SIFT_create()
    kp1, des1 = sift.detectAndCompute(img1, None)
    kp2, des2 = sift.detectAndCompute(img2, None)

    bf = cv2.BFMatcher()
    matches = bf.knnMatch(des1, des2, k=2)

    ratios = np.arange(0.5, 1.0, 0.05)
    results = []

    for ratio in ratios:
        good = [m for m, n in matches if m.distance < ratio * n.distance]
        results.append(len(good))

    # Graph
    plt.figure(figsize=(10, 5))
    plt.plot(ratios, results, 'b-o')
    plt.xlabel('Ratio Threshold')
    plt.ylabel('Number of Matches')
    plt.title('Ratio Threshold vs Match Count')
    plt.grid(True)
    plt.show()

    # Gradient change analysis
    gradients = np.gradient(results)
    optimal_idx = np.argmax(np.abs(gradients))
    optimal_ratio = ratios[optimal_idx]

    print(f"Recommended ratio threshold: {optimal_ratio:.2f}")

    return optimal_ratio

optimal = find_optimal_ratio('query.jpg', 'train.jpg')

Problem 2: Multiple Object Detection¶

Detect multiple instances of the same object in a scene.

Solution Code

import cv2
import numpy as np

def detect_multiple_objects(template_path, scene_path, threshold=10):
    """Detect multiple identical objects"""
    template = cv2.imread(template_path)
    scene = cv2.imread(scene_path)

    gray_t = cv2.cvtColor(template, cv2.COLOR_BGR2GRAY)
    gray_s = cv2.cvtColor(scene, cv2.COLOR_BGR2GRAY)

    sift = cv2.SIFT_create()
    kp_t, des_t = sift.detectAndCompute(gray_t, None)
    kp_s, des_s = sift.detectAndCompute(gray_s, None)

    bf = cv2.BFMatcher()
    all_matches = bf.knnMatch(des_t, des_s, k=2)

    # Ratio test
    good_matches = []
    for m, n in all_matches:
        if m.distance < 0.7 * n.distance:
            good_matches.append(m)

    if len(good_matches) < threshold:
        print("Insufficient matches")
        return []

    # Clustering to find multiple instances
    scene_pts = np.array([kp_s[m.trainIdx].pt for m in good_matches])

    # K-means clustering
    criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 100, 0.2)
    k = min(5, len(good_matches) // threshold)  # Max 5 objects

    if k < 1:
        k = 1

    _, labels, centers = cv2.kmeans(
        np.float32(scene_pts),
        k,
        None,
        criteria,
        10,
        cv2.KMEANS_RANDOM_CENTERS
    )

    result = scene.copy()
    detected = []

    for cluster_id in range(k):
        cluster_mask = labels.ravel() == cluster_id
        cluster_matches = [m for m, is_in in zip(good_matches, cluster_mask) if is_in]

        if len(cluster_matches) >= threshold // 2:
            # Compute homography per cluster
            src_pts = np.float32([kp_t[m.queryIdx].pt for m in cluster_matches]).reshape(-1, 1, 2)
            dst_pts = np.float32([kp_s[m.trainIdx].pt for m in cluster_matches]).reshape(-1, 1, 2)

            H, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 5.0)

            if H is not None:
                h, w = gray_t.shape
                corners = np.float32([[0, 0], [w, 0], [w, h], [0, h]]).reshape(-1, 1, 2)
                transformed = cv2.perspectiveTransform(corners, H)

                cv2.polylines(result, [np.int32(transformed)], True, (0, 255, 0), 3)
                detected.append(transformed)

    print(f"Objects detected: {len(detected)}")
    cv2.imshow('Multiple Objects', result)
    cv2.waitKey(0)

    return detected

detect_multiple_objects('coin.jpg', 'coins.jpg')

Problem 3: Real-time Object Tracking¶

Track a template object in real-time from webcam.

Solution Code

import cv2
import numpy as np

def realtime_object_tracking(template_path):
    """Real-time object tracking"""
    template = cv2.imread(template_path, cv2.IMREAD_GRAYSCALE)
    h, w = template.shape

    # Use ORB (fast)
    orb = cv2.ORB_create(nfeatures=500)
    kp_t, des_t = orb.detectAndCompute(template, None)

    bf = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=False)

    cap = cv2.VideoCapture(0)

    while True:
        ret, frame = cap.read()
        if not ret:
            break

        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        kp_f, des_f = orb.detectAndCompute(gray, None)

        if des_f is not None and len(des_f) > 10:
            matches = bf.knnMatch(des_t, des_f, k=2)

            # Ratio test
            good = []
            for pair in matches:
                if len(pair) == 2:
                    m, n = pair
                    if m.distance < 0.75 * n.distance:
                        good.append(m)

            if len(good) >= 10:
                src_pts = np.float32([kp_t[m.queryIdx].pt for m in good]).reshape(-1, 1, 2)
                dst_pts = np.float32([kp_f[m.trainIdx].pt for m in good]).reshape(-1, 1, 2)

                H, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 5.0)

                if H is not None:
                    corners = np.float32([[0, 0], [w, 0], [w, h], [0, h]]).reshape(-1, 1, 2)
                    transformed = cv2.perspectiveTransform(corners, H)
                    cv2.polylines(frame, [np.int32(transformed)], True, (0, 255, 0), 3)

                    # Display match count
                    cv2.putText(frame, f'Matches: {len(good)}', (10, 30),
                                cv2.FONT_HERSHEY_SIMPLEX, 1, (0, 255, 0), 2)

        cv2.imshow('Tracking', frame)

        if cv2.waitKey(1) & 0xFF == ord('q'):
            break

    cap.release()
    cv2.destroyAllWindows()

# realtime_object_tracking('logo.jpg')

Difficulty	Topic	Description
*	Basic Matching	Feature matching between two images
**	Filtering	Ratio test, distance filtering
**	Object Detection	Find object using homography
***	Panorama	Stitch 2+ images
***	Real-time Tracking	Track object from webcam

Next Steps¶

15_Object_Detection_Basics.md - Template Matching, Haar Cascade, HOG+SVM

Feature Matching

Feature Matching¶

Overview¶

Table of Contents¶

1. Feature Matching Fundamentals¶

Matching Process¶

DMatch Structure¶

2. BFMatcher¶

Concept¶

cv2.BFMatcher¶

crossCheck Option¶

knnMatch¶

3. FLANN-based Matcher¶

Concept¶

FLANN Usage¶

FLANN for ORB (Binary)¶

4. Distance Metrics¶

Distance Types¶

Recommended Metrics per Descriptor¶

5. Match Filtering¶

Lowe's Ratio Test¶

Distance-based Filtering¶

Symmetric Matching¶

6. Homography and RANSAC¶

Homography Concept¶

cv2.findHomography()¶

Understanding RANSAC¶

7. Image Stitching Basics¶

Simple Panorama¶

Using OpenCV Stitcher¶

8. Practice Problems¶

Problem 1: Find Optimal Matching Parameters¶

Problem 2: Multiple Object Detection¶

Problem 3: Real-time Object Tracking¶

Recommended Problems¶

Next Steps¶

References¶