Skip to content

Multi-target Multi-camera Person Tracking (M.S. Project at UCSB)

Notifications You must be signed in to change notification settings

kimia-cvengineer/MTMCPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 

Repository files navigation

Multi-target Multi-camera Person Tracking

This project introduces an innovative framework designed to improve surveillance, monitoring, and anomaly detection systems through consistent tracking and re-identification of people across both overlapping and non-overlapping cameras. To continuously track and re-identify individuals, a fusion of people’s motion and appearance features is leveraged. Addressing the difficulties posed by fluctuating lighting conditions, quality of frames, and occlusions, this approach refines and connects the tracking results from individual cameras to produce reliable and continuous tracklets. Lastly, hierarchical clustering organizes single-camera tracklets into distinct groups of unique identities.

Problem Definition

  • Goal: Consistently track and re-identify people across multi-view cameras.
  • Scope:
    • Generalize to both overlapping and non-overlapping cameras indoors and outdoors.
    • Address challenges in viewpoint variance and frame quality through refinement techniques.

box_exchanging_img

Importance of the Problem

  • Public safety and security
  • Traffic monitoring and management
  • Healthcare and elderly care

Proposed Tracking Framework

proposed_pipeline

Step 0: Dataset Preparation

  • Dataset: MEVA (Multimedia Event Detection and Activity Dataset)
    • A challenging large-scale video dataset designed for activity recognition in multi-camera environments.
    • Contains over 9,300 hours of untrimmed videos with:
      • Diverse backgrounds, camera poses, illuminations, and indoor/outdoor scenes.
      • Videos taken at different times of the day, month, and year, split into 5-minute segments.
    • Involves 158 unique people wearing 598 outfits across 33 camera views.

meva_dataset_samples

  • Focus:
    • We focus on 5 connected cameras:
      • 14 videos.
      • Each video contains 9,000 frames (~5 minutes long).
      • 20 unique people across cameras.

meva_dataset_site_map_focus

  • Annotations:
    • By default, the dataset doesn’t provide consistent IDs for people.
    • We annotated part of the dataset using a custom annotation tool due to the dataset’s large scale.

annotaion_results

Step 1: Single-view Multi-Object Tracking

  • BoT-SORT: Robust Associations Multi-Pedestrian Tracking:
    • Fuses motion and appearance features.
    • Uses Kalman filter for motion-based future position estimation.
    • Employs BoT (SBS) appearance-based feature extractor to reduce tracking errors.

Step 2: Tracklet Refinement

  • Objective: Split tracklets containing different identities.
    • Uses intra-variance of tracklets to detect tracklets that should be split.
    • Key insight:
      • Higher intra-variance indicates higher appearance variation, suggesting that the tracklet contains different identities.
    • Applies K-Means clustering to split tracklets.
    • Reduces errors caused by single-camera trackers.

tracklet_refinement

Step 3: Intra-camera Tracklet Association

  • Approach:
    • Uses agglomerative clustering to group tracklets with the same identity based on appearance features.
    • Process:
      • Calculate the aggregated distance matrix between each pair of tracklets.
      • Cluster tracklets using the aggregated distance matrix.

tracklet_clustering

Step 4: Intra-camera Clustering Refinement

  • Follows the same approach described in Tracklet Refinement.

Step 5: Inter-camera Tracklet Association

  • Follows the same approach described in Intra-camera Tracklet Association.

Quantitative Results

Method IDF1 IDP IDR IDS
Pipeline w/o refinement 27.3% 22.2% 34.7% 375
Pipeline + pre-clustering refinement 32.2% 26.9% 37.9% 386
Pipeline + pre/post-clustering refinement 34.8% 28.8% 44.3% 353

Qualitative Results

Single-camera Different Time Person Tracking

multi-times-results

Multi-view Person Tracking

multi-camera-results

For the technical details of the project and experiments, please refer to my presentation slides.

Releases

No releases published

Packages

No packages published