DRT: Detection Refinement for Multiple Object Tracking

Bisheng Wang; Christian Fruhwirth-Reisinger; Horst Possegger; Horst Bischof; Guo Cao

DRT: Detection Refinement for Multiple Object Tracking

Bisheng Wang, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof, Guo Cao

Institute of Computer Graphics and Vision (7100)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

Deep learning methods have led to remarkable progress in multiple object tracking (MOT). However, when tracking in crowded scenes, existing methods still suffer from both inaccurate and missing detections. This paper proposes Detection Refinement for Tracking (DRT) to address these two issues for people tracking. First, we construct an encoder-decoder backbone network with a novel semi-supervised heatmap training procedure, which leverages human heatmaps to obtain a more precise localization of the targets. Second, we integrate a "one patch, multiple predictions" mechanism into DRT which refines the detection results and recovers occluded pedestrians at the same time. Additionally, we leverage a data-driven LSTM-based motion model which can recover lost targets at a negligible computational cost. Compared with strong baseline methods, our DRT achieves significant improvements on publicly available MOT datasets. In addition, DRT generalizes well, i.e. it can be applied to any detector to improve their performance.

Original language	English
Title of host publication	British Machine Vision Conference (BMVC) 2021
Publisher	The British Machine Vision Association
Number of pages	14
Publication status	Published - 23 Nov 2021
Event	32nd British Machine Vision Conference: BMVC 2021 - Virtuell, United Kingdom Duration: 22 Nov 2021 → 25 Nov 2021

Conference

Conference	32nd British Machine Vision Conference
Abbreviated title	BMVC 2021
Country/Territory	United Kingdom
City	Virtuell
Period	22/11/21 → 25/11/21

Access to Document

https://www.bmvc2021-virtualconference.com/assets/papers/0148.pdf

Cite this

@inproceedings{2582916c3ed147c985da6cfdc11258e0,

title = "DRT: Detection Refinement for Multiple Object Tracking",

abstract = "Deep learning methods have led to remarkable progress in multiple object tracking (MOT). However, when tracking in crowded scenes, existing methods still suffer from both inaccurate and missing detections. This paper proposes Detection Refinement for Tracking (DRT) to address these two issues for people tracking. First, we construct an encoder-decoder backbone network with a novel semi-supervised heatmap training procedure, which leverages human heatmaps to obtain a more precise localization of the targets. Second, we integrate a {"}one patch, multiple predictions{"} mechanism into DRT which refines the detection results and recovers occluded pedestrians at the same time. Additionally, we leverage a data-driven LSTM-based motion model which can recover lost targets at a negligible computational cost. Compared with strong baseline methods, our DRT achieves significant improvements on publicly available MOT datasets. In addition, DRT generalizes well, i.e. it can be applied to any detector to improve their performance.",

author = "Bisheng Wang and Christian Fruhwirth-Reisinger and Horst Possegger and Horst Bischof and Guo Cao",

year = "2021",

month = nov,

day = "23",

language = "English",

booktitle = "British Machine Vision Conference (BMVC) 2021",

publisher = "The British Machine Vision Association",

address = "United Kingdom",

note = "32nd British Machine Vision Conference : BMVC 2021, BMVC 2021 ; Conference date: 22-11-2021 Through 25-11-2021",

}

TY - GEN

T1 - DRT: Detection Refinement for Multiple Object Tracking

AU - Wang, Bisheng

AU - Fruhwirth-Reisinger, Christian

AU - Possegger, Horst

AU - Bischof, Horst

AU - Cao, Guo

PY - 2021/11/23

Y1 - 2021/11/23

N2 - Deep learning methods have led to remarkable progress in multiple object tracking (MOT). However, when tracking in crowded scenes, existing methods still suffer from both inaccurate and missing detections. This paper proposes Detection Refinement for Tracking (DRT) to address these two issues for people tracking. First, we construct an encoder-decoder backbone network with a novel semi-supervised heatmap training procedure, which leverages human heatmaps to obtain a more precise localization of the targets. Second, we integrate a "one patch, multiple predictions" mechanism into DRT which refines the detection results and recovers occluded pedestrians at the same time. Additionally, we leverage a data-driven LSTM-based motion model which can recover lost targets at a negligible computational cost. Compared with strong baseline methods, our DRT achieves significant improvements on publicly available MOT datasets. In addition, DRT generalizes well, i.e. it can be applied to any detector to improve their performance.

AB - Deep learning methods have led to remarkable progress in multiple object tracking (MOT). However, when tracking in crowded scenes, existing methods still suffer from both inaccurate and missing detections. This paper proposes Detection Refinement for Tracking (DRT) to address these two issues for people tracking. First, we construct an encoder-decoder backbone network with a novel semi-supervised heatmap training procedure, which leverages human heatmaps to obtain a more precise localization of the targets. Second, we integrate a "one patch, multiple predictions" mechanism into DRT which refines the detection results and recovers occluded pedestrians at the same time. Additionally, we leverage a data-driven LSTM-based motion model which can recover lost targets at a negligible computational cost. Compared with strong baseline methods, our DRT achieves significant improvements on publicly available MOT datasets. In addition, DRT generalizes well, i.e. it can be applied to any detector to improve their performance.

M3 - Conference paper

BT - British Machine Vision Conference (BMVC) 2021

PB - The British Machine Vision Association

T2 - 32nd British Machine Vision Conference

Y2 - 22 November 2021 through 25 November 2021

ER -

DRT: Detection Refinement for Multiple Object Tracking

Abstract

Conference

Access to Document

Fingerprint

Cite this