Associative embedding: End-to-end learning for joint detection and grouping

Alejandro Newell, Zhiao Huang, Jia Deng

Research output: Contribution to journal › Conference article › peer-review

606 Scopus citations


We introduce associative embedding, a novel method for supervising convolutional neural networks for the task of detection and grouping. A number of computer vision problems can be framed in this manner, including multi-person pose estimation, instance segmentation, and multi-object tracking. Usually the grouping of detections is achieved with multi-stage pipelines. Instead, we propose an approach that teaches a network to simultaneously output detections and group assignments. This technique can be easily integrated into any state-of-the-art network architecture that produces pixel-wise predictions. We show how to apply this method to multi-person pose estimation and report state-of-the-art performance on the MPII and MS-COCO datasets.
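The abstract does not spell out how group assignments are supervised, so the following is only a minimal sketch of one way the idea could work, assuming each detection carries a scalar "tag" and that detections with similar tags belong to the same group. The function names (`grouping_loss`, `group_by_tag`), the pull/push form of the loss, and the `sigma` and `threshold` parameters are illustrative assumptions, not details taken from this record.

```python
import math


def grouping_loss(tags_per_group, sigma=1.0):
    """Illustrative pull/push loss over scalar tags (assumed form).

    tags_per_group: list of lists; each inner list holds the tags predicted
    at the ground-truth detections of one group (e.g. one person's joints).
    """
    groups = [g for g in tags_per_group if g]
    n = len(groups)
    if n == 0:
        return 0.0
    means = [sum(g) / len(g) for g in groups]
    # Pull term: tags within a group should match that group's mean tag.
    pull = sum((t - m) ** 2 for g, m in zip(groups, means) for t in g) / n
    # Push term: mean tags of different groups should be far apart.
    push = 0.0
    for i in range(n):
        for j in range(n):
            if i != j:
                diff = means[i] - means[j]
                push += math.exp(-(diff ** 2) / (2 * sigma ** 2))
    push /= n * n
    return pull + push


def group_by_tag(detections, threshold=0.5):
    """Greedy grouping at inference time (hypothetical decoding rule).

    detections: list of (name, tag) pairs. Each detection joins the existing
    group whose mean tag is nearest, if within `threshold`; otherwise it
    starts a new group.
    """
    groups = []
    for name, tag in detections:
        best = None
        for g in groups:
            mean = sum(t for _, t in g) / len(g)
            dist = abs(tag - mean)
            if dist <= threshold and (best is None or dist < best[0]):
                best = (dist, g)
        if best is None:
            groups.append([(name, tag)])
        else:
            best[1].append((name, tag))
    return groups


if __name__ == "__main__":
    # Well-separated tags give a small loss; mixed tags give a large one.
    print(grouping_loss([[1.0, 1.0], [5.0, 5.0]]))
    print(grouping_loss([[1.0, 5.0], [1.0, 5.0]]))
    dets = [("head", 1.0), ("knee", 1.1), ("head", 5.0), ("knee", 4.9)]
    print(group_by_tag(dets))
```

In this sketch the network is free to choose any tag values; only the relative distances between tags matter, which is what lets detection and grouping be trained jointly from a single set of pixel-wise outputs.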

Original language: English (US)
Pages (from-to): 2278-2288
Number of pages: 11
Journal: Advances in Neural Information Processing Systems
State: Published - 2017
Externally published: Yes
Event: 31st Annual Conference on Neural Information Processing Systems, NIPS 2017 - Long Beach, United States
Duration: Dec 4 2017 to Dec 9 2017

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

