TY - JOUR
T1 - Creating consistent scene graphs using a probabilistic grammar
AU - Liu, Tianqiang
AU - Chaudhuri, Siddhartha
AU - Kim, Vladimir G.
AU - Huang, Qixing
AU - Mitra, Niloy J.
AU - Funkhouser, Thomas
N1 - Publisher Copyright:
2014 Copyright held by the Owner/Author. Publication rights licensed to ACM.
PY - 2014/11/19
Y1 - 2014/11/19
N2 - Growing numbers of 3D scenes in online repositories provide new opportunities for data-driven scene understanding, editing, and synthesis. Despite the plethora of data now available online, most of it cannot be effectively used for data-driven applications because it lacks consistent segmentations, category labels, and/or functional groupings required for co-analysis. In this paper, we develop algorithms that infer such information via parsing with a probabilistic grammar learned from examples. First, given a collection of scene graphs with consistent hierarchies and labels, we train a probabilistic hierarchical grammar to represent the distributions of shapes, cardinalities, and spatial relationships of semantic objects within the collection. Then, we use the learned grammar to parse new scenes to assign them segmentations, labels, and hierarchies consistent with the collection. During experiments with these algorithms, we find that: they work effectively for scene graphs for indoor scenes commonly found online (bedrooms, classrooms, and libraries); they outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy; they require relatively small sets of training data; they are robust to moderate over-segmentation in the inputs; and, they can robustly transfer labels from one data set to another. As a result, the proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class.
AB - Growing numbers of 3D scenes in online repositories provide new opportunities for data-driven scene understanding, editing, and synthesis. Despite the plethora of data now available online, most of it cannot be effectively used for data-driven applications because it lacks consistent segmentations, category labels, and/or functional groupings required for co-analysis. In this paper, we develop algorithms that infer such information via parsing with a probabilistic grammar learned from examples. First, given a collection of scene graphs with consistent hierarchies and labels, we train a probabilistic hierarchical grammar to represent the distributions of shapes, cardinalities, and spatial relationships of semantic objects within the collection. Then, we use the learned grammar to parse new scenes to assign them segmentations, labels, and hierarchies consistent with the collection. During experiments with these algorithms, we find that: they work effectively for scene graphs for indoor scenes commonly found online (bedrooms, classrooms, and libraries); they outperform alternative approaches that consider only shape similarities and/or spatial relationships without hierarchy; they require relatively small sets of training data; they are robust to moderate over-segmentation in the inputs; and, they can robustly transfer labels from one data set to another. As a result, the proposed algorithms can be used to provide consistent hierarchies for large collections of scenes within the same semantic class.
KW - Scene collections
KW - Scene understanding
UR - http://www.scopus.com/inward/record.url?scp=84914680266&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84914680266&partnerID=8YFLogxK
U2 - 10.1145/2661229.2661243
DO - 10.1145/2661229.2661243
M3 - Article
AN - SCOPUS:84914680266
SN - 0730-0301
VL - 33
JO - ACM Transactions on Graphics
JF - ACM Transactions on Graphics
IS - 6
ER -