Social interaction behaviors change as a result of both physical and psychiatric problems, and it is important to identify subtle changes in group activity engagements for monitoring the mental health of patients in clinics. This work proposes a system to identify when and where group formations occur in an approximately 1700 $ text{m}^{2}$ therapeutic built environment using a distributed edge-computing camera network. The proposed method can localize group formations when provided with noisy positions and orientations of individuals, estimated from sparsely distributed multiview cameras, which run a lightweight multiperson 2-D pose detection model. Our group identification method demonstrated an F1 score of up to 90% with a mean absolute error of 1.25 m for group localization on our benchmark dataset. The dataset consisted of seven subjects walking, sitting, and conversing for 35 min in groups of various sizes ranging from 2 to 7 subjects. The proposed system is low-cost and scalable to any ordinary building to transform the indoor space into a smart environment using edge computing systems. We expect the proposed system to enhance existing therapeutic units for passively monitoring the social behaviors of patients when implementing real-time interventions.Social interaction behaviors change as a result of both physical and psychiatric problems, and it is important to identify subtle changes in group activity engagements for monitoring the mental health of patients in clinics. This work proposes a system to identify when and where group formations occur in an approximately 1700 $ text{m}^{2}$ therapeutic built environment using a distributed edge-computing camera network. The proposed method can localize group formations when provided with noisy positions and orientations of individuals, estimated from sparsely distributed multiview cameras, which run a lightweight multiperson 2-D pose detection model. Our group identification method demonstrated an F1 score of up to 90% with a mean absolute error of 1.25 m for group localization on our benchmark dataset. The dataset consisted of seven subjects walking, sitting, and conversing for 35 min in groups of various sizes ranging from 2 to 7 subjects. The proposed system is low-cost and scalable to any ordinary building to transform the indoor space into a smart environment using edge computing systems. We expect the proposed system to enhance existing therapeutic units for passively monitoring the social behaviors of patients when implementing real-time interventions. Leer más