Visual Computing

University of Konstanz
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

DeepShapeKit: accurate 4D shape reconstruction of swimming fish

R. Wu, O. Deussen, L. Li

Abstract

In this paper, we present methods for capturing 4D body shapes of swimming fish with affordable small training datasets and textureless 2D videos. Automated capture of spatiotemporal animal movements and postures is revolutionizing the study of collective animal behavior. 4D (including 3D space + time) shape data from animals like schooling fish contains a rich array of social and non-social information that can be used to shed light on the fundamental mechanisms underlying collective behavior. However, unlike the large datasets used for 4D shape reconstructions of the human body, there are no large amounts of labeled training datasets for reconstructing fish bodies in 4D, due to the difficulty of underwater data collection. We created a template mesh model using 3D scan data from a real fish, then extracted silhouettes (segmentation masks) and key-points of the fish body using Mask R-CNN and DeepLabCut, respectively. Next, using the Adam optimizer, we optimized the 3D template mesh model for each frame by minimizing the difference between the projected 3D model and the detected silhouettes as well as the key-points. Finally, using an LSTM-based smoother, we generated accurate 4D shapes of schooling fish based on the 3D shapes over each frame. Our results show that the method is effective for 4D shape reconstructions of swimming fish, with greater fidelity than other state-of-the-art algorithms.

BibTeX

@inproceedings{Wu2022DeepShapeKitaccurate4D,
  author    = {R. Wu, O. Deussen, L. Li},
  booktitle = {2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  doi       = {10.1109/iros47612.2022.9982097},
  month     = {oct},
  publisher = {IEEE},
  title     = {DeepShapeKit: accurate 4D shape reconstruction of swimming fish},
  year      = {2022}
}