Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Incorporating spatio-temporal information in Frustum-ConvNet for improved 3D object detection in instrumented vehicles

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • المؤلفون: Munirathnam G., Venkatesh; O'Connor, Noel E.; Little, Suzanne
  • نوع التسجيلة:
    Electronic Resource
  • الدخول الالكتروني :
    https://doras.dcu.ie/27612/
    https://doras.dcu.ie/27612
    https://dx.doi.org/10.1109/EUVIP53989.2022.9922815
    DOI: 10.1109/EUVIP53989.2022.9922815
  • معلومة اضافية
    • Publisher Information:
      IEEE 2022-10-20
    • نبذة مختصرة :
      Environmental perception is a key task for autonomous vehicles to ensure intelligent planning and safe decision-making. Most current state-of-the-art perceptual meth- ods in vehicles, and in particular for 3D object detection, are based on a single-frame reference. However, these methods do not effectively utilise temporal information associated with the objects or the scene from the input data sequences. The work presented in this paper corroborates the use of spatial and temporal information through multi-frame, lidar, point cloud data to leverage spatio-temporal contextual information and improve the accuracy of 3D object detection. The study also gathers more insights into the effect of inducing temporal information into a network and the overall performance of the deep learning model. We consider the Frustum-ConvNet architecture as the baseline model and propose methods to incorporate spatio-temporal information using convolutional-LSTMs to detect the 3D object detection using lidar data. We also propose to employ an attention mechanism with temporal encoding to stimulate the model to focus on salient feature points within the region proposals. The results from this study shows the inclusion of temporal information considerably improves the true positive metric specifically the orientation error of the 3D bounding box from 0.819 to 0.784 and 0.294 to 0.111 for cars and pedestrian classes respectively on the customized subset of nuScenes training dataset. The overall nuScenes detection score (NDS) is improved from 0.822 to 0.837 compared to the baseline.
    • الموضوع:
    • الرقم المعرف:
      10.1109.EUVIP53989.2022.9922815
    • Availability:
      Open access content. Open access content
      cc_by_nc_4
    • Note:
      application/pdf
      English
    • Other Numbers:
      IEDUB oai:doras.dcu.ie:27612
      https://doras.dcu.ie/27612/1/EUVIP_2022_paperID_61.pdf
      Munirathnam G., Venkatesh orcid logoORCID: 0000-0002-4393-9267 , O'Connor, Noel E. orcid logoORCID: 0000-0002-4033-9135 and Little, Suzanne orcid logoORCID: 0000-0003-3281-3471 (2022) Incorporating spatio-temporal information in Frustum-ConvNet for improved 3D object detection in instrumented vehicles. In: 10th European Workshop on Visual Information Processing (EUVIP), 11-14 Sept 2022, Lisbon Portugal. ISBN 978-1-6654-6623-3
      DOI: 10.1109/EUVIP53989.2022.9922815
      1375980394
    • Contributing Source:
      DUBLIN CITY UNIV
      From OAIster®, provided by the OCLC Cooperative.
    • الرقم المعرف:
      edsoai.on1375980394
HoldingsOnline