Timezone: »
Distracted driving action recognition from naturalistic driving is crucial for both driver and pedestrian's safe and reliable experience. However, traditional computer vision techniques sometimes require a lot of supervision in terms of a large amount of annotated training data to detect distracted driving activities. Recently, the vision-language models have offered large-scale visual-textual pretraining that can be adapted to unsupervised task-specific learning like distracted activity recognition. The contrastive image-text pretraining models like CLIP have shown significant promise in learning natural language-guided visual representations. In this paper, we propose a CLIP-based driver activity recognition framework that predicts whether a driver is distracted or not while driving. CLIP's vision embedding offers zero-shot transfer, which can identify distracted activities by the driver from the driving videos. Our result suggests this framework offers SOTA performance on zero-shot transfer for predicting the driver's state on three public datasets. We also developed DriveCLIP, a classifier on top of the CLIP's visual representation for distracted driving detection tasks, and reported the results here.
Author Information
Md Zahid Hasan (Iowa State University)
Ameya Joshi (New York University)
Mohammed Shaiqur Rahman (Iowa State University)
Venkatachalapathy Archana (Iowa State University)
Anuj Sharma (Iowa state university)
Chinmay Hegde (New York University)
Soumik Sarkar (Iowa State University)
More from the Same Authors
-
2021 : Cross-Modal Virtual Sensing for Combustion Instability Monitoring »
Tryambak Gangopadhyay · Vikram Ramanan · Chakravarthy S.R. · Soumik Sarkar -
2022 : 3D Reconstruction of Protein Complex Structures Using Synthesized Multi-View AFM Images »
Jaydeep Rade · Soumik Sarkar · Anwesha Sarkar · Adarsh Krishnamurthy -
2022 : Enhancing System-level Safety in Autonomous Driving via Feedback Learning »
Sin Yong Tan · Weisi Fan · Qisai Liu · Tichakorn Wongpiromsarn · Soumik Sarkar -
2022 : Provable Active Learning of Neural Networks for Parametric PDEs »
Aarshvi Gajjar · Chinmay Hegde · Christopher Musco -
2022 : Generative Design of Material Microstructures for Organic Solar Cells using Diffusion Models »
Ethan Herron · Xian Yeow Lee · Aditya Balu · Baskar Ganapathysubramanian · Soumik Sarkar · Adarsh Krishnamurthy -
2022 : Communication-efficient Decentralized Deep Learning »
Fateme Fotouhi · Aditya Balu · Zhanhong Jiang · Yasaman Esfandiari · Salman Jahani · Soumik Sarkar -
2022 : A study of natural robustness of deep reinforcement learning algorithms towards adversarial perturbations »
Qisai Liu · Xian Yeow Lee · Soumik Sarkar -
2022 : Provable Active Learning of Neural Networks for Parametric PDEs »
Aarshvi Gajjar · Chinmay Hegde · Christopher Musco -
2021 Poster: Implicit Sparse Regularization: The Impact of Depth and Early Stopping »
Jiangyuan Li · Thanh Nguyen · Chinmay Hegde · Raymond K. W. Wong -
2021 Poster: Differentiable Spline Approximations »
Minsu Cho · Aditya Balu · Ameya Joshi · Anjana Deva Prasad · Biswajit Khara · Soumik Sarkar · Baskar Ganapathysubramanian · Adarsh Krishnamurthy · Chinmay Hegde -
2019 Poster: Algorithmic Guarantees for Inverse Imaging with Untrained Network Priors »
Gauri Jagatap · Chinmay Hegde -
2018 Poster: Online Robust Policy Learning in the Presence of Unknown Adversaries »
Aaron Havens · Zhanhong Jiang · Soumik Sarkar -
2017 Poster: Collaborative Deep Learning in Fixed Topology Networks »
Zhanhong Jiang · Aditya Balu · Chinmay Hegde · Soumik Sarkar