"au:"Du Tran"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Du Tran"" — arXiv2 Search

Showing 1–20 of 29 results

/ Date/ Name

Jun 16, 2023Learning Space-Time Semantic Correspondences Nov 20, 2015Deep End2End Voxel2Voxel Prediction Dec 2, 2014Learning Spatiotemporal Features with 3D Convolutional Networks Nov 30, 2017A Closer Look at Spatiotemporal Convolutions for Action Recognition Apr 4, 2019Video Classification with Channel-Separated Convolutional Networks Aug 16, 2017ConvNet Architecture Search for Spatiotemporal Feature Learning Dec 20, 2013EXMOVES: Classifier-based Features for Scalable Action Recognition Jun 23, 2016VideoMCC: a New Benchmark for Video Comprehension Nov 28, 2019Self-Supervised Learning by Cross-Modal Audio-Video Clustering Jun 6, 2019Learning Temporal Pose Estimation from Sparsely-Labeled Videos Nov 25, 2025Layer-Aware Video Composition via Split-then-Merge Jun 7, 2019Video Modeling with Correlation Networks Jun 10, 2019UniDual: A Unified Model for Image and Video Understanding Apr 8, 2019SCSampler: Sampling Salient Clips from Video for Efficient Action Recognition Jan 29, 2017Transformation-Based Models of Video Sequences Jun 30, 2018Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization Jan 26, 2019DistInit: Learning Video Representations Without a Single Labeled Video May 2, 2019Large-scale weakly-supervised pre-training for video action recognition Jun 17, 2021Long-Short Temporal Contrastive Learning of Video Transformers Apr 12, 2022Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity