Skip to content

DWCTOD/arXiv-CVPR2022-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Updated on 2022.04.12

CVPR2022

Publish Date Title Authors PDF Code
2022-04-11 Single-Photon Structured Light Varun Sundar et.al. 2204.05300v1 null
2022-04-11 Focal Length and Object Pose Estimation via Render and Compare Georgy Ponimatkin et.al. 2204.05145v1 link
2022-04-11 XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation Wei Liu et.al. 2204.05084v1 null
2022-04-11 Pyramid Grafting Network for One-Stage High Resolution Saliency Detection Chenxi Xie et.al. 2204.05041v1 link
2022-04-11 Structure-Aware Motion Transfer with Deformable Anchor Model Jiale Tao et.al. 2204.05018v1 link
2022-04-11 HiMODE: A Hybrid Monocular Omnidirectional Depth Estimation Model Masum Shah Junayed et.al. 2204.05007v1 null
2022-04-11 Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data Kyungjune Baek et.al. 2204.04950v1 link
2022-04-11 When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search Guocheng Qian et.al. 2204.04918v1 null
2022-04-11 Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection Jihwan Park et.al. 2204.04836v1 link
2022-04-10 SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition Victor Escorcia et.al. 2204.04796v1 null
2022-04-10 Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image Yujiao Shi et.al. 2204.04752v1 link
2022-04-10 Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog Shunyu Zhang et.al. 2204.04680v1 null
2022-04-10 FedCorr: Multi-Stage Federated Learning for Label Noise Correction Jingyi Xu et.al. 2204.04677v1 link
2022-04-10 NAN: Noise-Aware NeRFs for Burst-Denoising Naama Pearl et.al. 2204.04668v1 null
2022-04-10 Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation Xiangtai Li et.al. 2204.04656v1 link
2022-04-10 Learning Pixel-Level Distinctions for Video Highlight Detection Fanyue Wei et.al. 2204.04615v1 null
2022-04-10 Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention Yu Yang et.al. 2204.04601v1 link
2022-04-10 Robust Cross-Modal Representation Learning with Progressive Self-Distillation Alex Andonian et.al. 2204.04588v1 null
2022-04-09 Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification Jiangtao Xie et.al. 2204.04567v1 null
2022-04-09 Multimodal Transformer for Nursing Activity Recognition Momal Ijaz et.al. 2204.04564v1 null
2022-04-09 DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides Parmida Ghahremani et.al. 2204.04494v1 link
2022-04-09 ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation Jianan Wang et.al. 2204.04428v1 null
2022-04-09 Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning Daniel Becking et.al. 2204.04424v1 null
2022-04-09 The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization Zeyi Huang et.al. 2204.04384v1 link
2022-04-08 Dancing under the stars: video denoising in starlight Kristina Monakhova et.al. 2204.04210v1 null
2022-04-08 General Incremental Learning with Domain-aware Categorical Representations Jiangwei Xie et.al. 2204.04078v1 null
2022-04-08 Identifying Ambiguous Similarity Conditions via Semantic Matching Han-Jia Ye et.al. 2204.04053v1 null
2022-04-08 Probabilistic Representations for Video Contrastive Learning Jungin Park et.al. 2204.03946v1 null
2022-04-08 Does Robustness on ImageNet Transfer to Downstream Tasks? Yutaro Yamada et.al. 2204.03934v1 null
2022-04-08 Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection Chunyu Li et.al. 2204.03929v1 null
2022-04-08 CD$^2$-pFed: Cyclic Distillation-guided Channel Decoupling for Model Personalization in Federated Learning Yiqing Shen et.al. 2204.03880v1 null
2022-04-08 Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation Lin Chen et.al. 2204.03838v1 link
2022-04-07 TorMentor: Deterministic dynamic-path, data augmentations with fractals Anguelos Nicolaou et.al. 2204.03776v1 null
2022-04-07 Gravitationally Lensed Black Hole Emission Tomography Aviad Levis et.al. 2204.03715v1 null
2022-04-07 TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates You Xie et.al. 2204.03671v1 null
2022-04-07 Total Variation Optimization Layers for Computer Vision Raymond A. Yeh et.al. 2204.03643v1 link
2022-04-07 Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction Kalyan Vasudev Alwala et.al. 2204.03642v1 null
2022-04-07 Unsupervised Image-to-Image Translation with Generative Prior Shuai Yang et.al. 2204.03641v1 link
2022-04-07 Class-Incremental Learning with Strong Pre-trained Models Tz-Ying Wu et.al. 2204.03634v1 null
2022-04-07 Unified Contrastive Learning in Image-Text-Label Space Jianwei Yang et.al. 2204.03610v1 link
2022-04-07 Pin the Memory: Learning to Generalize Semantic Segmentation Jin Kim et.al. 2204.03609v1 null
2022-04-07 AutoRF: Learning 3D Object Radiance Fields from Single View Observations Norman MĂĽller et.al. 2204.03593v1 null
2022-04-07 Many-to-many Splatting for Efficient Video Frame Interpolation Ping Hu et.al. 2204.03513v1 link
2022-04-07 Deep Visual Geo-localization Benchmark Gabriele Berton et.al. 2204.03444v1 link
2022-04-07 PSTR: End-to-End One-Step Person Search With Transformers Jiale Cao et.al. 2204.03340v1 link
2022-04-07 Coarse-to-Fine Feature Mining for Video Semantic Segmentation Guolei Sun et.al. 2204.03330v1 link
2022-04-07 L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation Peng-Tao Jiang et.al. 2204.03206v1 null
2022-04-07 Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality Tristan Thrush et.al. 2204.03162v1 null
2022-04-06 AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis Zhiqin Chen et.al. 2204.03105v1 null
2022-04-06 Hierarchical Self-supervised Representation Learning for Movie Understanding Fanyi Xiao et.al. 2204.03101v1 null
2022-04-06 Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency Zhiwu Qing et.al. 2204.03017v1 null
2022-04-06 Multi-Scale Memory-Based Video Deblurring Bo Ji et.al. 2204.02977v1 link
2022-04-06 Temporal Alignment Networks for Long-term Video Tengda Han et.al. 2204.02968v1 null
2022-04-06 "The Pedestrian next to the Lamppost" Adaptive Object Graphs for Better Instantaneous Mapping Avishkar Saha et.al. 2204.02944v1 null
2022-04-06 An Empirical Study of End-to-End Temporal Action Detection Xiaolong Liu et.al. 2204.02932v1 link
2022-04-06 Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network Byung-Kwan Lee et.al. 2204.02738v1 null
2022-04-06 Aesthetic Text Logo Synthesis via Content-aware Layout Inferring Yizhi Wang et.al. 2204.02701v1 link
2022-04-06 Towards An End-to-End Framework for Flow-Guided Video Inpainting Zhen Li et.al. 2204.02663v2 link
2022-04-06 Towards Robust Adaptive Object Detection under Noisy Annotations Xinyu Liu et.al. 2204.02620v1 link
2022-04-06 Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification Yanan Wang et.al. 2204.02611v2 link
2022-04-06 Learning to Anticipate Future with Dynamic Context Removal Xinyu Xu et.al. 2204.02587v1 null
2022-04-06 SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference Krishna Wadhwani et.al. 2204.02585v2 null
2022-04-06 FocalClick: Towards Practical Interactive Image Segmentation Xi Chen et.al. 2204.02574v1 link
2022-04-06 Gait Recognition in the Wild with Dense 3D Representations and A Benchmark Jinkai Zheng et.al. 2204.02569v1 link
2022-04-06 MixFormer: Mixing Features across Windows and Dimensions Qiang Chen et.al. 2204.02557v1 link
2022-04-06 RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection Umar Khalid et.al. 2204.02553v1 link
2022-04-06 Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation Wangbo Zhao et.al. 2204.02547v1 link
2022-04-05 Adversarial Robustness through the Lens of Convolutional Filters Paul Gavrikov et.al. 2204.02481v1 link
2022-04-05 Learning Optimal K-space Acquisition and Reconstruction using Physics-Informed Neural Networks Wei Peng et.al. 2204.02480v1 null
2022-04-05 ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer Ruohan Gao et.al. 2204.02389v1 link
2022-04-05 Neural Convolutional Surfaces Luca Morreale et.al. 2204.02289v1 null
2022-04-05 Rethinking Visual Geo-localization for Large-Scale Applications Gabriele Berton et.al. 2204.02287v1 link
2022-04-05 Arbitrary-Scale Image Synthesis Evangelos Ntavelis et.al. 2204.02273v1 link
2022-04-05 IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images Kai Zhang et.al. 2204.02232v1 null
2022-04-05 SNUG: Self-Supervised Neural Dynamic Garments Igor Santesteban et.al. 2204.02219v1 link
2022-04-05 Multi-View Transformer for 3D Visual Grounding Shijia Huang et.al. 2204.02174v1 link
2022-04-05 Leveraging Equivariant Features for Absolute Pose Regression Mohamed Adel Musallam et.al. 2204.02163v1 null
2022-04-05 Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition Mingfei Han et.al. 2204.02148v2 null
2022-04-05 Detector-Free Weakly Supervised Group Activity Recognition Dongkeun Kim et.al. 2204.02139v1 null
2022-04-05 Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation Tao Feng et.al. 2204.02136v1 link
2022-04-05 P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior Vaishakh Patil et.al. 2204.02091v1 link
2022-04-05 Text Spotting Transformers Xiang Zhang et.al. 2204.01918v1 link
2022-04-04 Revisiting Near/Remote Sensing with Geospatial Attention Scott Workman et.al. 2204.01807v1 null
2022-04-04 Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos Shaowei Liu et.al. 2204.01696v1 null
2022-04-04 LISA: Learning Implicit Shape and Appearance of Hands Enric Corona et.al. 2204.01695v1 null
2022-04-04 Exemplar-bsaed Pattern Synthesis with Implicit Periodic Field Network Haiwei Chen et.al. 2204.01671v1 null
2022-04-04 FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation Sohyun Lee et.al. 2204.01587v1 null
2022-04-04 Unsupervised Learning of Accurate Siamese Tracking Qiuhong Shen et.al. 2204.01475v1 link
2022-04-04 Correlation Verification for Image Retrieval Seongwon Lee et.al. 2204.01458v1 link
2022-04-04 WildNet: Learning Domain Generalized Semantic Segmentation from the Wild Suhyeon Lee et.al. 2204.01446v1 link
2022-04-04 Degradation-agnostic Correspondence from Resolution-asymmetric Stereo Xihao Chen et.al. 2204.01429v1 null
2022-04-04 RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo Junhua Xi et.al. 2204.01320v1 null
2022-04-03 Exploiting Temporal Relations on Radar Perception for Autonomous Driving Peizhao Li et.al. 2204.01184v1 null
2022-04-03 BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion Kejie Li et.al. 2204.01139v1 null
2022-04-03 ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression Framework Ningkai Mo et.al. 2204.01080v1 null
2022-04-03 Style-Based Global Appearance Flow for Virtual Try-On Sen He et.al. 2204.01046v1 link
2022-04-03 STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes Peishan Cong et.al. 2204.01026v1 link
2022-04-03 TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting Huazhang Hu et.al. 2204.01018v1 link
2022-04-03 Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature Zhixiang Wang et.al. 2204.00974v1 link
2022-04-03 DST: Dynamic Substitute Training for Data-free Black-box Attack Wenxuan Wang et.al. 2204.00972v1 null
2022-04-03 AdaFace: Quality Adaptive Margin for Face Recognition Minchul Kim et.al. 2204.00964v1 link
2022-04-02 Matching Feature Sets for Few-Shot Image Classification Arman Afrasiyabi et.al. 2204.00949v1 null
2022-04-02 Progressive Minimal Path Method with Embedded CNN Wei Liao et.al. 2204.00944v1 null
2022-04-02 Class-Incremental Learning by Knowledge Distillation with Adaptive Feature Consolidation Minsoo Kang et.al. 2204.00895v1 link
2022-04-02 Online Convolutional Re-parameterization Mu Hu et.al. 2204.00826v1 null
2022-04-02 Semantic-Aware Domain Generalized Segmentation Duo Peng et.al. 2204.00822v1 link
2022-04-02 R(Det)^2: Randomized Decision Routing for Object Detection Ya-Li Li et.al. 2204.00794v1 null
2022-04-02 Homography Loss for Monocular 3D Object Detection Jiaqi Gu et.al. 2204.00754v1 link
2022-04-02 What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions A S M Iftekhar et.al. 2204.00746v1 null
2022-04-01 Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes Samrudhdhi B. Rangrej et.al. 2204.00656v1 null
2022-04-01 Robust Neonatal Face Detection in Real-world Clinical Settings Jacqueline Hausmann et.al. 2204.00655v1 null
2022-04-01 SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks Xianling Zhang et.al. 2204.00644v1 null
2022-04-01 On the Importance of Asymmetry for Siamese Representation Learning Xiao Wang et.al. 2204.00613v1 link
2022-04-01 Proper Reuse of Image Classification Features Improves Object Detection Cristina Vasconcelos et.al. 2204.00484v1 null
2022-04-01 Marginal Contrastive Correspondence for Guided Image Generation Fangneng Zhan et.al. 2204.00442v1 null
2022-04-01 Learning to Deblur using Light Field Generated and Real Defocus Images Lingyan Ruan et.al. 2204.00367v1 link
2022-04-01 DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow Zihua Zheng et.al. 2204.00330v1 link
2022-04-01 CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection Yanan Zhang et.al. 2204.00325v1 null
2022-04-01 Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression Qiang Li et.al. 2204.00309v1 null
2022-04-01 Perception Prioritized Training of Diffusion Models Jooyoung Choi et.al. 2204.00227v1 link
2022-04-01 Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization Eunji Kim et.al. 2204.00220v1 null
2022-04-01 GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature Biyang Liu et.al. 2204.00179v1 link
2022-04-01 LASER: LAtent SpacE Rendering for 2D Visual Localization Zhixiang Min et.al. 2204.00157v1 null
2022-03-31 TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization Sijie Zhu et.al. 2204.00097v1 link
2022-03-31 Efficient Maximal Coding Rate Reduction by Variational Forms Christina Baek et.al. 2204.00077v1 null
2022-03-31 Improving Adversarial Transferability via Neuron Attribution-Based Attacks Jianping Zhang et.al. 2204.00008v1 link
2022-03-31 Bringing Old Films Back to Life Ziyu Wan et.al. 2203.17276v1 link
2022-03-31 TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing Yanbo Xu et.al. 2203.17266v1 link
2022-03-31 Generating High Fidelity Data from Low-density Regions using Diffusion Models Vikash Sehwag et.al. 2203.17260v1 null
2022-03-31 Continuous Scene Representations for Embodied AI Samir Yitzhak Gadre et.al. 2203.17251v1 null
2022-03-31 Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions Van Nguyen Nguyen et.al. 2203.17234v1 link
2022-03-31 SimVQA: Exploring Simulated Environments for Visual Question Answering Paola Cascante-Bonilla et.al. 2203.17219v1 null
2022-03-31 Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy Tong Zhang et.al. 2203.17205v1 null
2022-03-31 Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion Stepan Tulyakov et.al. 2203.17191v1 null
2022-03-31 AEGNN: Asynchronous Event-based Graph Neural Networks Simon Schaefer et.al. 2203.17149v1 null
2022-03-31 It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher Kanghyun Choi et.al. 2203.17008v2 link
2022-03-31 Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond Yi Yu et.al. 2203.16931v1 link
2022-03-31 End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps Ke Guo et.al. 2203.16910v1 link
2022-03-31 Multi-Granularity Alignment Domain Adaptation for Object Detection Wenzhang Zhou et.al. 2203.16897v1 null
2022-03-31 CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow Xiuchao Sui et.al. 2203.16896v1 link
2022-03-31 Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds Zhao Jin et.al. 2203.16895v1 link
2022-03-31 Towards Driving-Oriented Metric for Lane Detection Models Takami Sato et.al. 2203.16851v1 link
2022-03-31 Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization Junyu Gao et.al. 2203.16800v1 link
2022-03-31 Deformable Video Transformer Jue Wang et.al. 2203.16795v1 null
2022-03-31 Reflection and Rotation Symmetry Detection via Equivariant Learning Ahyun Seo et.al. 2203.16787v1 null
2022-03-31 ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval Mengjun Cheng et.al. 2203.16778v1 null
2022-03-31 ReSTR: Convolution-free Referring Image Segmentation Using Transformers Namyup Kim et.al. 2203.16768v1 null
2022-03-31 MeMOT: Multi-Object Tracking with Memory Jiarui Cai et.al. 2203.16761v1 null
2022-03-31 Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models Feng Cheng et.al. 2203.16755v1 null
2022-03-31 Personalized Image Aesthetics Assessment with Rich Attributes Yuzhe Yang et.al. 2203.16754v1 null
2022-03-31 Exploiting Explainable Metrics for Augmented SGD Mahdi S. Hosseini et.al. 2203.16723v1 link
2022-03-30 Task Adaptive Parameter Sharing for Multi-Task Learning Matthew Wallingford et.al. 2203.16708v1 null
2022-03-30 Face Relighting with Geometrically Consistent Shadows Andrew Hou et.al. 2203.16681v1 link
2022-03-30 Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination Yiqun Mei et.al. 2203.16669v1 null
2022-03-30 Learning Local Displacements for Point Cloud Completion Yida Wang et.al. 2203.16600v1 null
2022-03-30 Constrained Few-shot Class-incremental Learning Michael Hersche et.al. 2203.16588v1 link
2022-03-30 Large-Scale Pre-training for Person Re-identification with Noisy Labels Dengpan Fu et.al. 2203.16533v1 link
2022-03-30 Understanding 3D Object Articulation in Internet Videos Shengyi Qian et.al. 2203.16531v1 null
2022-03-30 CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism Jiahui Lei et.al. 2203.16529v1 null
2022-03-30 Collaborative Transformers for Grounded Situation Recognition Junhyeong Cho et.al. 2203.16518v1 link
2022-03-30 Unseen Classes at a Later Time? No Problem Hari Chandana Kuchibhotla et.al. 2203.16517v1 link
2022-03-30 Fast Light-Weight Near-Field Photometric Stereo Daniel Lichy et.al. 2203.16515v1 null
2022-03-30 AdaMixer: A Fast-Converging Query-Based Object Detector Ziteng Gao et.al. 2203.16507v1 link
2022-03-30 Fast, Accurate and Memory-Efficient Partial Permutation Synchronization Shaohan Li et.al. 2203.16505v1 null
2022-03-30 TubeDETR: Spatio-Temporal Video Grounding with Transformers Antoine Yang et.al. 2203.16434v1 link
2022-03-30 Balanced MSE for Imbalanced Visual Regression Jiawei Ren et.al. 2203.16427v1 link
2022-03-30 Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain Lina Guo et.al. 2203.16357v1 null
2022-03-30 Multi-Robot Active Mapping via Neural Bipartite Graph Matching Kai Ye et.al. 2203.16319v1 null
2022-03-30 Forecasting from LiDAR via Future Object Detection Neehar Peri et.al. 2203.16297v1 null
2022-03-30 Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data Corentin Sautier et.al. 2203.16258v1 link
2022-03-30 InstaFormer: Instance-Aware Image-to-Image Translation with Transformer Soohyun Kim et.al. 2203.16248v1 null
2022-03-30 Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection Jinyuan Liu et.al. 2203.16220v1 link
2022-03-30 Learning of Global Objective for Network Flow in Multi-Object Tracking Shuai Li et.al. 2203.16210v1 null
2022-03-30 Fair Contrastive Learning for Facial Attribute Classification Sungho Park et.al. 2203.16209v1 link
2022-03-30 Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation Shuying Liu et.al. 2203.16202v1 null
2022-03-30 On the Road to Online Adaptation for Semantic Image Segmentation Riccardo Volpi et.al. 2203.16195v1 link
2022-03-30 FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing Rishubh Singh et.al. 2203.16168v1 null
2022-03-30 Global Tracking via Ensemble of Local Trackers Zikun Zhou et.al. 2203.16092v1 link
2022-03-30 Omni-DETR: Omni-Supervised Object Detection with Transformers Pei Wang et.al. 2203.16089v1 null
2022-03-30 STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction Zheng Chang et.al. 2203.16084v1 null
2022-03-30 Learning Program Representations for Food Images and Cooking Recipes Dim P. Papadopoulos et.al. 2203.16071v1 null
2022-03-30 AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval Riku Togashi et.al. 2203.16062v1 null
2022-03-30 Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction Tiezheng Ma et.al. 2203.16051v1 link
2022-03-30 Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds Minhyun Lee et.al. 2203.16045v1 link
2022-03-30 Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels Jiwon Kim et.al. 2203.16038v1 null
2022-03-30 Iterative Deep Homography Estimation Si-Yuan Cao et.al. 2203.15982v1 link
2022-03-29 StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis Zhiheng Li et.al. 2203.15799v1 link
2022-03-29 CHEX: CHannel EXploration for CNN Model Compression Zejiang Hou et.al. 2203.15794v1 null
2022-03-29 FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering Yingda Yin et.al. 2203.15765v1 null
2022-03-29 Integrative Few-Shot Learning for Classification and Segmentation Dahyun Kang et.al. 2203.15712v1 null
2022-03-29 OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction Lixin Yang et.al. 2203.15709v1 link
2022-03-29 EnvEdit: Environment Editing for Vision-and-Language Navigation Jialu Li et.al. 2203.15685v1 link
2022-03-29 Exploring Frequency Adversarial Attacks for Face Forgery Detection Shuai Jia et.al. 2203.15674v1 null
2022-03-29 PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision Kehong Gong et.al. 2203.15625v1 null
2022-03-29 Learning a Structured Latent Space for Unsupervised Point Cloud Completion Yingjie Cai et.al. 2203.15580v1 null
2022-03-29 BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information Nadine Rueegg et.al. 2203.15536v1 null
2022-03-29 OSOP: A Multi-Stage One Shot Object Pose Estimation Framework Ivan Shugurov et.al. 2203.15533v1 null
2022-03-29 Learning Structured Gaussians to Approximate Deep Ensembles Ivor J. A. Simpson et.al. 2203.15485v1 null
2022-03-29 Clean Implicit 3D Structure from Noisy 2D STEM Images Hannah Kniesel et.al. 2203.15434v1 link
2022-03-29 Long-term Video Frame Interpolation via Feature Propagation Dawit Mureja Argaw et.al. 2203.15427v1 null
2022-03-29 Quantifying Societal Bias Amplification in Image Captioning Yusuke Hirota et.al. 2203.15395v1 null
2022-03-29 Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification Shi Pu et.al. 2203.15381v1 link
2022-03-29 A Style-aware Discriminator for Controllable Image Translation Kunhee Kim et.al. 2203.15375v1 null
2022-03-29 Self-Supervised Image Representation Learning with Geometric Set Consistency Nenglun Chen et.al. 2203.15361v1 null
2022-03-29 Nested Collaborative Learning for Long-Tailed Visual Recognition Jun Li et.al. 2203.15359v1 link
2022-03-29 Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries Jihwan Bang et.al. 2203.15355v1 link
2022-03-29 SIOD: Single Instance Annotated Per Category Per Image for Object Detection Hanjun Li et.al. 2203.15353v1 link
2022-03-29 Task-specific Inconsistency Alignment for Domain Adaptive Object Detection Liang Zhao et.al. 2203.15345v1 null
2022-03-29 Balanced Multimodal Learning via On-the-fly Gradient Modulation Xiaokang Peng et.al. 2203.15332v1 null
2022-03-29 CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters Paul Gavrikov et.al. 2203.15331v1 link
2022-03-29 Dressing in the Wild by Watching Dance Videos Xin Dong et.al. 2203.15320v1 null
2022-03-29 Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes Dongkwon Jin et.al. 2203.15302v1 null
2022-03-29 Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation Jogendra Nath Kundu et.al. 2203.15293v1 null
2022-03-29 MAT: Mask-Aware Transformer for Large Hole Image Inpainting Wenbo Li et.al. 2203.15270v1 link
2022-03-29 Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation Wonhui Park et.al. 2203.15259v1 null
2022-03-29 Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian Jihyun Lee et.al. 2203.15235v1 null
2022-03-28 Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning Minghao Chen et.al. 2203.14957v1 link
2022-03-28 GIRAFFE HD: A High-Resolution 3D-aware Generative Model Yang Xue et.al. 2203.14954v1 null
2022-03-28 Energy-based Latent Aligner for Incremental Learning K J Joseph et.al. 2203.14952v1 link
2022-03-28 Controllable Dynamic Multi-Task Architectures Dripta S. Raychaudhuri et.al. 2203.14949v1 null
2022-03-28 Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model Yu Du et.al. 2203.14940v1 link
2022-03-28 Attributable Visual Similarity Learning Borui Zhang et.al. 2203.14932v1 link
2022-03-28 Expanding Low-Density Latent Regions for Open-Set Object Detection Jiaming Han et.al. 2203.14911v1 link
2022-03-28 Optimizing Elimination Templates by Greedy Parameter Search Evgeniy Martyushev et.al. 2203.14901v1 null
2022-03-28 Learning Where to Learn in Cross-View Self-Supervised Learning Lang Huang et.al. 2203.14898v1 null
2022-03-28 Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches Ayan Kumar Bhunia et.al. 2203.14843v1 null
2022-03-28 Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval Ayan Kumar Bhunia et.al. 2203.14817v1 link
2022-03-28 Partially Does It: Towards Scene-Level FG-SBIR with Partial Input Pinaki Nath Chowdhury et.al. 2203.14804v1 null
2022-03-28 Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities Fadime Sener et.al. 2203.14712v1 link
2022-03-28 MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection Bumsoo Kim et.al. 2203.14709v1 null
2022-03-28 Sketch3T: Test-Time Training for Zero-Shot SBIR Aneeshan Sain et.al. 2203.14691v1 null
2022-03-28 Brain-inspired Multilayer Perceptron with Spiking Neurons Wenshuo Li et.al. 2203.14679v1 null
2022-03-28 Part-based Pseudo Label Refinement for Unsupervised Person Re-identification Yoonki Cho et.al. 2203.14675v1 null
2022-03-28 Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation Naofumi Akimoto et.al. 2203.14668v1 null
2022-03-28 FS6D: Few-Shot 6D Pose Estimation of Novel Objects Yisheng He et.al. 2203.14628v1 link
2022-03-28 Towards Implicit Text-Guided 3D Shape Generation Zhengzhe Liu et.al. 2203.14622v1 link
2022-03-28 Demystifying the Neural Tangent Kernel from a Practical Perspective: Can it be trusted for Neural Architecture Search without training? Jisoo Mok et.al. 2203.14577v1 link
2022-03-28 HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network JoonKyu Park et.al. 2203.14564v1 null
2022-03-28 Reference-based Video Super-Resolution Using Multi-Camera Video Triplets Junyong Lee et.al. 2203.14537v1 link
2022-03-28 Uni6D: A Unified CNN Framework without Projection Breakdown for 6D Pose Estimation Xiaoke Jiang et.al. 2203.14531v1 null
2022-03-28 REGTR: End-to-end Point Cloud Correspondences with Transformers Zi Jian Yew et.al. 2203.14517v1 link
2022-03-28 ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations Mingwu Zheng et.al. 2203.14510v1 null
2022-03-28 Automated Progressive Learning for Efficient Training of Vision Transformers Changlin Li et.al. 2203.14509v1 link
2022-03-28 Stratified Transformer for 3D Point Cloud Segmentation Xin Lai et.al. 2203.14508v1 link
2022-03-28 Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection Choubo Ding et.al. 2203.14506v1 link
2022-03-28 NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge Duc Minh Vo et.al. 2203.14499v1 null
2022-03-25 Versatile Multi-Modal Pre-Training for Human-Centric Perception Fangzhou Hong et.al. 2203.13815v1 null
2022-03-25 Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion Tianpei Gu et.al. 2203.13777v1 link
2022-03-25 Searching for Network Width with Bilaterally Coupled Network Xiu Su et.al. 2203.13714v1 link
2022-03-25 Unsupervised Pre-training for Temporal Action Localization Tasks Can Zhang et.al. 2203.13609v1 null
2022-03-25 Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task Xiaoqing Ye et.al. 2203.13608v1 null
2022-03-25 Continual Test-Time Domain Adaptation Qin Wang et.al. 2203.13591v1 link
2022-03-25 Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation Jinheng Xie et.al. 2203.13505v1 null
2022-03-25 Non-Probability Sampling Network for Stochastic Human Trajectory Prediction Inhwan Bae et.al. 2203.13471v1 null
2022-03-25 CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification Philip Chikontwe et.al. 2203.13465v1 null
2022-03-25 MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis Liwen Xu et.al. 2203.13443v1 null
2022-03-25 Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation? Zhenyu Wang et.al. 2203.13427v1 null
2022-03-25 Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes Zengjie Song et.al. 2203.13412v1 null
2022-03-25 Point2Seq: Detecting 3D Objects as Sequences Yujing Xue et.al. 2203.13394v1 null
2022-03-24 Probing Representation Forgetting in Supervised and Unsupervised Continual Learning MohammadReza Davari et.al. 2203.13381v1 null
2022-03-24 NPBG++: Accelerating Neural Point-Based Graphics Ruslan Rakhimov et.al. 2203.13318v1 null
2022-03-24 SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation Chenming Zhu et.al. 2203.13312v1 null
2022-03-24 MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection Renrui Zhang et.al. 2203.13310v1 link
2022-03-24 Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos Reza Ghoddoosian et.al. 2203.13309v1 null
2022-03-24 EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation Hansheng Chen et.al. 2203.13254v1 link
2022-03-24 Global Tracking Transformers Xingyi Zhou et.al. 2203.13250v1 link
2022-03-24 Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer Shuai Yang et.al. 2203.13248v1 link
2022-03-24 Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation Xian Liu et.al. 2203.13161v1 link
2022-03-24 Moving Window Regression: A Novel Approach to Ordinal Regression Nyeong-Ho Shin et.al. 2203.13122v1 link
2022-03-24 Egocentric Prediction of Action Target in 3D Yiming Li et.al. 2203.13116v1 null
2022-03-24 AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception Shaoyu Chen et.al. 2203.13090v1 link
2022-03-24 Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory Li Siyao et.al. 2203.13055v2 link
2022-03-24 CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image Reyhaneh Neshatavar et.al. 2203.13009v1 link
2022-03-24 Compound Domain Generalization via Meta-Knowledge Encoding Chaoqi Chen et.al. 2203.13006v1 null
2022-03-24 Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction M. Saquib Sarfraz et.al. 2203.12997v1 link
2022-03-24 WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation Yingzhi Tang et.al. 2203.12917v1 link
2022-03-24 Neural Reflectance for Shape Recovery with Shadow Handling Junxuan Li et.al. 2203.12909v1 link
2022-03-24 RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization Yan Xu et.al. 2203.12870v1 null
2022-03-24 DyRep: Bootstrapping Training with Dynamic Re-parameterization Tao Huang et.al. 2203.12868v1 link
2022-03-24 Beyond Fixation: Dynamic Window Visual Transformer Pengzhen Ren et.al. 2203.12856v1 link
2022-03-24 Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation Jinchao Yang et.al. 2203.12835v1 link
2022-03-24 Sparse Instance Activation for Real-Time Instance Segmentation Tianheng Cheng et.al. 2203.12827v1 link
2022-03-24 Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera Jae Shin Yoon et.al. 2203.12780v1 null
2022-03-23 Multidimensional Belief Quantification for Label-Efficient Meta-Learning Deep Pandey et.al. 2203.12768v1 null
2022-03-23 UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection Ye Liu et.al. 2203.12745v1 link
2022-03-23 Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation Yanwu Xu et.al. 2203.12707v1 link
2022-03-23 DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation Aysim Toker et.al. 2203.12560v1 null
2022-03-23 Transformer-based Multimodal Information Fusion for Facial Expression Analysis Wei Zhang et.al. 2203.12367v1 null
2022-03-23 How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs Hazel Doughty et.al. 2203.12344v1 link
2022-03-23 Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin Hangyu Li et.al. 2203.12341v2 link
2022-03-23 Real-time Object Detection for Streaming Perception Jinrong Yang et.al. 2203.12338v1 link
2022-03-23 DR.VIC: Decomposition and Reasoning for Video Individual Counting Tao Han et.al. 2203.12335v1 link
2022-03-23 Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization Wei Dong et.al. 2203.12265v1 link
2022-03-23 Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition Junho Kim et.al. 2203.12247v1 null
2022-03-23 Training-free Transformer Architecture Search Qinqin Zhou et.al. 2203.12217v1 null
2022-03-23 Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection Liang Chen et.al. 2203.12208v1 link
2022-03-23 Unifying Motion Deblurring and Frame Interpolation with Events Xiang Zhang et.al. 2203.12178v1 null
2022-03-22 PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo Jiachen Liu et.al. 2203.12082v1 null
2022-03-22 DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification Hongrun Zhang et.al. 2203.12081v1 null
2022-03-22 φ-SfT: Shape-from-Template with a Physics-Based Deformation Model Navami Kairanda et.al. 2203.11938v1 null
2022-03-22 Learning from All Vehicles Dian Chen et.al. 2203.11934v1 link
2022-03-22 Dataset Distillation by Matching Training Trajectories George Cazenavette et.al. 2203.11932v1 link
2022-03-22 GradViT: Gradient Inversion of Vision Transformers Ali Hatamizadeh et.al. 2203.11894v1 null
2022-03-22 AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network Wooseok Lee et.al. 2203.11799v1 link
2022-03-22 Exploring and Evaluating Image Restoration Potential in Dynamic Scenes Cheng Zhang et.al. 2203.11754v2 link
2022-03-22 FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction Liang Gao et.al. 2203.11751v1 link
2022-03-22 Meta-attention for ViT-backed Continual Learning Mengqi Xue et.al. 2203.11684v1 link
2022-03-22 Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos Tomáš Souček et.al. 2203.11637v1 link
2022-03-22 IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment Yiming Zeng et.al. 2203.11590v1 link
2022-03-22 Out-of-distribution Generalization with Causal Invariant Transformations Ruoyu Wang et.al. 2203.11528v2 null
2022-03-22 TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers Xuyang Bai et.al. 2203.11496v1 link
2022-03-22 Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation Jiankun Li et.al. 2203.11483v1 link
2022-03-22 Mixed Differential Privacy in Computer Vision Aditya Golatkar et.al. 2203.11481v1 null
2022-03-22 Remember Intentions: Retrospective-Memory-based Trajectory Prediction Chenxin Xu et.al. 2203.11474v1 link
2022-03-22 Federated Class-Incremental Learning Jiahua Dong et.al. 2203.11473v1 link
2022-03-22 Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization Yu Zhan et.al. 2203.11471v1 link
2022-03-21 Global Matching with Overlapping Attention for Optical Flow Estimation Shiyu Zhao et.al. 2203.11335v1 link
2022-03-21 NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction Xiaoshuai Zhang et.al. 2203.11283v1 null
2022-03-21 Transforming Model Prediction for Tracking Christoph Mayer et.al. 2203.11192v1 link
2022-03-21 DiffPoseNet: Direct Differentiable Camera Pose Estimation Chethan M. Parameshwara et.al. 2203.11174v1 null
2022-03-21 Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds Yifan Zhang et.al. 2203.11139v1 link
2022-03-21 No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces Jia-Xing Zhong et.al. 2203.11113v1 link
2022-03-21 MixFormer: End-to-End Tracking with Iterative Mixed Attention Yutao Cui et.al. 2203.11082v1 link
2022-03-21 MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer Kuan-Chih Huang et.al. 2203.10981v1 link
2022-03-21 Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression Xiaosu Zhu et.al. 2203.10897v1 link
2022-03-21 Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective Jiawei Zhang et.al. 2203.10887v1 link
2022-03-21 ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding Dailan He et.al. 2203.10886v1 null
2022-03-21 RGB-Depth Fusion GAN for Indoor Depth Completion Haowen Wang et.al. 2203.10856v1 null
2022-03-21 Hyperbolic Vision Transformers: Combining Improvements in Metric Learning Aleksandr Ermolov et.al. 2203.10833v2 link
2022-03-21 ViM: Out-Of-Distribution with Virtual-logit Matching Haoqi Wang et.al. 2203.10807v1 link
2022-03-21 Delving into the Estimation Shift of Batch Normalization in a Network Lei Huang et.al. 2203.10778v1 link
2022-03-21 Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation Zhiyuan Liang et.al. 2203.10739v1 null
2022-03-21 HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network Chang Yu et.al. 2203.10699v1 null
2022-03-20 Unsupervised Domain Adaptation for Nighttime Aerial Tracking Junjie Ye et.al. 2203.10541v1 link
2022-03-20 Depth Estimation by Combining Binocular Stereo and Monocular Structured-Light Yuhua Xu et.al. 2203.10493v1 null
2022-03-20 SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization Canjie Luo et.al. 2203.10492v1 link
2022-03-20 TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing Jierun Chen et.al. 2203.10489v1 link
2022-03-20 Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data Junfeng Lyu et.al. 2203.10474v1 link
2022-03-19 CLRNet: Cross Layer Refinement Network for Lane Detection Tu Zheng et.al. 2203.10350v1 null
2022-03-19 Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds Chenhang He et.al. 2203.10314v1 link
2022-03-19 DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition Thanh-Dat Truong et.al. 2203.10233v1 link
2022-03-19 SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition Mingxin Huang et.al. 2203.10209v1 link
2022-03-18 Discovering Objects that Can Move Zhipeng Bao et.al. 2203.10159v1 null
2022-03-18 Fourier Document Restoration for Robust Document Dewarping and Recognition Chuhui Xue et.al. 2203.09910v1 null
2022-03-18 Learning Affordance Grounding from Exocentric Images Hongchen Luo et.al. 2203.09905v1 link
2022-03-18 DTA: Physical Camouflage Attacks using Differentiable Transformation Network Naufal Suryanto et.al. 2203.09831v1 null
2022-03-18 Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices? Cho-Ying Wu et.al. 2203.09824v1 null
2022-03-18 Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation Xingning Dong et.al. 2203.09811v1 link
2022-03-18 Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion Xiaopei Wu et.al. 2203.09780v1 null
2022-03-18 ContrastMask: Contrastive Learning to Segment Every Thing Xuehui Wang et.al. 2203.09775v1 null
2022-03-18 Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation Ruihuang Li et.al. 2203.09744v1 link
2022-03-18 A Dual Weighting Label Assignment Scheme for Object Detection Shuai Li et.al. 2203.09730v1 link
2022-03-18 VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention Shengheng Deng et.al. 2203.09704v1 link
2022-03-17 Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation Tianfei Zhou et.al. 2203.09653v1 link
2022-03-17 Cascade Transformers for End-to-End Person Search Rui Yu et.al. 2203.09642v1 link
2022-03-17 AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation Paritosh Mittal et.al. 2203.09516v1 null
2022-03-17 FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos Yan Wang et.al. 2203.09463v1 null
2022-03-17 Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image Xuanchi Ren et.al. 2203.09457v1 null
2022-03-17 Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces from 3D MRI Scans with Geometric Deep Neural Networks Fabian Bongratz et.al. 2203.09446v2 null
2022-03-17 ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation Yongzhi Su et.al. 2203.09418v1 link
2022-03-17 Bi-directional Object-context Prioritization Learning for Saliency Ranking Xin Tian et.al. 2203.09416v1 link
2022-03-17 A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution Jianqi Ma et.al. 2203.09388v2 link
2022-03-17 Interacting Attention Graph for Single Image Two-Hand Reconstruction Mengcheng Li et.al. 2203.09364v2 null
2022-03-17 Object Localization under Single Coarse Point Supervision Xuehui Yu et.al. 2203.09338v1 link
2022-03-17 Modulated Contrast for Versatile Image Synthesis Fangneng Zhan et.al. 2203.09333v1 link
2022-03-17 Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning Lin Zhang et.al. 2203.09249v1 null
2022-03-17 Neural Compression-Based Feature Learning for Video Restoration Cong Huang et.al. 2203.09208v2 null
2022-03-17 Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution Jie Liang et.al. 2203.09195v1 link
2022-03-17 MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering Yang Ding et.al. 2203.09138v1 link
2022-03-17 Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning Haoxiang Wang et.al. 2203.09137v1 link
2022-03-17 Improving the Transferability of Targeted Adversarial Examples through Object-Based Diverse Input Junyoung Byun et.al. 2203.09123v1 link
2022-03-17 Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning Yangji He et.al. 2203.09064v1 link
2022-03-17 DATA: Domain-Aware and Task-Aware Pre-training Qing Chang et.al. 2203.09041v1 link
2022-03-16 Decoupled Knowledge Distillation Borui Zhao et.al. 2203.08679v1 null
2022-03-16 Deep vanishing point detection: Geometric priors make dataset variations vanish Yancong Lin et.al. 2203.08586v1 link
2022-03-16 EDTER: Edge Detection with Transformer Mengyang Pu et.al. 2203.08566v1 link
2022-03-16 MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection Qing Lian et.al. 2203.08563v1 null
2022-03-16 Non-isotropy Regularization for Proxy-based Deep Metric Learning Karsten Roth et.al. 2203.08547v1 link
2022-03-16 Integrating Language Guidance into Vision-based Deep Metric Learning Karsten Roth et.al. 2203.08543v1 link
2022-03-16 Scribble-Supervised LiDAR Semantic Segmentation Ozan Unal et.al. 2203.08537v1 link
2022-03-16 Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video Wen-Li Wei et.al. 2203.08534v1 null
2022-03-16 Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors Xinyu Yi et.al. 2203.08528v2 null
2022-03-16 Towards Practical Certifiable Patch Defense with Vision Transformer Zhaoyu Chen et.al. 2203.08519v1 null
2022-03-16 QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation Xueqi Hu et.al. 2203.08483v1 link
2022-03-16 Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding Haojun Jiang et.al. 2203.08481v1 link
2022-03-16 The Devil Is in the Details: Window-based Attention for Image Compression Renjie Zou et.al. 2203.08450v1 link
2022-03-16 Attribute Group Editing for Reliable Few-shot Image Generation Guanqi Ding et.al. 2203.08422v1 link
2022-03-16 Privacy-preserving Online AutoML for Domain-Specific Face Detection Chenqian Yan et.al. 2203.08399v1 null
2022-03-16 Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting Min Shi et.al. 2203.08354v1 null
2022-03-15 DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection Yingwei Li et.al. 2203.08195v1 null
2022-03-15 Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective Gowthami Somepalli et.al. 2203.08124v1 link
2022-03-15 Implicit Feature Decoupling with Depthwise Quantization Iordanis Fostiropoulos et.al. 2203.08080v1 link
2022-03-15 OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction Wenbin Lin et.al. 2203.07977v1 null
2022-03-15 Style Transformer for Image Inversion and Editing Xueqi Hu et.al. 2203.07932v1 link
2022-03-15 GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting Yan Di et.al. 2203.07918v1 link
2022-03-15 Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs Paul Wimmer et.al. 2203.07808v1 null
2022-03-15 Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels Yikai Wang et.al. 2203.07788v1 null
2022-03-15 Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization Yabin Zhang et.al. 2203.07740v1 link
2022-03-15 Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation Zitian Wang et.al. 2203.07697v2 null
2022-03-15 Learning What Not to Segment: A New Perspective on Few-Shot Segmentation Chunbo Lang et.al. 2203.07615v1 link
2022-03-14 Implicit Motion Handling for Video Camouflaged Object Detection Xuelian Cheng et.al. 2203.07363v2 null
2022-03-14 GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors Jingwen He et.al. 2203.07319v1 null
2022-03-14 RCL: Recurrent Continuous Localization for Temporal Action Detection Qiang Wang et.al. 2203.07112v1 null
2022-03-14 Active Learning by Feature Mixing Amin Parvaneh et.al. 2203.07034v1 link
2022-03-14 Rethinking Minimal Sufficient Representation in Contrastive Learning Haoqing Wang et.al. 2203.07004v1 link
2022-03-14 Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots Zejin Wang et.al. 2203.06967v2 link
2022-03-14 UniVIP: A Unified Framework for Self-Supervised Visual Pre-training Zhaowen Li et.al. 2203.06965v1 null
2022-03-14 Forward Compatible Few-Shot Class-Incremental Learning Da-Wei Zhou et.al. 2203.06953v1 link
2022-03-14 XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding Zhangxuan Gu et.al. 2203.06947v2 null
2022-03-14 Accelerating DETR Convergence via Semantic-Aligned Matching Gongjie Zhang et.al. 2203.06883v1 link
2022-03-14 ADAS: A Direct Adaptation Strategy for Multi-Target Domain Adaptive Semantic Segmentation Seunghun Lee et.al. 2203.06811v1 null
2022-03-13 Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs Xiaohan Ding et.al. 2203.06717v1 link
2022-03-13 LAS-AT: Adversarial Training with Learnable Attack Strategy Xiaojun Jia et.al. 2203.06616v1 link
2022-03-13 Depth-Aware Generative Adversarial Network for Talking Head Video Generation Fa-Ting Hong et.al. 2203.06605v2 null
2022-03-13 AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation Xueyi Liu et.al. 2203.06558v1 null
2022-03-13 Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning Jiahao Xia et.al. 2203.06541v1 link
2022-03-12 Kernel Proposal Network for Arbitrary Shape Text Detection Shi-Xue Zhang et.al. 2203.06410v1 null
2022-03-12 SIGMA: Semantic-complete Graph Matching for Domain Adaptive Object Detection Wuyang Li et.al. 2203.06398v1 link
2022-03-12 Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning Kai Zhu et.al. 2203.06359v1 null
2022-03-12 Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation Linfeng Zhang et.al. 2203.06321v1 null
2022-03-12 MISF: Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting Xiaoguang Li et.al. 2203.06304v1 link
2022-03-11 REX: Reasoning-aware and Grounded Explanation Shi Chen et.al. 2203.06107v1 link
2022-03-11 Enhancing Adversarial Training with Second-Order Statistics of Weights Gaojie Jin et.al. 2203.06020v1 link
2022-03-11 Hyperbolic Image Segmentation Mina GhadimiAtigh et.al. 2203.05898v1 link
2022-03-11 WiCV 2021: The Eighth Women In Computer Vision Workshop Arushi Goel et.al. 2203.05825v1 null
2022-03-11 FLAG: Flow-based 3D Avatar Generation from Sparse Observations Sadegh Aliakbarian et.al. 2203.05789v1 null
2022-03-11 Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection Siyue Yu et.al. 2203.05787v1 null
2022-03-11 Learning Distinctive Margin toward Active Domain Adaptation Ming Xie et.al. 2203.05738v1 null
2022-03-10 Point Density-Aware Voxels for LiDAR 3D Object Detection Jordan S. K. Hu et.al. 2203.05662v1 link
2022-03-10 Conditional Prompt Learning for Vision-Language Models Kaiyang Zhou et.al. 2203.05557v1 link
2022-03-10 Representation Compensation Networks for Continual Semantic Segmentation Chang-Bin Zhang et.al. 2203.05402v1 link
2022-03-10 Spatial Commonsense Graph for Object Localisation in Partial Scenes Francesco Giuliari et.al. 2203.05380v1 link
2022-03-10 Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing Zhuo Wang et.al. 2203.05340v2 null
2022-03-10 Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects Manuel Stoiber et.al. 2203.05334v1 link
2022-03-10 GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains Lei Fan et.al. 2203.05306v1 null
2022-03-10 Contrastive Boundary Learning for Point Cloud Segmentation Liyao Tang et.al. 2203.05272v2 link
2022-03-10 Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement Xiuwei Xu et.al. 2203.05238v1 link
2022-03-10 Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability Ruifei He et.al. 2203.05180v1 link
2022-03-10 Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack Ye Liu et.al. 2203.05154v1 link
2022-03-10 Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity Cheng Luo et.al. 2203.05151v1 null
2022-03-10 OpenTAL: Towards Open Set Temporal Action Localization Wentao Bao et.al. 2203.05114v1 link
2022-03-09 NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks Fawaz Sammani et.al. 2203.05081v1 link
2022-03-09 Adaptive Trajectory Prediction via Transferable GNN Yi Xu et.al. 2203.05046v1 null
2022-03-09 Neural Data-Dependent Transform for Learned Image Compression Dezhao Wang et.al. 2203.04963v1 null
2022-03-09 What Matters For Meta-Learning Vision Regression Tasks? Ning Gao et.al. 2203.04905v1 null
2022-03-09 How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting Alessio Monti et.al. 2203.04781v1 null
2022-03-09 SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters Albert Mosella-Montoro et.al. 2203.04746v1 null
2022-03-09 FlexIT: Towards Flexible Semantic Image Translation Guillaume Couairon et.al. 2203.04705v1 null
2022-03-09 ChiTransformer:Towards Reliable Stereo from Cues Qing Su et.al. 2203.04554v1 null
2022-03-08 Dynamic Dual-Output Diffusion Models Yaniv Benny et.al. 2203.04304v1 null
2022-03-08 A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation Yutong Chen et.al. 2203.04287v1 null
2022-03-08 Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences Prune Truong et.al. 2203.04279v1 link
2022-03-08 End-to-End Semi-Supervised Learning for Video Action Detection Akash Kumar et.al. 2203.04251v1 link
2022-03-08 Neural Face Identification in a 2D Wireframe Projection of a Manifold Object Kehan Wang et.al. 2203.04229v1 link
2022-03-08 Selective-Supervised Contrastive Learning with Noisy Labels Shikun Li et.al. 2203.04181v1 link
2022-03-08 Motron: Multimodal Probabilistic Human Motion Forecasting Tim Salzmann et.al. 2203.04132v1 null
2022-03-08 E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation Tao Zhang et.al. 2203.04074v1 link
2022-03-08 Shape-invariant 3D Adversarial Point Clouds Qidong Huang et.al. 2203.04041v1 link
2022-03-08 DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos Mathias Parger et.al. 2203.03996v1 null
2022-03-08 Contrastive Conditional Neural Processes Zesheng Ye et.al. 2203.03978v1 null
2022-03-08 On Generalizing Beyond Domains in Cross-Domain Continual Learning Christian Simon et.al. 2203.03970v1 null
2022-03-08 Generative Cooperative Learning for Unsupervised Video Anomaly Detection Muhammad Zaigham Zaheer et.al. 2203.03962v1 null
2022-03-08 ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation Robin Wang et.al. 2203.03888v1 link
2022-03-08 Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels Yuchao Wang et.al. 2203.03884v1 null
2022-03-08 Weakly Supervised Semantic Segmentation using Out-of-Distribution Data Jungbeom Lee et.al. 2203.03860v1 link
2022-03-08 Deep Rectangling for Image Stitching: A Learning Baseline Lang Nie et.al. 2203.03831v1 link
2022-03-08 Shadows can be Dangerous: Stealthy and Effective Physical-world Adversarial Attack by Natural Phenomenon Yiqi Zhong et.al. 2203.03818v2 link
2022-03-08 Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers Han Joo Chae et.al. 2203.03814v1 null
2022-03-08 Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild Xuefeng Du et.al. 2203.03800v1 link
2022-03-07 Kubric: A scalable dataset generator Klaus Greff et.al. 2203.03570v1 link
2022-03-07 Adversarial Texture for Fooling Person Detectors in the Physical World Zhanhao Hu et.al. 2203.03373v2 null
2022-03-07 Interpretable part-whole hierarchies and conceptual-semantic relationships in neural networks Nicola Garau et.al. 2203.03282v1 null
2022-03-07 MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning Shiming Chen et.al. 2203.03137v1 link
2022-03-07 Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-robust Makeup Transfer Shengshan Hu et.al. 2203.03121v1 null

About

CVPR2022 update everyday!

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages