GitHub - DWCTOD/arXiv-CVPR2022-daily: CVPR2022 update everyday!

Updated on 2022.04.12

CVPR2022

Publish Date	Title	Authors	PDF	Code
2022-04-11	Single-Photon Structured Light	Varun Sundar et.al.	2204.05300v1	null
2022-04-11	Focal Length and Object Pose Estimation via Render and Compare	Georgy Ponimatkin et.al.	2204.05145v1	link
2022-04-11	XMP-Font: Self-Supervised Cross-Modality Pre-training for Few-Shot Font Generation	Wei Liu et.al.	2204.05084v1	null
2022-04-11	Pyramid Grafting Network for One-Stage High Resolution Saliency Detection	Chenxi Xie et.al.	2204.05041v1	link
2022-04-11	Structure-Aware Motion Transfer with Deformable Anchor Model	Jiale Tao et.al.	2204.05018v1	link
2022-04-11	HiMODE: A Hybrid Monocular Omnidirectional Depth Estimation Model	Masum Shah Junayed et.al.	2204.05007v1	null
2022-04-11	Commonality in Natural Images Rescues GANs: Pretraining GANs with Generic and Privacy-free Synthetic Data	Kyungjune Baek et.al.	2204.04950v1	link
2022-04-11	When NAS Meets Trees: An Efficient Algorithm for Neural Architecture Search	Guocheng Qian et.al.	2204.04918v1	null
2022-04-11	Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection	Jihwan Park et.al.	2204.04836v1	link
2022-04-10	SOS! Self-supervised Learning Over Sets Of Handled Objects In Egocentric Action Recognition	Victor Escorcia et.al.	2204.04796v1	null
2022-04-10	Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image	Yujiao Shi et.al.	2204.04752v1	link
2022-04-10	Reasoning with Multi-Structure Commonsense Knowledge in Visual Dialog	Shunyu Zhang et.al.	2204.04680v1	null
2022-04-10	FedCorr: Multi-Stage Federated Learning for Label Noise Correction	Jingyi Xu et.al.	2204.04677v1	link
2022-04-10	NAN: Noise-Aware NeRFs for Burst-Denoising	Naama Pearl et.al.	2204.04668v1	null
2022-04-10	Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation	Xiangtai Li et.al.	2204.04656v1	link
2022-04-10	Learning Pixel-Level Distinctions for Video Highlight Detection	Fanyue Wei et.al.	2204.04615v1	null
2022-04-10	Explaining Deep Convolutional Neural Networks via Latent Visual-Semantic Filter Attention	Yu Yang et.al.	2204.04601v1	link
2022-04-10	Robust Cross-Modal Representation Learning with Progressive Self-Distillation	Alex Andonian et.al.	2204.04588v1	null
2022-04-09	Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification	Jiangtao Xie et.al.	2204.04567v1	null
2022-04-09	Multimodal Transformer for Nursing Activity Recognition	Momal Ijaz et.al.	2204.04564v1	null
2022-04-09	DeepLIIF: An Online Platform for Quantification of Clinical Pathology Slides	Parmida Ghahremani et.al.	2204.04494v1	link
2022-04-09	ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation	Jianan Wang et.al.	2204.04428v1	null
2022-04-09	Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning	Daniel Becking et.al.	2204.04424v1	null
2022-04-09	The Two Dimensions of Worst-case Training and the Integrated Effect for Out-of-domain Generalization	Zeyi Huang et.al.	2204.04384v1	link
2022-04-08	Dancing under the stars: video denoising in starlight	Kristina Monakhova et.al.	2204.04210v1	null
2022-04-08	General Incremental Learning with Domain-aware Categorical Representations	Jiangwei Xie et.al.	2204.04078v1	null
2022-04-08	Identifying Ambiguous Similarity Conditions via Semantic Matching	Han-Jia Ye et.al.	2204.04053v1	null
2022-04-08	Probabilistic Representations for Video Contrastive Learning	Jungin Park et.al.	2204.03946v1	null
2022-04-08	Does Robustness on ImageNet Transfer to Downstream Tasks?	Yutaro Yamada et.al.	2204.03934v1	null
2022-04-08	Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection	Chunyu Li et.al.	2204.03929v1	null
2022-04-08	CD$^2$-pFed: Cyclic Distillation-guided Channel Decoupling for Model Personalization in Federated Learning	Yiqing Shen et.al.	2204.03880v1	null
2022-04-08	Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation	Lin Chen et.al.	2204.03838v1	link
2022-04-07	TorMentor: Deterministic dynamic-path, data augmentations with fractals	Anguelos Nicolaou et.al.	2204.03776v1	null
2022-04-07	Gravitationally Lensed Black Hole Emission Tomography	Aviad Levis et.al.	2204.03715v1	null
2022-04-07	TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates	You Xie et.al.	2204.03671v1	null
2022-04-07	Total Variation Optimization Layers for Computer Vision	Raymond A. Yeh et.al.	2204.03643v1	link
2022-04-07	Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction	Kalyan Vasudev Alwala et.al.	2204.03642v1	null
2022-04-07	Unsupervised Image-to-Image Translation with Generative Prior	Shuai Yang et.al.	2204.03641v1	link
2022-04-07	Class-Incremental Learning with Strong Pre-trained Models	Tz-Ying Wu et.al.	2204.03634v1	null
2022-04-07	Unified Contrastive Learning in Image-Text-Label Space	Jianwei Yang et.al.	2204.03610v1	link
2022-04-07	Pin the Memory: Learning to Generalize Semantic Segmentation	Jin Kim et.al.	2204.03609v1	null
2022-04-07	AutoRF: Learning 3D Object Radiance Fields from Single View Observations	Norman Müller et.al.	2204.03593v1	null
2022-04-07	Many-to-many Splatting for Efficient Video Frame Interpolation	Ping Hu et.al.	2204.03513v1	link
2022-04-07	Deep Visual Geo-localization Benchmark	Gabriele Berton et.al.	2204.03444v1	link
2022-04-07	PSTR: End-to-End One-Step Person Search With Transformers	Jiale Cao et.al.	2204.03340v1	link
2022-04-07	Coarse-to-Fine Feature Mining for Video Semantic Segmentation	Guolei Sun et.al.	2204.03330v1	link
2022-04-07	L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation	Peng-Tao Jiang et.al.	2204.03206v1	null
2022-04-07	Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality	Tristan Thrush et.al.	2204.03162v1	null
2022-04-06	AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis	Zhiqin Chen et.al.	2204.03105v1	null
2022-04-06	Hierarchical Self-supervised Representation Learning for Movie Understanding	Fanyi Xiao et.al.	2204.03101v1	null
2022-04-06	Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency	Zhiwu Qing et.al.	2204.03017v1	null
2022-04-06	Multi-Scale Memory-Based Video Deblurring	Bo Ji et.al.	2204.02977v1	link
2022-04-06	Temporal Alignment Networks for Long-term Video	Tengda Han et.al.	2204.02968v1	null
2022-04-06	"The Pedestrian next to the Lamppost" Adaptive Object Graphs for Better Instantaneous Mapping	Avishkar Saha et.al.	2204.02944v1	null
2022-04-06	An Empirical Study of End-to-End Temporal Action Detection	Xiaolong Liu et.al.	2204.02932v1	link
2022-04-06	Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network	Byung-Kwan Lee et.al.	2204.02738v1	null
2022-04-06	Aesthetic Text Logo Synthesis via Content-aware Layout Inferring	Yizhi Wang et.al.	2204.02701v1	link
2022-04-06	Towards An End-to-End Framework for Flow-Guided Video Inpainting	Zhen Li et.al.	2204.02663v2	link
2022-04-06	Towards Robust Adaptive Object Detection under Noisy Annotations	Xinyu Liu et.al.	2204.02620v1	link
2022-04-06	Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification	Yanan Wang et.al.	2204.02611v2	link
2022-04-06	Learning to Anticipate Future with Dynamic Context Removal	Xinyu Xu et.al.	2204.02587v1	null
2022-04-06	SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference	Krishna Wadhwani et.al.	2204.02585v2	null
2022-04-06	FocalClick: Towards Practical Interactive Image Segmentation	Xi Chen et.al.	2204.02574v1	link
2022-04-06	Gait Recognition in the Wild with Dense 3D Representations and A Benchmark	Jinkai Zheng et.al.	2204.02569v1	link
2022-04-06	MixFormer: Mixing Features across Windows and Dimensions	Qiang Chen et.al.	2204.02557v1	link
2022-04-06	RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection	Umar Khalid et.al.	2204.02553v1	link
2022-04-06	Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation	Wangbo Zhao et.al.	2204.02547v1	link
2022-04-05	Adversarial Robustness through the Lens of Convolutional Filters	Paul Gavrikov et.al.	2204.02481v1	link
2022-04-05	Learning Optimal K-space Acquisition and Reconstruction using Physics-Informed Neural Networks	Wei Peng et.al.	2204.02480v1	null
2022-04-05	ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer	Ruohan Gao et.al.	2204.02389v1	link
2022-04-05	Neural Convolutional Surfaces	Luca Morreale et.al.	2204.02289v1	null
2022-04-05	Rethinking Visual Geo-localization for Large-Scale Applications	Gabriele Berton et.al.	2204.02287v1	link
2022-04-05	Arbitrary-Scale Image Synthesis	Evangelos Ntavelis et.al.	2204.02273v1	link
2022-04-05	IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images	Kai Zhang et.al.	2204.02232v1	null
2022-04-05	SNUG: Self-Supervised Neural Dynamic Garments	Igor Santesteban et.al.	2204.02219v1	link
2022-04-05	Multi-View Transformer for 3D Visual Grounding	Shijia Huang et.al.	2204.02174v1	link
2022-04-05	Leveraging Equivariant Features for Absolute Pose Regression	Mohamed Adel Musallam et.al.	2204.02163v1	null
2022-04-05	Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition	Mingfei Han et.al.	2204.02148v2	null
2022-04-05	Detector-Free Weakly Supervised Group Activity Recognition	Dongkeun Kim et.al.	2204.02139v1	null
2022-04-05	Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation	Tao Feng et.al.	2204.02136v1	link
2022-04-05	P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior	Vaishakh Patil et.al.	2204.02091v1	link
2022-04-05	Text Spotting Transformers	Xiang Zhang et.al.	2204.01918v1	link
2022-04-04	Revisiting Near/Remote Sensing with Geospatial Attention	Scott Workman et.al.	2204.01807v1	null
2022-04-04	Joint Hand Motion and Interaction Hotspots Prediction from Egocentric Videos	Shaowei Liu et.al.	2204.01696v1	null
2022-04-04	LISA: Learning Implicit Shape and Appearance of Hands	Enric Corona et.al.	2204.01695v1	null
2022-04-04	Exemplar-bsaed Pattern Synthesis with Implicit Periodic Field Network	Haiwei Chen et.al.	2204.01671v1	null
2022-04-04	FIFO: Learning Fog-invariant Features for Foggy Scene Segmentation	Sohyun Lee et.al.	2204.01587v1	null
2022-04-04	Unsupervised Learning of Accurate Siamese Tracking	Qiuhong Shen et.al.	2204.01475v1	link
2022-04-04	Correlation Verification for Image Retrieval	Seongwon Lee et.al.	2204.01458v1	link
2022-04-04	WildNet: Learning Domain Generalized Semantic Segmentation from the Wild	Suhyeon Lee et.al.	2204.01446v1	link
2022-04-04	Degradation-agnostic Correspondence from Resolution-asymmetric Stereo	Xihao Chen et.al.	2204.01429v1	null
2022-04-04	RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo	Junhua Xi et.al.	2204.01320v1	null
2022-04-03	Exploiting Temporal Relations on Radar Perception for Autonomous Driving	Peizhao Li et.al.	2204.01184v1	null
2022-04-03	BNV-Fusion: Dense 3D Reconstruction using Bi-level Neural Volume Fusion	Kejie Li et.al.	2204.01139v1	null
2022-04-03	ES6D: A Computation Efficient and Symmetry-Aware 6D Pose Regression Framework	Ningkai Mo et.al.	2204.01080v1	null
2022-04-03	Style-Based Global Appearance Flow for Virtual Try-On	Sen He et.al.	2204.01046v1	link
2022-04-03	STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes	Peishan Cong et.al.	2204.01026v1	link
2022-04-03	TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting	Huazhang Hu et.al.	2204.01018v1	link
2022-04-03	Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature	Zhixiang Wang et.al.	2204.00974v1	link
2022-04-03	DST: Dynamic Substitute Training for Data-free Black-box Attack	Wenxuan Wang et.al.	2204.00972v1	null
2022-04-03	AdaFace: Quality Adaptive Margin for Face Recognition	Minchul Kim et.al.	2204.00964v1	link
2022-04-02	Matching Feature Sets for Few-Shot Image Classification	Arman Afrasiyabi et.al.	2204.00949v1	null
2022-04-02	Progressive Minimal Path Method with Embedded CNN	Wei Liao et.al.	2204.00944v1	null
2022-04-02	Class-Incremental Learning by Knowledge Distillation with Adaptive Feature Consolidation	Minsoo Kang et.al.	2204.00895v1	link
2022-04-02	Online Convolutional Re-parameterization	Mu Hu et.al.	2204.00826v1	null
2022-04-02	Semantic-Aware Domain Generalized Segmentation	Duo Peng et.al.	2204.00822v1	link
2022-04-02	R(Det)^2: Randomized Decision Routing for Object Detection	Ya-Li Li et.al.	2204.00794v1	null
2022-04-02	Homography Loss for Monocular 3D Object Detection	Jiaqi Gu et.al.	2204.00754v1	link
2022-04-02	What to look at and where: Semantic and Spatial Refined Transformer for detecting human-object interactions	A S M Iftekhar et.al.	2204.00746v1	null
2022-04-01	Consistency driven Sequential Transformers Attention Model for Partially Observable Scenes	Samrudhdhi B. Rangrej et.al.	2204.00656v1	null
2022-04-01	Robust Neonatal Face Detection in Real-world Clinical Settings	Jacqueline Hausmann et.al.	2204.00655v1	null
2022-04-01	SIMBAR: Single Image-Based Scene Relighting For Effective Data Augmentation For Automated Driving Vision Tasks	Xianling Zhang et.al.	2204.00644v1	null
2022-04-01	On the Importance of Asymmetry for Siamese Representation Learning	Xiao Wang et.al.	2204.00613v1	link
2022-04-01	Proper Reuse of Image Classification Features Improves Object Detection	Cristina Vasconcelos et.al.	2204.00484v1	null
2022-04-01	Marginal Contrastive Correspondence for Guided Image Generation	Fangneng Zhan et.al.	2204.00442v1	null
2022-04-01	Learning to Deblur using Light Field Generated and Real Defocus Images	Lingyan Ruan et.al.	2204.00367v1	link
2022-04-01	DIP: Deep Inverse Patchmatch for High-Resolution Optical Flow	Zihua Zheng et.al.	2204.00330v1	link
2022-04-01	CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection	Yanan Zhang et.al.	2204.00325v1	null
2022-04-01	Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression	Qiang Li et.al.	2204.00309v1	null
2022-04-01	Perception Prioritized Training of Diffusion Models	Jooyoung Choi et.al.	2204.00227v1	link
2022-04-01	Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization	Eunji Kim et.al.	2204.00220v1	null
2022-04-01	GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature	Biyang Liu et.al.	2204.00179v1	link
2022-04-01	LASER: LAtent SpacE Rendering for 2D Visual Localization	Zhixiang Min et.al.	2204.00157v1	null
2022-03-31	TransGeo: Transformer Is All You Need for Cross-view Image Geo-localization	Sijie Zhu et.al.	2204.00097v1	link
2022-03-31	Efficient Maximal Coding Rate Reduction by Variational Forms	Christina Baek et.al.	2204.00077v1	null
2022-03-31	Improving Adversarial Transferability via Neuron Attribution-Based Attacks	Jianping Zhang et.al.	2204.00008v1	link
2022-03-31	Bringing Old Films Back to Life	Ziyu Wan et.al.	2203.17276v1	link
2022-03-31	TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing	Yanbo Xu et.al.	2203.17266v1	link
2022-03-31	Generating High Fidelity Data from Low-density Regions using Diffusion Models	Vikash Sehwag et.al.	2203.17260v1	null
2022-03-31	Continuous Scene Representations for Embodied AI	Samir Yitzhak Gadre et.al.	2203.17251v1	null
2022-03-31	Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions	Van Nguyen Nguyen et.al.	2203.17234v1	link
2022-03-31	SimVQA: Exploring Simulated Environments for Visual Question Answering	Paola Cascante-Bonilla et.al.	2203.17219v1	null
2022-03-31	Leverage Your Local and Global Representations: A New Self-Supervised Learning Strategy	Tong Zhang et.al.	2203.17205v1	null
2022-03-31	Time Lens++: Event-based Frame Interpolation with Parametric Non-linear Flow and Multi-scale Fusion	Stepan Tulyakov et.al.	2203.17191v1	null
2022-03-31	AEGNN: Asynchronous Event-based Graph Neural Networks	Simon Schaefer et.al.	2203.17149v1	null
2022-03-31	It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher	Kanghyun Choi et.al.	2203.17008v2	link
2022-03-31	Towards Robust Rain Removal Against Adversarial Attacks: A Comprehensive Benchmark Analysis and Beyond	Yi Yu et.al.	2203.16931v1	link
2022-03-31	End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps	Ke Guo et.al.	2203.16910v1	link
2022-03-31	Multi-Granularity Alignment Domain Adaptation for Object Detection	Wenzhang Zhou et.al.	2203.16897v1	null
2022-03-31	CRAFT: Cross-Attentional Flow Transformer for Robust Optical Flow	Xiuchao Sui et.al.	2203.16896v1	link
2022-03-31	Deformation and Correspondence Aware Unsupervised Synthetic-to-Real Scene Flow Estimation for Point Clouds	Zhao Jin et.al.	2203.16895v1	link
2022-03-31	Towards Driving-Oriented Metric for Lane Detection Models	Takami Sato et.al.	2203.16851v1	link
2022-03-31	Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization	Junyu Gao et.al.	2203.16800v1	link
2022-03-31	Deformable Video Transformer	Jue Wang et.al.	2203.16795v1	null
2022-03-31	Reflection and Rotation Symmetry Detection via Equivariant Learning	Ahyun Seo et.al.	2203.16787v1	null
2022-03-31	ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval	Mengjun Cheng et.al.	2203.16778v1	null
2022-03-31	ReSTR: Convolution-free Referring Image Segmentation Using Transformers	Namyup Kim et.al.	2203.16768v1	null
2022-03-31	MeMOT: Multi-Object Tracking with Memory	Jiarui Cai et.al.	2203.16761v1	null
2022-03-31	Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models	Feng Cheng et.al.	2203.16755v1	null
2022-03-31	Personalized Image Aesthetics Assessment with Rich Attributes	Yuzhe Yang et.al.	2203.16754v1	null
2022-03-31	Exploiting Explainable Metrics for Augmented SGD	Mahdi S. Hosseini et.al.	2203.16723v1	link
2022-03-30	Task Adaptive Parameter Sharing for Multi-Task Learning	Matthew Wallingford et.al.	2203.16708v1	null
2022-03-30	Face Relighting with Geometrically Consistent Shadows	Andrew Hou et.al.	2203.16681v1	link
2022-03-30	Escaping Data Scarcity for High-Resolution Heterogeneous Face Hallucination	Yiqun Mei et.al.	2203.16669v1	null
2022-03-30	Learning Local Displacements for Point Cloud Completion	Yida Wang et.al.	2203.16600v1	null
2022-03-30	Constrained Few-shot Class-incremental Learning	Michael Hersche et.al.	2203.16588v1	link
2022-03-30	Large-Scale Pre-training for Person Re-identification with Noisy Labels	Dengpan Fu et.al.	2203.16533v1	link
2022-03-30	Understanding 3D Object Articulation in Internet Videos	Shengyi Qian et.al.	2203.16531v1	null
2022-03-30	CaDeX: Learning Canonical Deformation Coordinate Space for Dynamic Surface Representation via Neural Homeomorphism	Jiahui Lei et.al.	2203.16529v1	null
2022-03-30	Collaborative Transformers for Grounded Situation Recognition	Junhyeong Cho et.al.	2203.16518v1	link
2022-03-30	Unseen Classes at a Later Time? No Problem	Hari Chandana Kuchibhotla et.al.	2203.16517v1	link
2022-03-30	Fast Light-Weight Near-Field Photometric Stereo	Daniel Lichy et.al.	2203.16515v1	null
2022-03-30	AdaMixer: A Fast-Converging Query-Based Object Detector	Ziteng Gao et.al.	2203.16507v1	link
2022-03-30	Fast, Accurate and Memory-Efficient Partial Permutation Synchronization	Shaohan Li et.al.	2203.16505v1	null
2022-03-30	TubeDETR: Spatio-Temporal Video Grounding with Transformers	Antoine Yang et.al.	2203.16434v1	link
2022-03-30	Balanced MSE for Imbalanced Visual Regression	Jiawei Ren et.al.	2203.16427v1	link
2022-03-30	Practical Learned Lossless JPEG Recompression with Multi-Level Cross-Channel Entropy Model in the DCT Domain	Lina Guo et.al.	2203.16357v1	null
2022-03-30	Multi-Robot Active Mapping via Neural Bipartite Graph Matching	Kai Ye et.al.	2203.16319v1	null
2022-03-30	Forecasting from LiDAR via Future Object Detection	Neehar Peri et.al.	2203.16297v1	null
2022-03-30	Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data	Corentin Sautier et.al.	2203.16258v1	link
2022-03-30	InstaFormer: Instance-Aware Image-to-Image Translation with Transformer	Soohyun Kim et.al.	2203.16248v1	null
2022-03-30	Target-aware Dual Adversarial Learning and a Multi-scenario Multi-Modality Benchmark to Fuse Infrared and Visible for Object Detection	Jinyuan Liu et.al.	2203.16220v1	link
2022-03-30	Learning of Global Objective for Network Flow in Multi-Object Tracking	Shuai Li et.al.	2203.16210v1	null
2022-03-30	Fair Contrastive Learning for Facial Attribute Classification	Sungho Park et.al.	2203.16209v1	link
2022-03-30	Spatial-Temporal Parallel Transformer for Arm-Hand Dynamic Estimation	Shuying Liu et.al.	2203.16202v1	null
2022-03-30	On the Road to Online Adaptation for Semantic Image Segmentation	Riccardo Volpi et.al.	2203.16195v1	link
2022-03-30	FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing	Rishubh Singh et.al.	2203.16168v1	null
2022-03-30	Global Tracking via Ensemble of Local Trackers	Zikun Zhou et.al.	2203.16092v1	link
2022-03-30	Omni-DETR: Omni-Supervised Object Detection with Transformers	Pei Wang et.al.	2203.16089v1	null
2022-03-30	STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction	Zheng Chang et.al.	2203.16084v1	null
2022-03-30	Learning Program Representations for Food Images and Cooking Recipes	Dim P. Papadopoulos et.al.	2203.16071v1	null
2022-03-30	AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval	Riku Togashi et.al.	2203.16062v1	null
2022-03-30	Progressively Generating Better Initial Guesses Towards Next Stages for High-Quality Human Motion Prediction	Tiezheng Ma et.al.	2203.16051v1	link
2022-03-30	Threshold Matters in WSSS: Manipulating the Activation for the Robust and Accurate Segmentation Model Against Thresholds	Minhyun Lee et.al.	2203.16045v1	link
2022-03-30	Semi-Supervised Learning of Semantic Correspondence with Pseudo-Labels	Jiwon Kim et.al.	2203.16038v1	null
2022-03-30	Iterative Deep Homography Estimation	Si-Yuan Cao et.al.	2203.15982v1	link
2022-03-29	StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis	Zhiheng Li et.al.	2203.15799v1	link
2022-03-29	CHEX: CHannel EXploration for CNN Model Compression	Zejiang Hou et.al.	2203.15794v1	null
2022-03-29	FisherMatch: Semi-Supervised Rotation Regression via Entropy-based Filtering	Yingda Yin et.al.	2203.15765v1	null
2022-03-29	Integrative Few-Shot Learning for Classification and Segmentation	Dahyun Kang et.al.	2203.15712v1	null
2022-03-29	OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction	Lixin Yang et.al.	2203.15709v1	link
2022-03-29	EnvEdit: Environment Editing for Vision-and-Language Navigation	Jialu Li et.al.	2203.15685v1	link
2022-03-29	Exploring Frequency Adversarial Attacks for Face Forgery Detection	Shuai Jia et.al.	2203.15674v1	null
2022-03-29	PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision	Kehong Gong et.al.	2203.15625v1	null
2022-03-29	Learning a Structured Latent Space for Unsupervised Point Cloud Completion	Yingjie Cai et.al.	2203.15580v1	null
2022-03-29	BARC: Learning to Regress 3D Dog Shape from Images by Exploiting Breed Information	Nadine Rueegg et.al.	2203.15536v1	null
2022-03-29	OSOP: A Multi-Stage One Shot Object Pose Estimation Framework	Ivan Shugurov et.al.	2203.15533v1	null
2022-03-29	Learning Structured Gaussians to Approximate Deep Ensembles	Ivor J. A. Simpson et.al.	2203.15485v1	null
2022-03-29	Clean Implicit 3D Structure from Noisy 2D STEM Images	Hannah Kniesel et.al.	2203.15434v1	link
2022-03-29	Long-term Video Frame Interpolation via Feature Propagation	Dawit Mureja Argaw et.al.	2203.15427v1	null
2022-03-29	Quantifying Societal Bias Amplification in Image Captioning	Yusuke Hirota et.al.	2203.15395v1	null
2022-03-29	Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification	Shi Pu et.al.	2203.15381v1	link
2022-03-29	A Style-aware Discriminator for Controllable Image Translation	Kunhee Kim et.al.	2203.15375v1	null
2022-03-29	Self-Supervised Image Representation Learning with Geometric Set Consistency	Nenglun Chen et.al.	2203.15361v1	null
2022-03-29	Nested Collaborative Learning for Long-Tailed Visual Recognition	Jun Li et.al.	2203.15359v1	link
2022-03-29	Online Continual Learning on a Contaminated Data Stream with Blurry Task Boundaries	Jihwan Bang et.al.	2203.15355v1	link
2022-03-29	SIOD: Single Instance Annotated Per Category Per Image for Object Detection	Hanjun Li et.al.	2203.15353v1	link
2022-03-29	Task-specific Inconsistency Alignment for Domain Adaptive Object Detection	Liang Zhao et.al.	2203.15345v1	null
2022-03-29	Balanced Multimodal Learning via On-the-fly Gradient Modulation	Xiaokang Peng et.al.	2203.15332v1	null
2022-03-29	CNN Filter DB: An Empirical Investigation of Trained Convolutional Filters	Paul Gavrikov et.al.	2203.15331v1	link
2022-03-29	Dressing in the Wild by Watching Dance Videos	Xin Dong et.al.	2203.15320v1	null
2022-03-29	Eigenlanes: Data-Driven Lane Descriptors for Structurally Diverse Lanes	Dongkwon Jin et.al.	2203.15302v1	null
2022-03-29	Uncertainty-Aware Adaptation for Self-Supervised 3D Human Pose Estimation	Jogendra Nath Kundu et.al.	2203.15293v1	null
2022-03-29	MAT: Mask-Aware Transformer for Large Hole Image Inpainting	Wenbo Li et.al.	2203.15270v1	link
2022-03-29	Eigencontours: Novel Contour Descriptors Based on Low-Rank Approximation	Wonhui Park et.al.	2203.15259v1	null
2022-03-29	Pop-Out Motion: 3D-Aware Image Deformation via Learning the Shape Laplacian	Jihyun Lee et.al.	2203.15235v1	null
2022-03-28	Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning	Minghao Chen et.al.	2203.14957v1	link
2022-03-28	GIRAFFE HD: A High-Resolution 3D-aware Generative Model	Yang Xue et.al.	2203.14954v1	null
2022-03-28	Energy-based Latent Aligner for Incremental Learning	K J Joseph et.al.	2203.14952v1	link
2022-03-28	Controllable Dynamic Multi-Task Architectures	Dripta S. Raychaudhuri et.al.	2203.14949v1	null
2022-03-28	Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model	Yu Du et.al.	2203.14940v1	link
2022-03-28	Attributable Visual Similarity Learning	Borui Zhang et.al.	2203.14932v1	link
2022-03-28	Expanding Low-Density Latent Regions for Open-Set Object Detection	Jiaming Han et.al.	2203.14911v1	link
2022-03-28	Optimizing Elimination Templates by Greedy Parameter Search	Evgeniy Martyushev et.al.	2203.14901v1	null
2022-03-28	Learning Where to Learn in Cross-View Self-Supervised Learning	Lang Huang et.al.	2203.14898v1	null
2022-03-28	Doodle It Yourself: Class Incremental Learning by Drawing a Few Sketches	Ayan Kumar Bhunia et.al.	2203.14843v1	null
2022-03-28	Sketching without Worrying: Noise-Tolerant Sketch-Based Image Retrieval	Ayan Kumar Bhunia et.al.	2203.14817v1	link
2022-03-28	Partially Does It: Towards Scene-Level FG-SBIR with Partial Input	Pinaki Nath Chowdhury et.al.	2203.14804v1	null
2022-03-28	Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities	Fadime Sener et.al.	2203.14712v1	link
2022-03-28	MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection	Bumsoo Kim et.al.	2203.14709v1	null
2022-03-28	Sketch3T: Test-Time Training for Zero-Shot SBIR	Aneeshan Sain et.al.	2203.14691v1	null
2022-03-28	Brain-inspired Multilayer Perceptron with Spiking Neurons	Wenshuo Li et.al.	2203.14679v1	null
2022-03-28	Part-based Pseudo Label Refinement for Unsupervised Person Re-identification	Yoonki Cho et.al.	2203.14675v1	null
2022-03-28	Diverse Plausible 360-Degree Image Outpainting for Efficient 3DCG Background Creation	Naofumi Akimoto et.al.	2203.14668v1	null
2022-03-28	FS6D: Few-Shot 6D Pose Estimation of Novel Objects	Yisheng He et.al.	2203.14628v1	link
2022-03-28	Towards Implicit Text-Guided 3D Shape Generation	Zhengzhe Liu et.al.	2203.14622v1	link
2022-03-28	Demystifying the Neural Tangent Kernel from a Practical Perspective: Can it be trusted for Neural Architecture Search without training?	Jisoo Mok et.al.	2203.14577v1	link
2022-03-28	HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network	JoonKyu Park et.al.	2203.14564v1	null
2022-03-28	Reference-based Video Super-Resolution Using Multi-Camera Video Triplets	Junyong Lee et.al.	2203.14537v1	link
2022-03-28	Uni6D: A Unified CNN Framework without Projection Breakdown for 6D Pose Estimation	Xiaoke Jiang et.al.	2203.14531v1	null
2022-03-28	REGTR: End-to-end Point Cloud Correspondences with Transformers	Zi Jian Yew et.al.	2203.14517v1	link
2022-03-28	ImFace: A Nonlinear 3D Morphable Face Model with Implicit Neural Representations	Mingwu Zheng et.al.	2203.14510v1	null
2022-03-28	Automated Progressive Learning for Efficient Training of Vision Transformers	Changlin Li et.al.	2203.14509v1	link
2022-03-28	Stratified Transformer for 3D Point Cloud Segmentation	Xin Lai et.al.	2203.14508v1	link
2022-03-28	Catching Both Gray and Black Swans: Open-set Supervised Anomaly Detection	Choubo Ding et.al.	2203.14506v1	link
2022-03-28	NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge	Duc Minh Vo et.al.	2203.14499v1	null
2022-03-25	Versatile Multi-Modal Pre-Training for Human-Centric Perception	Fangzhou Hong et.al.	2203.13815v1	null
2022-03-25	Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion	Tianpei Gu et.al.	2203.13777v1	link
2022-03-25	Searching for Network Width with Bilaterally Coupled Network	Xiu Su et.al.	2203.13714v1	link
2022-03-25	Unsupervised Pre-training for Temporal Action Localization Tasks	Can Zhang et.al.	2203.13609v1	null
2022-03-25	Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task	Xiaoqing Ye et.al.	2203.13608v1	null
2022-03-25	Continual Test-Time Domain Adaptation	Qin Wang et.al.	2203.13591v1	link
2022-03-25	Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation	Jinheng Xie et.al.	2203.13505v1	null
2022-03-25	Non-Probability Sampling Network for Stochastic Human Trajectory Prediction	Inhwan Bae et.al.	2203.13471v1	null
2022-03-25	CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification	Philip Chikontwe et.al.	2203.13465v1	null
2022-03-25	MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis	Liwen Xu et.al.	2203.13443v1	null
2022-03-25	Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation?	Zhenyu Wang et.al.	2203.13427v1	null
2022-03-25	Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes	Zengjie Song et.al.	2203.13412v1	null
2022-03-25	Point2Seq: Detecting 3D Objects as Sequences	Yujing Xue et.al.	2203.13394v1	null
2022-03-24	Probing Representation Forgetting in Supervised and Unsupervised Continual Learning	MohammadReza Davari et.al.	2203.13381v1	null
2022-03-24	NPBG++: Accelerating Neural Point-Based Graphics	Ruslan Rakhimov et.al.	2203.13318v1	null
2022-03-24	SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation	Chenming Zhu et.al.	2203.13312v1	null
2022-03-24	MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection	Renrui Zhang et.al.	2203.13310v1	link
2022-03-24	Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos	Reza Ghoddoosian et.al.	2203.13309v1	null
2022-03-24	EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation	Hansheng Chen et.al.	2203.13254v1	link
2022-03-24	Global Tracking Transformers	Xingyi Zhou et.al.	2203.13250v1	link
2022-03-24	Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer	Shuai Yang et.al.	2203.13248v1	link
2022-03-24	Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation	Xian Liu et.al.	2203.13161v1	link
2022-03-24	Moving Window Regression: A Novel Approach to Ordinal Regression	Nyeong-Ho Shin et.al.	2203.13122v1	link
2022-03-24	Egocentric Prediction of Action Target in 3D	Yiming Li et.al.	2203.13116v1	null
2022-03-24	AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception	Shaoyu Chen et.al.	2203.13090v1	link
2022-03-24	Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory	Li Siyao et.al.	2203.13055v2	link
2022-03-24	CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image	Reyhaneh Neshatavar et.al.	2203.13009v1	link
2022-03-24	Compound Domain Generalization via Meta-Knowledge Encoding	Chaoqi Chen et.al.	2203.13006v1	null
2022-03-24	Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction	M. Saquib Sarfraz et.al.	2203.12997v1	link
2022-03-24	WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation	Yingzhi Tang et.al.	2203.12917v1	link
2022-03-24	Neural Reflectance for Shape Recovery with Shadow Handling	Junxuan Li et.al.	2203.12909v1	link
2022-03-24	RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization	Yan Xu et.al.	2203.12870v1	null
2022-03-24	DyRep: Bootstrapping Training with Dynamic Re-parameterization	Tao Huang et.al.	2203.12868v1	link
2022-03-24	Beyond Fixation: Dynamic Window Visual Transformer	Pengzhen Ren et.al.	2203.12856v1	link
2022-03-24	Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation	Jinchao Yang et.al.	2203.12835v1	link
2022-03-24	Sparse Instance Activation for Real-Time Instance Segmentation	Tianheng Cheng et.al.	2203.12827v1	link
2022-03-24	Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera	Jae Shin Yoon et.al.	2203.12780v1	null
2022-03-23	Multidimensional Belief Quantification for Label-Efficient Meta-Learning	Deep Pandey et.al.	2203.12768v1	null
2022-03-23	UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection	Ye Liu et.al.	2203.12745v1	link
2022-03-23	Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation	Yanwu Xu et.al.	2203.12707v1	link
2022-03-23	DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation	Aysim Toker et.al.	2203.12560v1	null
2022-03-23	Transformer-based Multimodal Information Fusion for Facial Expression Analysis	Wei Zhang et.al.	2203.12367v1	null
2022-03-23	How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs	Hazel Doughty et.al.	2203.12344v1	link
2022-03-23	Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin	Hangyu Li et.al.	2203.12341v2	link
2022-03-23	Real-time Object Detection for Streaming Perception	Jinrong Yang et.al.	2203.12338v1	link
2022-03-23	DR.VIC: Decomposition and Reasoning for Video Individual Counting	Tao Han et.al.	2203.12335v1	link
2022-03-23	Node Representation Learning in Graph via Node-to-Neighbourhood Mutual Information Maximization	Wei Dong et.al.	2203.12265v1	link
2022-03-23	Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition	Junho Kim et.al.	2203.12247v1	null
2022-03-23	Training-free Transformer Architecture Search	Qinqin Zhou et.al.	2203.12217v1	null
2022-03-23	Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection	Liang Chen et.al.	2203.12208v1	link
2022-03-23	Unifying Motion Deblurring and Frame Interpolation with Events	Xiang Zhang et.al.	2203.12178v1	null
2022-03-22	PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo	Jiachen Liu et.al.	2203.12082v1	null
2022-03-22	DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification	Hongrun Zhang et.al.	2203.12081v1	null
2022-03-22	φ-SfT: Shape-from-Template with a Physics-Based Deformation Model	Navami Kairanda et.al.	2203.11938v1	null
2022-03-22	Learning from All Vehicles	Dian Chen et.al.	2203.11934v1	link
2022-03-22	Dataset Distillation by Matching Training Trajectories	George Cazenavette et.al.	2203.11932v1	link
2022-03-22	GradViT: Gradient Inversion of Vision Transformers	Ali Hatamizadeh et.al.	2203.11894v1	null
2022-03-22	AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network	Wooseok Lee et.al.	2203.11799v1	link
2022-03-22	Exploring and Evaluating Image Restoration Potential in Dynamic Scenes	Cheng Zhang et.al.	2203.11754v2	link
2022-03-22	FedDC: Federated Learning with Non-IID Data via Local Drift Decoupling and Correction	Liang Gao et.al.	2203.11751v1	link
2022-03-22	Meta-attention for ViT-backed Continual Learning	Mengqi Xue et.al.	2203.11684v1	link
2022-03-22	Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos	Tomáš Souček et.al.	2203.11637v1	link
2022-03-22	IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment	Yiming Zeng et.al.	2203.11590v1	link
2022-03-22	Out-of-distribution Generalization with Causal Invariant Transformations	Ruoyu Wang et.al.	2203.11528v2	null
2022-03-22	TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers	Xuyang Bai et.al.	2203.11496v1	link
2022-03-22	Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation	Jiankun Li et.al.	2203.11483v1	link
2022-03-22	Mixed Differential Privacy in Computer Vision	Aditya Golatkar et.al.	2203.11481v1	null
2022-03-22	Remember Intentions: Retrospective-Memory-based Trajectory Prediction	Chenxin Xu et.al.	2203.11474v1	link
2022-03-22	Federated Class-Incremental Learning	Jiahua Dong et.al.	2203.11473v1	link
2022-03-22	Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization	Yu Zhan et.al.	2203.11471v1	link
2022-03-21	Global Matching with Overlapping Attention for Optical Flow Estimation	Shiyu Zhao et.al.	2203.11335v1	link
2022-03-21	NeRFusion: Fusing Radiance Fields for Large-Scale Scene Reconstruction	Xiaoshuai Zhang et.al.	2203.11283v1	null
2022-03-21	Transforming Model Prediction for Tracking	Christoph Mayer et.al.	2203.11192v1	link
2022-03-21	DiffPoseNet: Direct Differentiable Camera Pose Estimation	Chethan M. Parameshwara et.al.	2203.11174v1	null
2022-03-21	Not All Points Are Equal: Learning Highly Efficient Point-based Detectors for 3D LiDAR Point Clouds	Yifan Zhang et.al.	2203.11139v1	link
2022-03-21	No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces	Jia-Xing Zhong et.al.	2203.11113v1	link
2022-03-21	MixFormer: End-to-End Tracking with Iterative Mixed Attention	Yutao Cui et.al.	2203.11082v1	link
2022-03-21	MonoDTR: Monocular 3D Object Detection with Depth-Aware Transformer	Kuan-Chih Huang et.al.	2203.10981v1	link
2022-03-21	Unified Multivariate Gaussian Mixture for Efficient Neural Image Compression	Xiaosu Zhu et.al.	2203.10897v1	link
2022-03-21	Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective	Jiawei Zhang et.al.	2203.10887v1	link
2022-03-21	ELIC: Efficient Learned Image Compression with Unevenly Grouped Space-Channel Contextual Adaptive Coding	Dailan He et.al.	2203.10886v1	null
2022-03-21	RGB-Depth Fusion GAN for Indoor Depth Completion	Haowen Wang et.al.	2203.10856v1	null
2022-03-21	Hyperbolic Vision Transformers: Combining Improvements in Metric Learning	Aleksandr Ermolov et.al.	2203.10833v2	link
2022-03-21	ViM: Out-Of-Distribution with Virtual-logit Matching	Haoqi Wang et.al.	2203.10807v1	link
2022-03-21	Delving into the Estimation Shift of Batch Normalization in a Network	Lei Huang et.al.	2203.10778v1	link
2022-03-21	Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation	Zhiyuan Liang et.al.	2203.10739v1	null
2022-03-21	HP-Capsule: Unsupervised Face Part Discovery by Hierarchical Parsing Capsule Network	Chang Yu et.al.	2203.10699v1	null
2022-03-20	Unsupervised Domain Adaptation for Nighttime Aerial Tracking	Junjie Ye et.al.	2203.10541v1	link
2022-03-20	Depth Estimation by Combining Binocular Stereo and Monocular Structured-Light	Yuhua Xu et.al.	2203.10493v1	null
2022-03-20	SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization	Canjie Luo et.al.	2203.10492v1	link
2022-03-20	TVConv: Efficient Translation Variant Convolution for Layout-aware Visual Processing	Jierun Chen et.al.	2203.10489v1	link
2022-03-20	Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data	Junfeng Lyu et.al.	2203.10474v1	link
2022-03-19	CLRNet: Cross Layer Refinement Network for Lane Detection	Tu Zheng et.al.	2203.10350v1	null
2022-03-19	Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection from Point Clouds	Chenhang He et.al.	2203.10314v1	link
2022-03-19	DirecFormer: A Directed Attention in Transformer Approach to Robust Action Recognition	Thanh-Dat Truong et.al.	2203.10233v1	link
2022-03-19	SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition	Mingxin Huang et.al.	2203.10209v1	link
2022-03-18	Discovering Objects that Can Move	Zhipeng Bao et.al.	2203.10159v1	null
2022-03-18	Fourier Document Restoration for Robust Document Dewarping and Recognition	Chuhui Xue et.al.	2203.09910v1	null
2022-03-18	Learning Affordance Grounding from Exocentric Images	Hongchen Luo et.al.	2203.09905v1	link
2022-03-18	DTA: Physical Camouflage Attacks using Differentiable Transformation Network	Naufal Suryanto et.al.	2203.09831v1	null
2022-03-18	Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?	Cho-Ying Wu et.al.	2203.09824v1	null
2022-03-18	Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation	Xingning Dong et.al.	2203.09811v1	link
2022-03-18	Sparse Fuse Dense: Towards High Quality 3D Detection with Depth Completion	Xiaopei Wu et.al.	2203.09780v1	null
2022-03-18	ContrastMask: Contrastive Learning to Segment Every Thing	Xuehui Wang et.al.	2203.09775v1	null
2022-03-18	Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation	Ruihuang Li et.al.	2203.09744v1	link
2022-03-18	A Dual Weighting Label Assignment Scheme for Object Detection	Shuai Li et.al.	2203.09730v1	link
2022-03-18	VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention	Shengheng Deng et.al.	2203.09704v1	link
2022-03-17	Regional Semantic Contrast and Aggregation for Weakly Supervised Semantic Segmentation	Tianfei Zhou et.al.	2203.09653v1	link
2022-03-17	Cascade Transformers for End-to-End Person Search	Rui Yu et.al.	2203.09642v1	link
2022-03-17	AutoSDF: Shape Priors for 3D Completion, Reconstruction and Generation	Paritosh Mittal et.al.	2203.09516v1	null
2022-03-17	FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos	Yan Wang et.al.	2203.09463v1	null
2022-03-17	Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image	Xuanchi Ren et.al.	2203.09457v1	null
2022-03-17	Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces from 3D MRI Scans with Geometric Deep Neural Networks	Fabian Bongratz et.al.	2203.09446v2	null
2022-03-17	ZebraPose: Coarse to Fine Surface Encoding for 6DoF Object Pose Estimation	Yongzhi Su et.al.	2203.09418v1	link
2022-03-17	Bi-directional Object-context Prioritization Learning for Saliency Ranking	Xin Tian et.al.	2203.09416v1	link
2022-03-17	A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution	Jianqi Ma et.al.	2203.09388v2	link
2022-03-17	Interacting Attention Graph for Single Image Two-Hand Reconstruction	Mengcheng Li et.al.	2203.09364v2	null
2022-03-17	Object Localization under Single Coarse Point Supervision	Xuehui Yu et.al.	2203.09338v1	link
2022-03-17	Modulated Contrast for Versatile Image Synthesis	Fangneng Zhan et.al.	2203.09333v1	link
2022-03-17	Fine-tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning	Lin Zhang et.al.	2203.09249v1	null
2022-03-17	Neural Compression-Based Feature Learning for Video Restoration	Cong Huang et.al.	2203.09208v2	null
2022-03-17	Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution	Jie Liang et.al.	2203.09195v1	link
2022-03-17	MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering	Yang Ding et.al.	2203.09138v1	link
2022-03-17	Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning	Haoxiang Wang et.al.	2203.09137v1	link
2022-03-17	Improving the Transferability of Targeted Adversarial Examples through Object-Based Diverse Input	Junyoung Byun et.al.	2203.09123v1	link
2022-03-17	Attribute Surrogates Learning and Spectral Tokens Pooling in Transformers for Few-shot Learning	Yangji He et.al.	2203.09064v1	link
2022-03-17	DATA: Domain-Aware and Task-Aware Pre-training	Qing Chang et.al.	2203.09041v1	link
2022-03-16	Decoupled Knowledge Distillation	Borui Zhao et.al.	2203.08679v1	null
2022-03-16	Deep vanishing point detection: Geometric priors make dataset variations vanish	Yancong Lin et.al.	2203.08586v1	link
2022-03-16	EDTER: Edge Detection with Transformer	Mengyang Pu et.al.	2203.08566v1	link
2022-03-16	MonoJSG: Joint Semantic and Geometric Cost Volume for Monocular 3D Object Detection	Qing Lian et.al.	2203.08563v1	null
2022-03-16	Non-isotropy Regularization for Proxy-based Deep Metric Learning	Karsten Roth et.al.	2203.08547v1	link
2022-03-16	Integrating Language Guidance into Vision-based Deep Metric Learning	Karsten Roth et.al.	2203.08543v1	link
2022-03-16	Scribble-Supervised LiDAR Semantic Segmentation	Ozan Unal et.al.	2203.08537v1	link
2022-03-16	Capturing Humans in Motion: Temporal-Attentive 3D Human Pose and Shape Estimation from Monocular Video	Wen-Li Wei et.al.	2203.08534v1	null
2022-03-16	Physical Inertial Poser (PIP): Physics-aware Real-time Human Motion Tracking from Sparse Inertial Sensors	Xinyu Yi et.al.	2203.08528v2	null
2022-03-16	Towards Practical Certifiable Patch Defense with Vision Transformer	Zhaoyu Chen et.al.	2203.08519v1	null
2022-03-16	QS-Attn: Query-Selected Attention for Contrastive Learning in I2I Translation	Xueqi Hu et.al.	2203.08483v1	link
2022-03-16	Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding	Haojun Jiang et.al.	2203.08481v1	link
2022-03-16	The Devil Is in the Details: Window-based Attention for Image Compression	Renjie Zou et.al.	2203.08450v1	link
2022-03-16	Attribute Group Editing for Reliable Few-shot Image Generation	Guanqi Ding et.al.	2203.08422v1	link
2022-03-16	Privacy-preserving Online AutoML for Domain-Specific Face Detection	Chenqian Yan et.al.	2203.08399v1	null
2022-03-16	Represent, Compare, and Learn: A Similarity-Aware Framework for Class-Agnostic Counting	Min Shi et.al.	2203.08354v1	null
2022-03-15	DeepFusion: Lidar-Camera Deep Fusion for Multi-Modal 3D Object Detection	Yingwei Li et.al.	2203.08195v1	null
2022-03-15	Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from the Decision Boundary Perspective	Gowthami Somepalli et.al.	2203.08124v1	link
2022-03-15	Implicit Feature Decoupling with Depthwise Quantization	Iordanis Fostiropoulos et.al.	2203.08080v1	link
2022-03-15	OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction	Wenbin Lin et.al.	2203.07977v1	null
2022-03-15	Style Transformer for Image Inversion and Editing	Xueqi Hu et.al.	2203.07932v1	link
2022-03-15	GPV-Pose: Category-level Object Pose Estimation via Geometry-guided Point-wise Voting	Yan Di et.al.	2203.07918v1	link
2022-03-15	Interspace Pruning: Using Adaptive Filter Representations to Improve Training of Sparse CNNs	Paul Wimmer et.al.	2203.07808v1	null
2022-03-15	Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels	Yikai Wang et.al.	2203.07788v1	null
2022-03-15	Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization	Yabin Zhang et.al.	2203.07740v1	link
2022-03-15	Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation	Zitian Wang et.al.	2203.07697v2	null
2022-03-15	Learning What Not to Segment: A New Perspective on Few-Shot Segmentation	Chunbo Lang et.al.	2203.07615v1	link
2022-03-14	Implicit Motion Handling for Video Camouflaged Object Detection	Xuelian Cheng et.al.	2203.07363v2	null
2022-03-14	GCFSR: a Generative and Controllable Face Super Resolution Method Without Facial and GAN Priors	Jingwen He et.al.	2203.07319v1	null
2022-03-14	RCL: Recurrent Continuous Localization for Temporal Action Detection	Qiang Wang et.al.	2203.07112v1	null
2022-03-14	Active Learning by Feature Mixing	Amin Parvaneh et.al.	2203.07034v1	link
2022-03-14	Rethinking Minimal Sufficient Representation in Contrastive Learning	Haoqing Wang et.al.	2203.07004v1	link
2022-03-14	Blind2Unblind: Self-Supervised Image Denoising with Visible Blind Spots	Zejin Wang et.al.	2203.06967v2	link
2022-03-14	UniVIP: A Unified Framework for Self-Supervised Visual Pre-training	Zhaowen Li et.al.	2203.06965v1	null
2022-03-14	Forward Compatible Few-Shot Class-Incremental Learning	Da-Wei Zhou et.al.	2203.06953v1	link
2022-03-14	XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding	Zhangxuan Gu et.al.	2203.06947v2	null
2022-03-14	Accelerating DETR Convergence via Semantic-Aligned Matching	Gongjie Zhang et.al.	2203.06883v1	link
2022-03-14	ADAS: A Direct Adaptation Strategy for Multi-Target Domain Adaptive Semantic Segmentation	Seunghun Lee et.al.	2203.06811v1	null
2022-03-13	Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs	Xiaohan Ding et.al.	2203.06717v1	link
2022-03-13	LAS-AT: Adversarial Training with Learnable Attack Strategy	Xiaojun Jia et.al.	2203.06616v1	link
2022-03-13	Depth-Aware Generative Adversarial Network for Talking Head Video Generation	Fa-Ting Hong et.al.	2203.06605v2	null
2022-03-13	AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation	Xueyi Liu et.al.	2203.06558v1	null
2022-03-13	Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning	Jiahao Xia et.al.	2203.06541v1	link
2022-03-12	Kernel Proposal Network for Arbitrary Shape Text Detection	Shi-Xue Zhang et.al.	2203.06410v1	null
2022-03-12	SIGMA: Semantic-complete Graph Matching for Domain Adaptive Object Detection	Wuyang Li et.al.	2203.06398v1	link
2022-03-12	Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning	Kai Zhu et.al.	2203.06359v1	null
2022-03-12	Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation	Linfeng Zhang et.al.	2203.06321v1	null
2022-03-12	MISF: Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting	Xiaoguang Li et.al.	2203.06304v1	link
2022-03-11	REX: Reasoning-aware and Grounded Explanation	Shi Chen et.al.	2203.06107v1	link
2022-03-11	Enhancing Adversarial Training with Second-Order Statistics of Weights	Gaojie Jin et.al.	2203.06020v1	link
2022-03-11	Hyperbolic Image Segmentation	Mina GhadimiAtigh et.al.	2203.05898v1	link
2022-03-11	WiCV 2021: The Eighth Women In Computer Vision Workshop	Arushi Goel et.al.	2203.05825v1	null
2022-03-11	FLAG: Flow-based 3D Avatar Generation from Sparse Observations	Sadegh Aliakbarian et.al.	2203.05789v1	null
2022-03-11	Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection	Siyue Yu et.al.	2203.05787v1	null
2022-03-11	Learning Distinctive Margin toward Active Domain Adaptation	Ming Xie et.al.	2203.05738v1	null
2022-03-10	Point Density-Aware Voxels for LiDAR 3D Object Detection	Jordan S. K. Hu et.al.	2203.05662v1	link
2022-03-10	Conditional Prompt Learning for Vision-Language Models	Kaiyang Zhou et.al.	2203.05557v1	link
2022-03-10	Representation Compensation Networks for Continual Semantic Segmentation	Chang-Bin Zhang et.al.	2203.05402v1	link
2022-03-10	Spatial Commonsense Graph for Object Localisation in Partial Scenes	Francesco Giuliari et.al.	2203.05380v1	link
2022-03-10	Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing	Zhuo Wang et.al.	2203.05340v2	null
2022-03-10	Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects	Manuel Stoiber et.al.	2203.05334v1	link
2022-03-10	GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains	Lei Fan et.al.	2203.05306v1	null
2022-03-10	Contrastive Boundary Learning for Point Cloud Segmentation	Liyao Tang et.al.	2203.05272v2	link
2022-03-10	Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement	Xiuwei Xu et.al.	2203.05238v1	link
2022-03-10	Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability	Ruifei He et.al.	2203.05180v1	link
2022-03-10	Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack	Ye Liu et.al.	2203.05154v1	link
2022-03-10	Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity	Cheng Luo et.al.	2203.05151v1	null
2022-03-10	OpenTAL: Towards Open Set Temporal Action Localization	Wentao Bao et.al.	2203.05114v1	link
2022-03-09	NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks	Fawaz Sammani et.al.	2203.05081v1	link
2022-03-09	Adaptive Trajectory Prediction via Transferable GNN	Yi Xu et.al.	2203.05046v1	null
2022-03-09	Neural Data-Dependent Transform for Learned Image Compression	Dezhao Wang et.al.	2203.04963v1	null
2022-03-09	What Matters For Meta-Learning Vision Regression Tasks?	Ning Gao et.al.	2203.04905v1	null
2022-03-09	How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting	Alessio Monti et.al.	2203.04781v1	null
2022-03-09	SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters	Albert Mosella-Montoro et.al.	2203.04746v1	null
2022-03-09	FlexIT: Towards Flexible Semantic Image Translation	Guillaume Couairon et.al.	2203.04705v1	null
2022-03-09	ChiTransformer:Towards Reliable Stereo from Cues	Qing Su et.al.	2203.04554v1	null
2022-03-08	Dynamic Dual-Output Diffusion Models	Yaniv Benny et.al.	2203.04304v1	null
2022-03-08	A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation	Yutong Chen et.al.	2203.04287v1	null
2022-03-08	Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences	Prune Truong et.al.	2203.04279v1	link
2022-03-08	End-to-End Semi-Supervised Learning for Video Action Detection	Akash Kumar et.al.	2203.04251v1	link
2022-03-08	Neural Face Identification in a 2D Wireframe Projection of a Manifold Object	Kehan Wang et.al.	2203.04229v1	link
2022-03-08	Selective-Supervised Contrastive Learning with Noisy Labels	Shikun Li et.al.	2203.04181v1	link
2022-03-08	Motron: Multimodal Probabilistic Human Motion Forecasting	Tim Salzmann et.al.	2203.04132v1	null
2022-03-08	E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation	Tao Zhang et.al.	2203.04074v1	link
2022-03-08	Shape-invariant 3D Adversarial Point Clouds	Qidong Huang et.al.	2203.04041v1	link
2022-03-08	DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos	Mathias Parger et.al.	2203.03996v1	null
2022-03-08	Contrastive Conditional Neural Processes	Zesheng Ye et.al.	2203.03978v1	null
2022-03-08	On Generalizing Beyond Domains in Cross-Domain Continual Learning	Christian Simon et.al.	2203.03970v1	null
2022-03-08	Generative Cooperative Learning for Unsupervised Video Anomaly Detection	Muhammad Zaigham Zaheer et.al.	2203.03962v1	null
2022-03-08	ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation	Robin Wang et.al.	2203.03888v1	link
2022-03-08	Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels	Yuchao Wang et.al.	2203.03884v1	null
2022-03-08	Weakly Supervised Semantic Segmentation using Out-of-Distribution Data	Jungbeom Lee et.al.	2203.03860v1	link
2022-03-08	Deep Rectangling for Image Stitching: A Learning Baseline	Lang Nie et.al.	2203.03831v1	link
2022-03-08	Shadows can be Dangerous: Stealthy and Effective Physical-world Adversarial Attack by Natural Phenomenon	Yiqi Zhong et.al.	2203.03818v2	link
2022-03-08	Generating 3D Bio-Printable Patches Using Wound Segmentation and Reconstruction to Treat Diabetic Foot Ulcers	Han Joo Chae et.al.	2203.03814v1	null
2022-03-08	Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild	Xuefeng Du et.al.	2203.03800v1	link
2022-03-07	Kubric: A scalable dataset generator	Klaus Greff et.al.	2203.03570v1	link
2022-03-07	Adversarial Texture for Fooling Person Detectors in the Physical World	Zhanhao Hu et.al.	2203.03373v2	null
2022-03-07	Interpretable part-whole hierarchies and conceptual-semantic relationships in neural networks	Nicola Garau et.al.	2203.03282v1	null
2022-03-07	MSDN: Mutually Semantic Distillation Network for Zero-Shot Learning	Shiming Chen et.al.	2203.03137v1	link
2022-03-07	Protecting Facial Privacy: Generating Adversarial Identity Masks via Style-robust Makeup Transfer	Shengshan Hu et.al.	2203.03121v1	null

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.github/workflows		.github/workflows
docs		docs
README.md		README.md
cv-arxiv-daily.json		cv-arxiv-daily.json
daily_arxiv.py		daily_arxiv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.github/workflows

.github/workflows

docs

docs

README.md

README.md

cv-arxiv-daily.json

cv-arxiv-daily.json

daily_arxiv.py

daily_arxiv.py

Repository files navigation

Updated on 2022.04.12

CVPR2022

About

Releases

Packages

Languages

DWCTOD/arXiv-CVPR2022-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2022.04.12

CVPR2022

About

Resources

Stars

Watchers

Forks

Languages