Skip to content
Change the repository type filter

All

    Repositories list

    • [TMM‘24]Hybrid Graph Reasoning with Dynamic Interaction for Visual Dialog
      Python
      MIT License
      0300Updated Aug 28, 2024Aug 28, 2024
    • MEP3P

      Public
      [TCSVT'24]Multi-modal Large Language Model Enhanced Pseudo 3D Perception Framework for Visual Commonsense Reasoning
      Python
      MIT License
      0000Updated Jul 14, 2024Jul 14, 2024
    • TLPK

      Public
      [TCSVT'23]Transductive Learning with Prior Knowledge for Generalized Zero-shot Action Recognition
      Python
      0500Updated Feb 7, 2024Feb 7, 2024
    • MSGT

      Public
      [TMM'23]Multi-modal Structure-embedding Graph Transformer for Visual Commonsense Reasoning
      Python
      MIT License
      0310Updated Feb 7, 2024Feb 7, 2024
    • SRSC

      Public
      [TMM'23]Self-supervised Video Representation Learning by Serial Restoration with Elastic Complexity
      Python
      0100Updated Feb 7, 2024Feb 7, 2024
    • MCLOS

      Public
      [ISCAS'24]Memory-Based Contrastive Learning with Optimized Sampling for Incremental Few-Shot Semantic Segmentation
      Python
      MIT License
      0200Updated Jan 27, 2024Jan 27, 2024
    • MCBD

      Public
      [TIP'23]Multi-level Content-aware Boundary Detection for Temporal Action Proposal Generation
      Python
      1200Updated Nov 10, 2023Nov 10, 2023
    • SaGAN

      Public
      [ICIP'23]Structure-aware Generative Adversarial Network for Text-to-image Generation
      Jupyter Notebook
      0500Updated Jul 11, 2023Jul 11, 2023
    • TDVC

      Public
      [TMM'2023]Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization
      Python
      1310Updated May 17, 2023May 17, 2023
    • KAGS

      Public
      [TPAMI'2023]Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
      Python
      MIT License
      21030Updated Jan 3, 2023Jan 3, 2023
    • IR-VQA

      Public
      [Electronics Letters'21]Improving Reasoning with Contrastive Visual Information for Visual Question Answering
      Python
      MIT License
      1001Updated Dec 11, 2022Dec 11, 2022
    • GAACNN

      Public
      [TCSVT'22]Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression
      Python
      2910Updated Nov 14, 2022Nov 14, 2022
    • UMFN

      Public
      [Electronics Letters'22]Unified multi-stage fusion network for affective video content analysis
      Python
      0000Updated Oct 9, 2022Oct 9, 2022
    • EmVidCap

      Public
      [TMM'22]Emotion Expression with Fact Transfer for Video Description
      Python
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • ML-iFSOD

      Public
      [TCSVT'22]Meta-Learning Based Incremental Few-Shot Object Detection
      Python
      MIT License
      32020Updated Sep 14, 2022Sep 14, 2022
    • MCRGN

      Public
      [TCDS'22]Multi-scale Conditional Relationship Graph Network for Referring Relationships in Images
      Python
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • CoVS

      Public
      [TCSVT'22]Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention
      Python
      0010Updated Sep 14, 2022Sep 14, 2022
    • [TMM'21]CaptionNet: A Tailor-made Recurrent Neural Network for Generating Image Descriptions
      Python
      MIT License
      0100Updated Sep 14, 2022Sep 14, 2022
    • MABAN

      Public
      [TIP'21]MABAN: Multi-Agent Boundary-Aware Network for Natural Language Moment Retrieval
      Python
      MIT License
      1210Updated Sep 14, 2022Sep 14, 2022
    • BLJND

      Public
      [TOMM'21]Perceptual Image Compression with Block-level Just Noticeable Difference Prediction
      MATLAB
      MIT License
      0300Updated Sep 14, 2022Sep 14, 2022
    • AFRN

      Public
      [TMM'20]Affective Video Content Analysis with Adaptive Fusion Recurrent Network
      Python
      MIT License
      0310Updated Sep 14, 2022Sep 14, 2022
    • PWMSE

      Public
      [TBC'20]Perceptually Weighted Mean Squared Error Based Rate-Distortion Optimization for HEVC
      MIT License
      1100Updated Sep 14, 2022Sep 14, 2022
    • StruPyNet

      Public
      [MMM'20]Structural Pyramid Network for Cascaded Optical Flow Estimation
      Cuda
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • MML

      Public
      [Multimed. Tools. Appl.'19]Multi-modal Learning for Affective Content Analysis in Movies
      C++
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • MPS

      Public
      [TMM'19]A Multi-grained Parallel Solution for HEVC Encoding on Heterogeneous Platforms
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • [TMM'18]A Collaborative Scheduling-Based Parallel Solution for HEVC Encoding on Multicore Platforms
      MIT License
      0010Updated Sep 14, 2022Sep 14, 2022
    • MKTBC

      Public
      [The Visual Computer'18]Motion Keypoint Trajectory and Covariance Descriptor for Human Action Recognition
      C++
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • G-MS2F

      Public
      [Neurocomputing'17]G-MS2F: GoogLeNet Based Multi-Stage Feature Fusion of Deep CNN for Scene Recognition
      Jupyter Notebook
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • [Computers and Electrical Engineering'17]Richer Feature for Image Classification with Super and Sub Kernels Based on Deep Convolutional Neural Network
      Jupyter Notebook
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022
    • CHCF

      Public
      [TCSVT'15]CHCF: A Cloud-based Heterogeneous Computing Framework for Large-Scale Image Retrieval
      Shell
      MIT License
      0000Updated Sep 14, 2022Sep 14, 2022