[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
-
Updated
Dec 10, 2024 - Python
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Implementation of "Disentangled Motion Modeling for Video Frame Interpolation", AAAI 2025
[AAAI2025] Official repo for paper "MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls"
[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
[AAAI 2025] SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks
[AAAI 2025] MonoBox: Tightness-free Box-supervised Polyp Segmentation using Monotonicity Constraint
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning", AAAI 2025
The code for the paper "Efficient Self-Supervised Video Hashing with Selective State Spaces" (AAAI'25).
AAAI 2025 | A2RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion
Official Implementation (Pytorch) of "Super-class guided Transformer for Zero-Shot Attribute Classification", AAAI 2025
Official PyTorch Implementation of 'Entropy-Guided Attention for Private LLMs' (PPAI Workshop. AAAI 2025)
[AAAI2025] Code for ViFactCheck: A New Benchmark Dataset and Methods for Multi-domain News Fact-Checking in Vietnamese
Implementation of CODE: Confident Ordinary Differential Editing
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)
Source code for the AAAI 2025 paper "TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents."
[AAAI 2025] Representing Sounds as Neural Amplitude Fields: A Benchmark of Coordinate-MLPs and A Fourier Kolmogorov-Arnold Framework
Code repo for the paper "PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation", accepted to AAAI-25.
[AAAI 2025] CDM-PSL: Expensive Multi-Objective Bayesian Optimization Based on Diffusion Models
Deep CBC Models for Prototype Based Interpretability Benchmarks
[AAAI 2025] Towards Audio-visual Navigation in Noisy Environments: A Large-scale Benchmark Dataset and An Architecture Considering Multiple Sound-Sources
Add a description, image, and links to the aaai2025 topic page so that developers can more easily learn about it.
To associate your repository with the aaai2025 topic, visit your repo's landing page and select "manage topics."