- View Selection for 3D Captioning via Diffusion Ranking - [Arxiv] [QA]
- Language Imbalance Can Boost Cross-lingual Generalisation - [Arxiv] [QA]
- OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments - [Arxiv] [QA]
- Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models - [Arxiv] [QA]
- Rho-1: Not All Tokens Are What You Need - [Arxiv] [QA]
- Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation for Efficient Synthesis and Verification - [Arxiv] [QA]
- On Unified Prompt Tuning for Request Quality Assurance in Public Code Review - [Arxiv] [QA]
- AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs - [Arxiv] [QA]
- High-Dimension Human Value Representation in Large Language Models - [Arxiv] [QA]
- Overparameterized Multiple Linear Regression as Hyper-Curve Fitting - [Arxiv] [QA]
- Fuss-Free Network: A Simplified and Efficient Neural Network for Crowd Counting - [Arxiv] [QA]
- Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese - [Arxiv] [QA]
- Sparse Laneformer - [Arxiv] [QA]
- From the Lab to the Theater: An Unconventional Field Robotics Journey - [Arxiv] [QA]
- Discourse-Aware In-Context Learning for Temporal Expression Normalization - [Arxiv] [QA]
- An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization - [Arxiv] [QA]
- Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations - [Arxiv] [QA]
- RMAFF-PSN: A Residual Multi-Scale Attention Feature Fusion Photometric Stereo Network - [Arxiv] [QA]
- Generating Synthetic Satellite Imagery With Deep-Learning Text-to-Image Models -- Technical Challenges and Implications for Monitoring and Verification - [Arxiv] [QA]
- Mitigating Vulnerable Road Users Occlusion Risk Via Collective Perception: An Empirical Analysis - [Arxiv] [QA]
- Reframing the Mind-Body Picture: Applying Formal Systems to the Relationship of Mind and Matter - [Arxiv] [QA]
- Reflectance Estimation for Proximity Sensing by Vision-Language Models: Utilizing Distributional Semantics for Low-Level Cognition in Robotics - [Arxiv] [QA]
- Chaos in Motion: Unveiling Robustness in Remote Heart Rate Measurement through Brain-Inspired Skin Tracking - [Arxiv] [QA]
- Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns - [Arxiv] [QA]
- Finding Dino: A plug-and-play framework for unsupervised detection of out-of-distribution objects using prototypes - [Arxiv] [QA]
- Papers for 2024
- Papers for 2023
- Papers for 2022
- Papers for 2021
- Papers for 2020
- Papers for 2019
- Papers for 2018
- Papers for 2017
- Papers for 2016
- Papers for 2015
- Papers for 2014
- Papers for 2013
- Papers for 2012
- Papers for 2010
- Papers for 2009
This project is made possible through the generous support of Anthropic, who provided free access to the Claude-2.1
API.