Skip to content

Latest commit

 

History

History
141 lines (105 loc) · 9.22 KB

README.md

File metadata and controls

141 lines (105 loc) · 9.22 KB
An introduction to AI methods for flying agents (birds, UAVs, etc.)

🧺 Table of Contents

📰 News

  • 👀 (2023/07/08) A review paper on "Single object tracking (SOT) for UAV" has been accepted by Artificial Intelligence Review [Paper] [Code]. Don't miss it if you are interested in single object tracking on UAV platforms!

  • 🚀 (2023/07/04) We present SAM-DA [paper] [project], a SAM-powered domain adaptation method for nighttime UAV tracking! With only 2.4 hours of training, the performance surpasses baseline with 12 hours of training, enabling the rapid deployment of nighttime UAV trakcing methods!

🐦 Bird flock

Awesome bird paper
  • (Nature communication) Behavioural plasticity and the transition to order in jackdaw flocks. [paper]
  • (Nature ecology & evolution) Costs and benefits of social relationships in the collective motion of bird flocks. [paper]

📓 Related works

👬 Awesome multi-object tracking (MOT)


2023 awesome mot paper
  • (2023 CVPR) Tracking Multiple Deformable Objects in Egocentric Videos. [paper][project]

    New task: track multiple deformable objects using smart glasses.
    New dataset: DogThruGlasses, 150 videos and 73K annotated frames from smart glasses.

  • (2023 CVPR) Referring Multi-Object Tracking. [paper][project]

    New task: employ a language expression as a semantic cue to track specific objects.
    New dataset: Refer-KITTI, 18 videos with 818 expressions.

  • (2023 CVPR) MotionTrack: Learning Robust Short-term and Long-term Motions for Multi-Object Tracking. [paper]

    Task: associate detected objects between frames by modeling object motion with attention.

  • (2023 CVPR) UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement. [paper]

    Task: associate detected objects between frames by modeling object motion with attention.

  • (2023 CVPR) OVTrack: Open-Vocabulary Multiple Object Tracking [paper][code]

    New task: open-vocabulary MOT, with vision-language models for classification and association and diffusion models for appearance learning. An interesting data hallucination strategy using Stable Diffusion for training data generation.

  • (2023 arxiv) SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-Depth. [paper][project]

    Task: combine identity learning with detection.

2022 awesome mot paper
  • (2022 T-PAMI) TransCenter: Transformers with Dense Representations for Multiple-Object Tracking.[paper][project]

    Task: use dense detection queries and efficient sparse tracking queries for MOT with PVT backbone.

  • (2022 CVPR) TrackFormer: Multi-Object Tracking with Transformers. [paper] [project]

    Task: formulate MOT as a frame-to-frame set prediction problem.

  • (2022 ECCV) ByteTrack: Multi-Object Tracking by Associating Every Detection Box. [paper][project]

    Task: using YOLOX as a strong detector.

  • (2022 ECCV) Tracking Objects as Pixel-wise Distributions. [paper][project]

    Task: propagates pixel-wise features guided by flow information to pass messages between frames.

🏃 Awesome tracking any point (TAP)


2023 awesome tap paper
  • (2023 arxiv) CoTracker: It is Better to Track Together. [paper][project]

    Task: jointly track multiple points throughout an entire video.

  • (2023 arxiv) Segment Anything Meets Point Tracking. [paper][code]

    Task: Zero-shot tracking and segmenting anything in dynamic videos, using SAM and PIPs.

  • (2023 arxiv) Tracking Everything Everywhere All at Once. [paper][project]

    New task: estimate dense and long-range motion from a video sequence during inference.

  • (2023 arxiv) TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement. [paper][project]

    New task: online and fast point tracking with matching and refinement.

2022 awesome tap paper
  • (2022 NeurIPS) TAP-Vid: A Benchmark for Tracking Any Point in a Video. [paper][code]

    New Task: tracking arbitrary physical points on surfaces over longer video clips.
    New benchmark: TAP-Vid, with both real and synthetic videos, including TAP-Vid-Kinetics, TAP-Vid-DAVIS, TAP-Vid-Kubric, and TAP-Vid-RGB-Stacking.

  • (2022 ECCV oral) Particle Video Revisited: Tracking Through Occlusions Using Point Trajectories. [paper][code]

    New Task: Long-range motion estimation using dense cost maps, iterative optimization, and learned appearance updates.

Others

2020 awesome tracking paper * (2020 NeurIPS) Space-Time Correspondence as a Contrastive Random Walk. [[paper]](https://proceedings.neurips.cc/paper/2020/hash/e2ef524fbf3d9fe611d5a8e90fefdc9c-Abstract.html)[[project]](https://ajabri.github.io/videowalk/) > Task: obtain the correspondence of image patches by unsupervised learning from a palindrome sequence, where the query and target are the same.

📽️ Recommended resources

  • Earthflight -- BBC, 2011, 6 episodes.

    "A British nature documentary that shows a flight from the view of the wings of birds across six continents, showing some of the world's greatest natural spectacles from a bird's-eye view. The BBC series was created by John Downer and narrated by David Tennant and consisted of six 60-minute episodes. The first episode aired on BBC One on 29 December 2011." -- from Wikipedia

  • IOC World Bird List -- International Ornithological Congress, updating

    "The IOC World Bird List is an open access resource of the international community of ornithologists. Our primary goal is to facilitate worldwide communication in ornithology and conservation based on an up-to-date evolutionary classification of world birds and a set of English names that follow explicit guidelines for spelling and construction." --from IOC World Bird List

🙏 Acknowledgement