Skip to content

icey-zhang/notebook

Repository files navigation

notebook

工具介绍:

【markdown的教程】

【期刊名字】

【thop的使用】

【HuggingFace的使用】

论文阅读:

  1. GeminiFusion | GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer
  2. Qwen-VL | Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
  3. ODGEN | ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
  4. ControlNets | Adding Conditional Control to Text-to-Image Diffusion Models
  5. YOLO-World | YOLO-World: Real-Time Open-Vocabulary Object Detection

Releases

No releases published

Packages

No packages published