SCUT-DLVCLab

Deep Learning and Vision Computing Lab, South China University of Technology

About Us 🚀

The Deep Learning and Vision Computing Lab is dedicated to theoretical research and innovative applications in artificial intelligence, computer vision, machine learning, and pattern recognition. Our current research focuses on deep learning, text detection and recognition, document analysis and understanding, and artificial intelligence. In recent years, our team has led more than 30 national and provincial research projects, with significant results in optical character recognition (OCR), handwriting recognition, gesture recognition and interaction technology, and innovative applications of deep learning. We have published over 300 SCI/EI-indexed papers, been granted more than 50 invention patents, won 5 provincial and ministerial science and technology awards, and placed first in international academic competitions 4 times.

Pinned

  1. TongGu-LLM Public

    [EMNLP 2024] TongGu, a classical Chinese language model.

    36 stars, 1 fork

  2. GPT-4V_OCR Public

    Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)

    Python, 124 stars, 4 forks

  3. Document-AI-Recommendations Public

    Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.

    190 stars, 7 forks

  4. SCUT-EnsExam Public

    SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper images. The dataset is randomly divided into training set and …

    11 stars

  5. RFUND Public

    [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"

    19 stars

  6. HisDoc1B Public

    10 stars, 1 fork

Repositories

Showing 10 of 19 repositories
  • OCR-Reasoning Public
    Python, 17 stars, Apache-2.0, 0 forks, 0 issues, 0 pull requests, updated May 19, 2025
  • LongHisDoc Public

    A Comprehensive Benchmark for Chinese Long Historical Document Understanding

    Python, 1 star, 0 forks, 0 issues, 0 pull requests, updated May 18, 2025
  • DOLPHIN Public

    [IEEE TIFS 2024] Official repository of "Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach".

    Python, 9 stars, GPL-3.0, 0 forks, 0 issues, 0 pull requests, updated May 17, 2025
  • AutoScaler Public

    The official GitHub page of "AutoScaler: Self Scale Alignment for Handwritten Mathematical Expression Recognition"

    Python, 2 stars, 0 forks, 0 issues, 0 pull requests, updated May 15, 2025
  • MegaHan97K Public

    [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"

    Python, 19 stars, 0 forks, 0 issues, 0 pull requests, updated May 12, 2025
  • ACP-RAG Public

    [NAACL 2025] Large-Scale Corpus Construction and Retrieval-Augmented Generation for Ancient Chinese Poetry: New Method and Data Insights (ACP-Corpus; ACP-QA; ACP-RAG)

    Python, 3 stars, 0 forks, 0 issues, 0 pull requests, updated May 6, 2025
  • MCS-Bench Public
    Python, 1 star, 0 forks, 0 issues, 0 pull requests, updated May 6, 2025
  • C3bench Public

    C3 benchmark

    2 stars, 0 forks, 1 issue, 0 pull requests, updated Mar 30, 2025
  • PAVENet Public

    [IEEE TPAMI 2025] Official repository of "Privacy-Preserving Biometric Verification With Handwritten Random Digit String".

    Python, 5 stars, GPL-3.0, 0 forks, 0 issues, 0 pull requests, updated Mar 18, 2025
  • HisDoc1B Public
    10 stars, 1 fork, 1 issue, 0 pull requests, updated Mar 2, 2025
