# iFLYTEK AI Competitions - Multimodal RAG
**Link: https://challenge.xfyun.cn/topic/info?type=Multimodal-RAG-QA&ch=dwsf2517**

Goal:
Build a Retrieval-Augmented Generation (RAG) system for multimodal QA using a PDF knowledge base with text & images.

Tasks:
1. Understand natural language queries + visual/text content.
2. Retrieve relevant text, charts, images from PDFs.
3. Fuse & reason over image + text.
4. Generate accurate, concise, source-cited answers.
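
As a rough illustration of task 2 only, the sketch below ranks pages by character-bigram overlap with the question. It is a toy text-only baseline, not the competition's method: the `pages` mapping from `(filename, page)` to already-extracted page text is a hypothetical input, and images and charts are ignored entirely.

```python
from collections import Counter


def char_bigrams(text: str) -> Counter:
    """Count character bigrams, ignoring whitespace (no word segmenter needed)."""
    chars = [c for c in text if not c.isspace()]
    return Counter(a + b for a, b in zip(chars, chars[1:]))


def retrieve(question: str, pages: dict, top_k: int = 1) -> list:
    """Return the top_k (filename, page) keys ranked by bigram overlap.

    `pages` is a hypothetical mapping {(filename, page): page_text}.
    """
    query = char_bigrams(question)
    scored = []
    for key, text in pages.items():
        # Counter intersection keeps the minimum count of each shared bigram.
        overlap = sum((query & char_bigrams(text)).values())
        scored.append((overlap, key))
    # Highest overlap first; ties broken by key for a deterministic order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [key for _, key in scored[:top_k]]
```

The returned key supplies the `filename` and `page` predictions directly; a real system would likely replace the overlap score with a stronger retriever and add an image-aware step.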

Data:
- train.json (training set)
- test.json (predict `page`, `filename`, `answer`)
- sample_submit.json (submission format)
- financial_reports_database.zip (PDFs)
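
A minimal sketch for inspecting and unpacking the PDF archive with the standard library. The internal layout of `financial_reports_database.zip` is not described here, so the code simply collects every member whose name ends in `.pdf`; the function names are illustrative.

```python
import zipfile


def list_pdfs(zip_path: str) -> list[str]:
    """Return the sorted names of all PDF members inside the archive."""
    with zipfile.ZipFile(zip_path) as archive:
        return sorted(
            name for name in archive.namelist()
            if name.lower().endswith(".pdf")
        )


def extract_pdfs(zip_path: str, out_dir: str) -> list[str]:
    """Extract only the PDF members into out_dir and return their names."""
    names = list_pdfs(zip_path)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(out_dir, members=names)
    return names
```

If Chinese file names come out garbled, the archive may have been written without the UTF-8 flag; on Python 3.11+ the `metadata_encoding` argument of `zipfile.ZipFile` may help.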

Scoring (max 1.0 per Q):
- Page match: 0.25
- Filename match: 0.25
- Answer similarity (Jaccard): 0.5
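
The weights above can be turned into a local scoring function for validation. This is a sketch under two assumptions that the rules summarized here do not confirm: Jaccard similarity is computed over sets of characters (the official tokenization is not stated), and page and filename are scored independently of each other.

```python
def jaccard(a: str, b: str) -> float:
    """Jaccard similarity over character sets (assumed tokenization)."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0  # two empty answers are treated as identical
    return len(set_a & set_b) / len(set_a | set_b)


def score_question(pred: dict, gold: dict) -> float:
    """Score one prediction against its reference; the maximum is 1.0."""
    score = 0.0
    if pred["page"] == gold["page"]:
        score += 0.25
    if pred["filename"] == gold["filename"]:
        score += 0.25
    score += 0.5 * jaccard(pred["answer"], gold["answer"])
    return score
```

For example, a prediction with the right filename, the wrong page, and an answer sharing half of the combined characters with the reference would score 0.25 + 0.5 × 0.5 = 0.5 under these assumptions.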


# Key Points Breakdown


### **Input**
We are given three main resources:

1. **`financial_reports_database.zip`**  
   - A collection of real-world company financial report PDFs in mixed text–image format.  
   - Contains paragraphs, tables, and charts (bar, pie, line, etc.).  
   - **Only source of information** — no external knowledge allowed.

2. **`train.json`**  
   - A list of sample Q&A pairs for system development, training, and validation.  
   - Format example:  
     ```json
     [
         {
             "question": "根据图表显示，产品A的销售额在哪个季度开始下降？",
             "answer": "产品A的销售额在第三季度开始出现下降。",
             "filename": "2023年度第三季度财报.pdf",
             "page": 5
         },
         {
             "question": "...",
             "answer": "...",
             "filename": "...",
             "page": "..."
         }
     ]
     ```
   - English gloss of the first record: the question asks "According to the chart, in which quarter did Product A's sales start to decline?", the answer is "Product A's sales started to decline in the third quarter.", and the filename means "2023 Q3 financial report".

3. **`test.json`**  
   - Contains only `question` fields.  
   - We must **predict**: `answer`, `filename`, and `page`.
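
A small sketch for loading the Q&A files and holding out part of `train.json` for validation. The field names come from the format example above; the helper names, the 10% default split, and the integer check on `page` are assumptions for illustration.

```python
import json
import random

REQUIRED_FIELDS = {"question", "answer", "filename", "page"}


def load_records(path: str) -> list[dict]:
    """Load a JSON file that is expected to contain a list of records."""
    with open(path, encoding="utf-8") as f:
        records = json.load(f)
    if not isinstance(records, list):
        raise ValueError(f"{path} does not contain a JSON list")
    return records


def check_train_record(rec: dict) -> list[str]:
    """Return a list of problems found in one training record (empty if none)."""
    problems = [f"missing field: {k}" for k in sorted(REQUIRED_FIELDS - set(rec))]
    if "page" in rec and not isinstance(rec["page"], int):
        problems.append("page is not an integer")
    return problems


def split_train(records: list, val_fraction: float = 0.1, seed: int = 0):
    """Shuffle deterministically and return (train, validation) lists."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    n_val = max(1, int(len(shuffled) * val_fraction))
    return shuffled[n_val:], shuffled[:n_val]
```

The same `load_records` call works for `test.json`, whose records carry only the `question` field.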

---

### **Output**
- For each question in `test.json`, output:  
  - **answer** → predicted answer text  
  - **filename** → full PDF file name containing the answer  
  - **page** → page number containing the answer  

- Submission format: JSON list (see `sample_submit.json`).  
  Example:
  ```json
    [
        {
            "filename": "xx.pdf",
            "page": 1,
            "question": "广联达在苏中建设集团南宁龙湖春江天越项目中，具体运用了哪些BIM技术，并取得了哪些成果？",
            "answer": "广联达在苏中建设集团南宁龙湖春江天越项目中，具体运用了哪些BIM技术，并取得了哪些成果？"
        },
        {
            "filename": "xx.pdf",
            "page": 1,
            "question": "广联达公司如何通过数字项目管理平台提升施工企业的数字化转型能力？",
            "answer": "广联达公司如何通过数字项目管理平台提升施工企业的数字化转型能力？"
        },
    ……
    ]
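  ```
  In English, the two sample questions ask which BIM technologies 广联达 (Glodon) applied in Suzhong Construction Group's Nanning Longhu Chunjiang Tianyue project and what results were achieved, and how 广联达 uses its digital project management platform to strengthen construction companies' digital transformation. Each sample `answer` simply repeats its `question`, so these values appear to be placeholders.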
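
A minimal sketch for writing predictions in the layout shown above. The `predict` callback is hypothetical: it stands in for whatever system maps a question to a `(filename, page, answer)` triple. `ensure_ascii=False` keeps Chinese text readable in the output file instead of escaping it.

```python
import json


def write_submission(questions: list[str], predict, out_path: str) -> list[dict]:
    """Run predict on every question and write the results as a JSON list.

    `predict` is a hypothetical callable: question -> (filename, page, answer).
    """
    rows = []
    for question in questions:
        filename, page, answer = predict(question)
        rows.append({
            "filename": filename,
            "page": int(page),
            "question": question,
            "answer": answer,
        })
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(rows, f, ensure_ascii=False, indent=4)
    return rows
```

It is worth comparing the written file against `sample_submit.json` before uploading, since the exact validation rules of the platform are not documented here.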