[Question]: Best practices for a data-ware-house knowlege base?

### Self Checks

- [x] I have searched for existing issues [search for existing issues](https://github.com/infiniflow/ragflow/issues), including closed ones.
- [x] I confirm that I am using English to submit this report ([Language Policy](https://github.com/infiniflow/ragflow/issues/5910)).
- [x] Non-english title submitions will be closed directly ( 非英文标题的提交将会被直接关闭 ) ([Language Policy](https://github.com/infiniflow/ragflow/issues/5910)).
- [x] Please do not modify this template :) and fill in all the required fields.

### Describe your problem

After reading this: [Implementing Text2SQL with RAGFlow](https://ragflow.io/blog/implementing-text2sql-with-ragflow)

I'm dataware  manager, I want to upload all metadata(DDL, description, business description, dictionary, tables and columns relationships)  to RAGFlow knowlege base, and to achieve this：

- for IT, they can retrievel tables (via table name, table description, business info. etc..)，and ask LLM helping generate SQLs.

You can image, there will be a large number of DDLs. What is the best practices?

1. put all DDLs in one file or one DDL one file? （some DDLs have more than 300 columns）
2. why chunk token number is "8", shouldn't be a large number? ( can hold all columns）
3. compare with DDL and excel document, which one is better?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question]: Best practices for a data-ware-house knowlege base? #6117

Self Checks

Describe your problem

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Question]: Best practices for a data-ware-house knowlege base? #6117

Description

Self Checks

Describe your problem

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions