[Question]: Best practices for a data warehouse knowledge base? #6117

@baicl123

Self Checks

  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (Language Policy).
  • Non-English title submissions will be closed directly (Language Policy).
  • Please do not modify this template :) and fill in all the required fields.

Describe your problem

After reading this: Implementing Text2SQL with RAGFlow

I'm a data warehouse manager. I want to upload all of our metadata (DDL, descriptions, business descriptions, data dictionary, and table and column relationships) to a RAGFlow knowledge base in order to achieve the following:

  • IT staff can retrieve tables (by table name, table description, business information, etc.) and ask the LLM to help generate SQL.

As you can imagine, there will be a large number of DDLs. What are the best practices?

  1. Should I put all DDLs in one file, or use one file per DDL? (Some DDLs have more than 300 columns.)
  2. Why is the chunk token number "8"? Shouldn't it be a larger number, so that one chunk can hold all the columns of a table?
  3. Between DDL and an Excel document, which format is better?
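
To make the "one DDL per file" option in question 1 concrete, here is a minimal sketch of how a combined script could be split into one document per table before uploading. The function name `split_ddl` and the sample tables are hypothetical, and the naive split on `;` assumes there are no semicolons inside comments or string literals. Nothing here uses a RAGFlow API.

```python
import re

# Captures the (optionally schema-qualified) table name of a CREATE TABLE statement.
CREATE_TABLE = re.compile(
    r"CREATE\s+TABLE\s+(?:IF\s+NOT\s+EXISTS\s+)?([\w.]+)", re.IGNORECASE
)


def split_ddl(script: str) -> dict[str, str]:
    """Split a DDL script into {table_name: statement}, one entry per table.

    Assumption: statements are separated by ';' and no ';' appears inside
    comments or string literals.
    """
    tables = {}
    for statement in script.split(";"):
        statement = statement.strip()
        match = CREATE_TABLE.search(statement)
        if match:  # ignore anything that is not a CREATE TABLE statement
            tables[match.group(1)] = statement + ";"
    return tables


# Hypothetical sample input, for illustration only.
sample = """
CREATE TABLE sales.orders (order_id INT, customer_id INT);
CREATE TABLE sales.customers (customer_id INT, name VARCHAR(100));
"""
docs = split_ddl(sample)
```

Each value in `docs` could then be written to its own file, named after the table, so that a table with 300+ columns stays together as a single unit.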
