Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

新增extractDataFromText API #108

Closed
da730 opened this issue May 14, 2024 · 3 comments
Closed

新增extractDataFromText API #108

da730 opened this issue May 14, 2024 · 3 comments
Assignees
Labels
Milestone

Comments

@da730
Copy link
Contributor

da730 commented May 14, 2024

新增一个API:VMind.extractDataFromText,从文本中提取json格式的数据,并生成可以用于绘图的指令。
示例:

const text="此前《金融时报》曾报道,2023年小红书营收达到37亿美元,净利润达到5亿美元,相比2022年收入20亿美元、亏损2亿美元,小红书不仅首次实现盈利,而且营收更是上了一个台阶。这很大程度得归功于广告业务多元化、高效率,以及电商业务带来的增量。"

const {data,instruction}=vmind.extractDataFromText(text,options)
console.log(data)
/*[{"name":"个护美妆","adRevenuePercentage":18},{"name":"时尚穿搭","adRevenuePercentage":12.5},{"name":"美食饮品","adRevenuePercentage":10},{"name":"家居家装","adRevenuePercentage":8.7},{"name":"母婴","adRevenuePercentage":7},{"name":"黄金珠宝","adRevenuePercentage":6},{"name":"汽车","adRevenuePercentage":5.3},{"name":"日化百货","adRevenuePercentage":5.1}]*/
console.log(instruction)
//请绘制一个饼图,展示小红书广告收入在各个行业中的占比
@da730 da730 self-assigned this May 14, 2024
@da730 da730 added this to the Iteration 16 milestone May 14, 2024
@da730 da730 added good first issue Good for newcomers OSPP 2024 labels May 14, 2024
@da730 da730 changed the title add extract data from text api 新增e'x't'r'a'c't'D'a't'a'F'r'o'm'S'p'e'c May 14, 2024
@da730 da730 changed the title 新增e'x't'r'a'c't'D'a't'a'F'r'o'm'S'p'e'c 新增extractDataFromSpec API May 14, 2024
@da730 da730 changed the title 新增extractDataFromSpec API 新增extractDataFromText API May 14, 2024
@da730 da730 modified the milestones: Iteration 16, Iteration 17 May 22, 2024
@da730 da730 removed their assignment May 28, 2024
@xsy-kucha
Copy link
Collaborator

这个issue我可以认领吗?
这个接口我的理解是需要通过llm去解析构建数据让用户的输入更加自由,不用局限于csv的格式,而是自然语言的方式。
我比较好奇为什么不在输入里传入用户意图,而是llm自己生成一个可能的意图。觉得这里应该让用户输入意图和文本素材,让llm去组织数据。组织完数据再调用图表生成逻辑。在用户输入的意图为空时再让llm去生成意图指令。

@da730
Copy link
Contributor Author

da730 commented May 28, 2024

这个issue我可以认领吗? 这个接口我的理解是需要通过llm去解析构建数据让用户的输入更加自由,不用局限于csv的格式,而是自然语言的方式。 我比较好奇为什么不在输入里传入用户意图,而是llm自己生成一个可能的意图。觉得这里应该让用户输入意图和文本素材,让llm去组织数据。组织完数据再调用图表生成逻辑。在用户输入的意图为空时再让llm去生成意图指令。

可以的,那把接口改一下,传入一个可选的userPrompt,模型在提取数据时也可以使用userPrompt作为参考。

@da730
Copy link
Contributor Author

da730 commented May 28, 2024

@xsy-kucha 需要结合VMind现有的框架来做,抽空给你讲一下

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants