feat: add Fireworks AI component #237

namwoam · 2024-07-24T00:11:09Z

Because

We want to integrate the Fireworks AI client into our VDP pipeline platform.

This commit

Added the Fireworks AI Component, which supports the following tasks:
- (a) TASK_TEXT_GENERATION_CHAT
  Models: llama3.1 (405b, 70b, 8b), llama3 (70b, 8b), gemma2-9b, phi-3-vision-128K, deepseek-coder, qwen2 (72b) and more
- (b) TASK_TEXT_EMBEDDINGS
  Models: nomic-ai/nomic-embed, thenlper/gte, WhereIsAI/UAE

API Documentation: https://docs.fireworks.ai/api-reference/introduction
Pricing Documentation: https://fireworks.ai/pricing
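
For readers unfamiliar with the API, here is a minimal sketch of how a TASK_TEXT_GENERATION_CHAT call might be assembled. It is not the code in `ai/fireworksai/v0`. The endpoint URL, the model ID format, the JSON field names, and the types `chatMessage` and `chatRequest` are all assumptions based on Fireworks exposing an OpenAI-style chat completions API; check them against the API documentation linked above. The sketch only builds the request and does not send it.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
)

// chatMessage and chatRequest are hypothetical types for illustration only;
// they are not the types defined in this PR.
type chatMessage struct {
	Role    string `json:"role"`
	Content string `json:"content"`
}

type chatRequest struct {
	Model     string        `json:"model"`
	Messages  []chatMessage `json:"messages"`
	TopP      float64       `json:"top_p,omitempty"`
	MaxTokens int           `json:"max_tokens,omitempty"`
}

// chatEndpoint is an assumed URL for the chat completions endpoint.
const chatEndpoint = "https://api.fireworks.ai/inference/v1/chat/completions"

// newChatRequest serializes the body and attaches bearer-token auth.
// It does not perform any network I/O.
func newChatRequest(apiKey string, body chatRequest) (*http.Request, error) {
	payload, err := json.Marshal(body)
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, chatEndpoint, bytes.NewReader(payload))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Authorization", "Bearer "+apiKey)
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	req, err := newChatRequest("YOUR_API_KEY", chatRequest{
		// Assumed model ID format; the real identifier may differ.
		Model:    "accounts/fireworks/models/llama-v3p1-8b-instruct",
		Messages: []chatMessage{{Role: "user", Content: "Hello"}},
		TopP:     0.9,
	})
	if err != nil {
		panic(err)
	}
	fmt.Println(req.Method, req.URL.String())
}
```

Passing the returned request to `http.DefaultClient.Do` would send it. Building and sending are kept separate here so the request construction can be tested without a network call.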

ai/fireworksai/v0/main.go

namwoam · 2024-07-24T00:21:35Z

Above is the performance of llama3-70b-it across different providers. This sector is extremely competitive, and I am not sure whether we plan to support all of the providers.

codecov · 2024-07-28T02:12:10Z

Codecov Report

Attention: Patch coverage is 55.80110% with 80 lines in your changes missing coverage. Please review.

Project coverage is 37.15%. Comparing base (4aeae69) to head (21e4fb7).
Report is 20 commits behind head on main.

| Files | Patch % | Lines |
|---|---|---|
| ai/fireworksai/v0/tasks.go | 55.67% | 38 Missing and 5 partials ⚠️ |
| ai/fireworksai/v0/client.go | 36.00% | 16 Missing ⚠️ |
| ai/fireworksai/v0/main.go | 71.69% | 13 Missing and 2 partials ⚠️ |
| store/store.go | 0.00% | 6 Missing ⚠️ |

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #237      +/-   ##
==========================================
- Coverage   40.70%   37.15%   -3.55%     
==========================================
  Files         121      153      +32     
  Lines       11842    19404    +7562     
==========================================
+ Hits         4820     7210    +2390     
- Misses       6402    11138    +4736     
- Partials      620     1056     +436

| Flag | Coverage Δ |
|---|---|
| unittests | `37.15% <55.80%> (-3.55%)` ⬇️ |
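
As a sanity check on the report above, the following sketch recomputes the headline numbers from the raw counts in the coverage diff and the per-file table. The helper names `coveragePercent` and `sum` are illustrative and are not part of Codecov or of this repository. The results should agree with the reported percentages up to how the last digit is rounded.

```go
package main

import "fmt"

// coveragePercent returns hits as a percentage of total lines.
func coveragePercent(hits, lines int) float64 {
	return 100 * float64(hits) / float64(lines)
}

// uncoveredPerFile lists missing plus partial lines for each file in the
// report: tasks.go, client.go, main.go, store.go.
var uncoveredPerFile = []int{38 + 5, 16, 13 + 2, 6}

// sum adds up a slice of ints.
func sum(xs []int) int {
	total := 0
	for _, x := range xs {
		total += x
	}
	return total
}

func main() {
	// Hits and Lines are taken from the coverage diff (base, then head).
	fmt.Printf("base coverage: %.4f%%\n", coveragePercent(4820, 11842))
	fmt.Printf("head coverage: %.4f%%\n", coveragePercent(7210, 19404))
	// The report states 80 lines are missing coverage in the patch.
	fmt.Println("uncovered patch lines:", sum(uncoveredPerFile))
}
```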


chuang8511

lgtm.

chuang8511 · 2024-08-08T18:04:51Z

@namwoam
By the way, does Fireworks have an official tokenizer that we can use?
It would be used in the Text component's ChunkTask.

namwoam · 2024-08-08T22:22:18Z

All their models are open source, so I guess the tokenizers could be imported from Hugging Face?

chuang8511 · 2024-08-09T16:17:42Z

@namwoam
Could you help me check the pricing?
I am not sure of these models' parameter counts.

Remaining models:

deepseek-coder-v2-instruct
deepseek-coder-v2-lite-instruct
phi-3-vision-128k-instruct
nomic-ai/nomic-embed-text-v1.5
nomic-ai/nomic-embed-text-v1
WhereIsAI/UAE-Large-V1
thenlper/gte-large
thenlper/gte-base

namwoam · 2024-08-09T23:09:36Z

Hello, the parameter counts are as follows:

Text models:

deepseek-coder-v2-instruct: 236b MoE
deepseek-coder-v2-lite-instruct: 16b MoE (source: huggingface)
phi-3-vision-128k-instruct: 4.2b (source: phi3 technical report, section 6.1)

Embedding models:

nomic-embed-text-v1.5: 137m (source: huggingface)
nomic-embed-text-v1: 137m (source: huggingface)
UAE-Large-V1: 335m (source: huggingface)
gte-large: 335m (source: huggingface)
gte-base: 109m (source: huggingface)

The pricing for embedding models:

chuang8511 · 2024-08-12T14:28:31Z

@namwoam
I still cannot determine which price is correct, so I will take the unmatched models out of Instill Credit for now.
Please help us check it again. Thank you.

deepseek-coder-v2-instruct: 236b MoE

✅ deepseek-coder-v2-lite-instruct: 16b MoE (source: huggingface) -> 0.5
✅ phi-3-vision-128k-instruct: 4.2b (source: phi3 technical report, section 6.1) -> 0.2
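
One reading of the comment above is that only models with a confirmed price stay eligible for Instill Credit, while the rest are excluded until their price is verified. The sketch below illustrates that rule. It is hypothetical and is not the code in this repository: `confirmedPrices` and `creditPrice` are names made up for illustration, and the unit of the prices is not stated in the thread, so it is left unspecified.

```go
package main

import "fmt"

// confirmedPrices holds only the two values marked as confirmed in this
// thread. The unit is not stated in the thread and is left unspecified.
var confirmedPrices = map[string]float64{
	"deepseek-coder-v2-lite-instruct": 0.5,
	"phi-3-vision-128k-instruct":      0.2,
}

// creditPrice is a hypothetical helper. It reports the confirmed price for
// a model, or ok == false when no price has been confirmed, in which case
// the caller would exclude the model from credit-based billing.
func creditPrice(model string) (price float64, ok bool) {
	price, ok = confirmedPrices[model]
	return price, ok
}

func main() {
	models := []string{
		"deepseek-coder-v2-lite-instruct",
		"phi-3-vision-128k-instruct",
		"deepseek-coder-v2-instruct", // no confirmed price in the thread
	}
	for _, m := range models {
		if p, ok := creditPrice(m); ok {
			fmt.Printf("%s: %.1f\n", m, p)
		} else {
			fmt.Printf("%s: price unconfirmed, excluded from credit\n", m)
		}
	}
}
```

Using an explicit allowlist rather than a parameter-count formula means a model with an unverified price can never be billed by accident.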

namwoam · 2024-08-12T19:54:20Z

Ok, no problem.

🤖 I have created a release *beep* *boop*

---

## [0.25.0-beta](v0.24.0-beta...v0.25.0-beta) (2024-08-13)

### Features

* add a hook to avoid we miss make document ([#244](#244)) ([4c4531d](4c4531d))
* add elasticsearch component ([#211](#211)) ([eb492ca](eb492ca))
* add Fireworks AI component ([#237](#237)) ([0c40652](0c40652))
* add Groq component ([#269](#269)) ([1401220](1401220))
* add mongodb component ([#198](#198)) ([2cb550f](2cb550f))
* add qdrant component ([#271](#271)) ([bd2b9e6](bd2b9e6))
* add weaviate component ([#246](#246)) ([cb3e667](cb3e667))
* add WhatsApp component ([#226](#226)) ([28d0de8](28d0de8))
* **artifact:** add artifact component ([#268](#268)) ([dabf472](dabf472))
* **artifact:** add artifact component ([#275](#275)) ([15fc0d2](15fc0d2))
* **document:** integrate pdf2md in document operator ([#277](#277)) ([07360d1](07360d1))
* **groq, fireworksai:** take out the unsupported models from instill credit ([#283](#283)) ([8978acd](8978acd))
* make component ID accessible on IExecution ([#257](#257)) ([dd63656](dd63656))
* **openai:** support `gpt-4o-2024-08-06` and structured output ([#280](#280)) ([8bdaef7](8bdaef7))
* **sql:** add TASK_INSERT_MANY and fix sql query validation ([#252](#252)) ([3a93cea](3a93cea))
* **text:** add tokenizer for cohere & new gpt-4o ([#276](#276)) ([5d8cec3](5d8cec3))
* **text:** revert "add tokenizer for cohere & new gpt-4o ([#276](#276))" ([910a330](910a330))

### Bug Fixes

* **artifact:** add the description to remind users to add file extension ([#281](#281)) ([5ff5d7a](5ff5d7a))
* ignore bold case and add all line to result ([#272](#272)) ([219c77e](219c77e))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

namwoam requested review from donch1989, pinglin, xiaofei-du and jvallesm as code owners July 24, 2024 00:11

namwoam marked this pull request as draft July 24, 2024 00:11

droplet-bot added the instill component label Jul 24, 2024

namwoam changed the title ~~Namwoam/fireworks~~ feat: add Fireworks AI component Jul 24, 2024

github-advanced-security bot found potential problems Jul 24, 2024

ai/fireworksai/v0/main.go (alert dismissed)

namwoam marked this pull request as ready for review July 28, 2024 02:26

namwoam requested a review from GeorgeWilliamStrong as a code owner July 28, 2024 02:26

chuang8511 previously approved these changes Aug 8, 2024

namwoam dismissed chuang8511’s stale review via e19a0ac August 9, 2024 23:17

namwoam added 9 commits August 10, 2024 00:24

add: add config files

e64c7c8

add: implement TASK_TEXT_GENERATION_CHAT and TASK_TEXT_EMBEDDINGS

8e389fa

add: add topP option

77ae72d

migrate: migrate from text model to vision text model

6379bfe

dev: working on tests

d6e7954

fix: migrate to model and added testcase

dd53c40

fix: fix documentation

c498324

fix: fix model list

09ca941

chore: add Yi-large

27b3a0a

namwoam force-pushed the namwoam/fireworks branch from e19a0ac to 27b3a0a Compare August 9, 2024 23:38

namwoam added 2 commits August 10, 2024 00:38

chore: migrate to new component architecture

21e4fb7

add: add icon

e71f929

Merge branch 'main' into namwoam/fireworks

8db61c2

donch1989 merged commit 0c40652 into instill-ai:main Aug 11, 2024
6 checks passed

droplet-bot mentioned this pull request Aug 11, 2024

chore(main): release 0.25.0-beta #266

Merged
