feat(llm): support multiple property_type & importing graph from the entire doc by vichayturen · Pull Request #84 · apache/hugegraph-ai

vichayturen · 2024-09-22T16:07:36Z

Done:

Fixed a bug in the qianfan LLM API config
Added two enums for property settings
Support different property types
Support list/set cardinality for property_key
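
As a rough illustration of the items above, here is a minimal sketch of two enums for property settings plus a helper that coerces an extracted value to a declared data type and cardinality. The names `PropertyDataType`, `PropertyCardinality`, `coerce_property` and the `%Y-%m-%d` date format are assumptions made for this example; they are not taken from the PR's code, and only a subset of data types is shown.

```python
from datetime import datetime
from enum import Enum
from typing import Any


class PropertyCardinality(str, Enum):
    """How many values a property key may hold (hypothetical enum name)."""
    SINGLE = "SINGLE"
    LIST = "LIST"
    SET = "SET"


class PropertyDataType(str, Enum):
    """A subset of property data types (hypothetical enum name)."""
    TEXT = "TEXT"
    INT = "INT"
    DOUBLE = "DOUBLE"
    DATE = "DATE"


# Assumed date format for this sketch; the PR does not state which one it checks.
DATE_FORMAT = "%Y-%m-%d"


def _coerce_one(value: Any, data_type: PropertyDataType) -> Any:
    """Convert a single raw value to the declared data type."""
    if data_type is PropertyDataType.INT:
        return int(value)
    if data_type is PropertyDataType.DOUBLE:
        return float(value)
    if data_type is PropertyDataType.DATE:
        # strptime raises ValueError when the string does not match the format.
        datetime.strptime(str(value), DATE_FORMAT)
        return str(value)
    return str(value)


def coerce_property(value: Any, data_type: PropertyDataType,
                    cardinality: PropertyCardinality) -> Any:
    """Coerce a raw value so it matches both data type and cardinality."""
    if cardinality is PropertyCardinality.SINGLE:
        return _coerce_one(value, data_type)
    # LIST and SET both accept a bare scalar and wrap it in a list.
    items = value if isinstance(value, (list, tuple, set)) else [value]
    converted = [_coerce_one(v, data_type) for v in items]
    if cardinality is PropertyCardinality.SET:
        # Drop duplicates while keeping first-seen order.
        converted = list(dict.fromkeys(converted))
    return converted
```

Subclassing `str` lets the enum members be built directly from schema strings, for example `PropertyDataType("INT")`, which is one possible way to validate user-provided schema JSON early.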

TODO:

Enhance the chunk-splitting logic for vector & graph (maybe we should split them separately)

hugegraph-llm/src/hugegraph_llm/operators/hugegraph_op/schema_manager.py

hugegraph-llm/src/hugegraph_llm/utils/graph_index_utils.py

hugegraph-llm/src/hugegraph_llm/demo/rag_demo/configs_block.py

simon824 · 2024-10-09T11:51:48Z

hugegraph-llm/src/hugegraph_llm/operators/hugegraph_op/commit_to_hugegraph.py

-            for prop in properties:
-                self.schema.propertyKey(prop).asText().ifNotExist().create()
+            # for prop in properties:
+            #     self.schema.propertyKey(prop).asText().ifNotExist().create()


Why not remove these lines directly?

simon824 · 2024-10-09T11:51:56Z

hugegraph-llm/src/hugegraph_llm/operators/hugegraph_op/commit_to_hugegraph.py

-            for prop in properties:
-                self.schema.propertyKey(prop).asText().ifNotExist().create()
+            # for prop in properties:
+            #     self.schema.propertyKey(prop).asText().ifNotExist().create()


simon824 · 2024-10-09T11:55:14Z

hugegraph-llm/src/hugegraph_llm/operators/kg_construction_task.py

-            language: Literal["zh", "en"] = "zh"
+        self,
+        text: Union[str, List[str]],  # text to be split
+        split_type: Literal["document", "paragraph", "sentence"] = "document",


Why is the default value changed to "document"? Does it give better results?

We'll handle it in the next PR (the split logic needs refactoring to give better results).
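
For readers unfamiliar with the three `split_type` options in the diff above, the following is a minimal sketch of what each granularity could mean. It is not the project's actual splitter: the function name `split_text` and the regular expressions are assumptions chosen for illustration.

```python
import re
from typing import List, Literal, Union

# Assumed patterns for this sketch only.
PARAGRAPH_PATTERN = r"\n\s*\n"                       # blank line between paragraphs
SENTENCE_PATTERN = r"(?<=[.!?])\s+|(?<=[。！？])"      # Latin or CJK sentence enders


def split_text(
    text: Union[str, List[str]],
    split_type: Literal["document", "paragraph", "sentence"] = "document",
) -> List[str]:
    """Split one or more documents into chunks at the requested granularity."""
    docs = [text] if isinstance(text, str) else list(text)
    if split_type == "document":
        # Each input document stays a single chunk.
        return [d for d in docs if d.strip()]
    if split_type == "paragraph":
        pattern = PARAGRAPH_PATTERN
    elif split_type == "sentence":
        pattern = SENTENCE_PATTERN
    else:
        raise ValueError(f"unsupported split_type: {split_type!r}")
    chunks: List[str] = []
    for doc in docs:
        # re.split may yield empty pieces; keep only non-blank ones.
        chunks.extend(p.strip() for p in re.split(pattern, doc) if p.strip())
    return chunks
```

With "document" as the default, the whole text is passed on as one chunk, which is consistent with this PR's goal of importing the graph from the entire document; finer granularities trade context for smaller inputs.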

simon824 · 2024-10-09T11:58:14Z

hugegraph-llm/src/hugegraph_llm/utils/graph_index_utils.py

+        print(context)
        return json.dumps(context, ensure_ascii=False, indent=2)
-    except Exception as e: # pylint: disable=W0718
+    except Exception as e:  # pylint: disable=W0718


Is this configuration universal? Does it need to be added to pylint.conf?

Probably not needed for now.

simon824 · 2024-10-09T11:59:08Z

hugegraph-llm/src/hugegraph_llm/models/llms/openai.py

        """Get max-allowed token length"""
        # TODO: list all models and their max tokens from api
-        return 2049
+        return 8192


Why increase to 8192?

2049 is outdated (it dates from 2023).

BTW, this method is not used now (we may remove it later).

imbajin

Merge it to avoid blocking other PRs (handle chunk split logic in another PR later) @vichayturen

feat: Modify the method of importing graph by extracting from the entire document

32d772f

dosubot bot added the size:L (this PR changes 100-499 lines, ignoring generated files) label Sep 22, 2024

github-actions bot added the llm label Sep 22, 2024

dosubot bot added the bug (something isn't working) and enhancement (new feature or request) labels Sep 22, 2024

vichayturen and others added 5 commits September 24, 2024 22:34

fix: remove 'llm' object in json response

5bc958c

fix: remove 'llm' object in context transferring

4da5144

Merge branch 'main' into main0920

d15a58a

Merge branch 'main' into main0920

6b185e4

TODO: only fetch vid rather than all vertex data

b2710da

imbajin mentioned this pull request Sep 28, 2024

refactor(llm): enhance a string of graph query method #89

Merged

10 tasks

imbajin and others added 2 commits September 29, 2024 00:47

Merge branch 'main' into main0920

4e10a38

feat: Support multiple property data types

af471ad

vichayturen changed the title ~~feat: Modify the method of importing graph by extracting from the entire document~~ feat: Supported multiple property data types and modified the method of importing graph by extracting from the entire document Sep 29, 2024

vichayturen and others added 2 commits October 9, 2024 14:55

feat: Support multiple property cardinality

52cfa87

refactor: separate logic for CheckSchema

442c57d

imbajin force-pushed the main0920 branch from cf8f85e to 442c57d Compare October 9, 2024 10:12

refactor: clean ChunkSplit & use lambda to replace annoy method

4fcf2d1

dosubot bot added the size:XL (this PR changes 500-999 lines, ignoring generated files) label and removed the size:L (this PR changes 100-499 lines, ignoring generated files) label Oct 9, 2024

refactor: separate logic for Commit2Graph

014af6e

imbajin reviewed Oct 9, 2024

hugegraph-llm/src/hugegraph_llm/operators/hugegraph_op/schema_manager.py (outdated, resolved)

hugegraph-llm/src/hugegraph_llm/utils/graph_index_utils.py (resolved)

simon824 reviewed Oct 9, 2024

vichayturen and others added 2 commits October 10, 2024 00:44

fix: allow nullable keys to be empty, and replace VertexData with VertexData.id while inserting vertex

a72f1dd

fix: check Date format & enhance Enum usage

e27b341

imbajin force-pushed the main0920 branch from 6a7b626 to e27b341 Compare October 10, 2024 08:15

imbajin approved these changes Oct 10, 2024

imbajin changed the title ~~feat: Supported multiple property data types and modified the method of importing graph by extracting from the entire document~~ feat(llm): support multiple property_type & importing graph from the entire doc Oct 10, 2024

imbajin merged commit f7fc02f into apache:main Oct 10, 2024

imbajin merged 14 commits into apache:main from vichayturen:main0920

3 participants
