feat(llm): support multiple property_type & importing graph from the entire doc#84
feat(llm): support multiple property_type & importing graph from the entire doc#84imbajin merged 14 commits intoapache:mainfrom
Conversation
hugegraph-llm/src/hugegraph_llm/operators/hugegraph_op/schema_manager.py
Outdated
Show resolved
Hide resolved
| for prop in properties: | ||
| self.schema.propertyKey(prop).asText().ifNotExist().create() | ||
| # for prop in properties: | ||
| # self.schema.propertyKey(prop).asText().ifNotExist().create() |
| for prop in properties: | ||
| self.schema.propertyKey(prop).asText().ifNotExist().create() | ||
| # for prop in properties: | ||
| # self.schema.propertyKey(prop).asText().ifNotExist().create() |
| language: Literal["zh", "en"] = "zh" | ||
| self, | ||
| text: Union[str, List[str]], # text to be split | ||
| split_type: Literal["document", "paragraph", "sentence"] = "document", |
There was a problem hiding this comment.
Why is the default value changed to document, does it have a better effect?
There was a problem hiding this comment.
Why is the default value changed to
document, does it have a better effect?
handle it in next PR (we need refactor the split logic for better effect)
| print(context) | ||
| return json.dumps(context, ensure_ascii=False, indent=2) | ||
| except Exception as e: # pylint: disable=W0718 | ||
| except Exception as e: # pylint: disable=W0718 |
There was a problem hiding this comment.
Is this configuration universal? Does it need to be added to pylint.conf?
There was a problem hiding this comment.
Is this configuration universal? Does it need to be added to
pylint.conf?
No need for now maybe
| """Get max-allowed token length""" | ||
| # TODO: list all models and their max tokens from api | ||
| return 2049 | ||
| return 8192 |
There was a problem hiding this comment.
why increase to 8192?
2049 is outdated (in 2023y)
BTW, this method is not used now (may remove it later)
…texData.id while inserting vertex
There was a problem hiding this comment.
Merge it to avoid blocking other PRs (handle chunk split logic in another PR later) @vichayturen
Done:
TODO: