Discussed in #1599
Originally posted by ajain85 January 9, 2025
HI All, I have running graphrag cli for update command for blob but getting below error , I am using azure blob and azure search AI services to save parquet file in blob and update indexing in search ai. but getting below error , Can anyone suggest me the solution does it graphrag library error or anything I am missing in setting .yml file .
Error->
ValueError: Incremental Indexing Error: No new documents to process.
this is how I have updated my update file setting
update_index_storage:
type: "blob" # or blob
connection_string: ""
container_name: "graphrag"
base_dir: "output"
storage_account_blob_url: "https://*.blob.core.windows.net/"
error_msg = 'Incremental Indexing Error: No new documents to process.' │ │
│ │ is_update_run = True │ │
│ │ logger = <graphrag.logger.rich_progress.RichProgressLogger object at 0x000001E99D45B2D0> │ │
│ │ progress_logger = <graphrag.logger.rich_progress.RichProgressLogger object at 0x000001E99D45B2D0> │ │
│ │ root_dir = 'C:\Users\JAINAB\UNHCR Workspace\test_graphrag\cligraphrag' │ │
│ │ run_id = '20250109-135013' │ │
│ │ storage = <graphrag.storage.blob_pipeline_storage.BlobPipelineStorage object at │ │
│ │ 0x000001E99D472250> │ │
│ │ storage_config = { │ │
│ │ │ 'type': "blob", │ │
│ │ │ 'base_dir': 'output', │ │
│ │ │ 'connection_string': │ │
│ │ 'DefaultEndpointsProtocol=https;AccountName=d1hcrstgenaisharedxfc;AccountKey=K9Ya'… │ │
│ │ │ 'container_name': 'graphrag', │ │
│ │ │ 'storage_account_blob_url': │ │
│ │ 'https://d1hcrstgenaisharedxfc.blob.core.windows.net/', │ │
│ │ │ 'cosmosdb_account_url': None │ │
│ │ } │ │
│ │ update_index_storage = <graphrag.storage.file_pipeline_storage.FilePipelineStorage object at │ │
│ │ 0x000001E99F017710> │ │
│ │ update_storage_config = { │ │
│ │ │ 'type': "file", │ │
│ │ │ 'base_dir': 'C:\Users\JAINAB\UNHCR │ │
│ │ Workspace\test_graphrag\cligraphrag\update_output', │ │
│ │ │ 'connection_string': None, │ │
│ │ │ 'container_name': None, │ │
│ │ │ 'storage_account_blob_url': None, │ │
│ │ │ 'cosmosdb_account_url': None │ │
│ │ } │ │
│ │ workflows = [ │ │
│ │ │ 'create_base_text_units', │ │
│ │ │ 'create_final_documents', │ │
│ │ │ 'extract_graph', │ │
│ │ │ 'compute_communities', │ │
│ │ │ 'create_final_entities', │ │
│ │ │ 'create_final_relationships', │ │
│ │ │ 'create_final_nodes', │ │
│ │ │ 'create_final_communities', │ ││ │ │ 'create_final_text_units', │ ││ │ │ 'create_final_community_reports', │ ││ │ │ ... +1 │ ││ │ ] │ ││ ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ │╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ValueError: Incremental Indexing Error: No new documents to process.
Discussed in #1599
Originally posted by ajain85 January 9, 2025
HI All, I have running graphrag cli for update command for blob but getting below error , I am using azure blob and azure search AI services to save parquet file in blob and update indexing in search ai. but getting below error , Can anyone suggest me the solution does it graphrag library error or anything I am missing in setting .yml file .
Error->
ValueError: Incremental Indexing Error: No new documents to process.
this is how I have updated my update file setting
update_index_storage:
type: "blob" # or blob
connection_string: ""
container_name: "graphrag"
base_dir: "output"
storage_account_blob_url: "https://*.blob.core.windows.net/"
error_msg = 'Incremental Indexing Error: No new documents to process.' │ │
│ │ is_update_run = True │ │
│ │ logger = <graphrag.logger.rich_progress.RichProgressLogger object at 0x000001E99D45B2D0> │ │
│ │ progress_logger = <graphrag.logger.rich_progress.RichProgressLogger object at 0x000001E99D45B2D0> │ │
│ │ root_dir = 'C:\Users\JAINAB\UNHCR Workspace\test_graphrag\cligraphrag' │ │
│ │ run_id = '20250109-135013' │ │
│ │ storage = <graphrag.storage.blob_pipeline_storage.BlobPipelineStorage object at │ │
│ │ 0x000001E99D472250> │ │
│ │ storage_config = { │ │
│ │ │ 'type': "blob", │ │
│ │ │ 'base_dir': 'output', │ │
│ │ │ 'connection_string': │ │
│ │ 'DefaultEndpointsProtocol=https;AccountName=d1hcrstgenaisharedxfc;AccountKey=K9Ya'… │ │
│ │ │ 'container_name': 'graphrag', │ │
│ │ │ 'storage_account_blob_url': │ │
│ │ 'https://d1hcrstgenaisharedxfc.blob.core.windows.net/', │ │
│ │ │ 'cosmosdb_account_url': None │ │
│ │ } │ │
│ │ update_index_storage = <graphrag.storage.file_pipeline_storage.FilePipelineStorage object at │ │
│ │ 0x000001E99F017710> │ │
│ │ update_storage_config = { │ │
│ │ │ 'type': "file", │ │
│ │ │ 'base_dir': 'C:\Users\JAINAB\UNHCR │ │
│ │ Workspace\test_graphrag\cligraphrag\update_output', │ │
│ │ │ 'connection_string': None, │ │
│ │ │ 'container_name': None, │ │
│ │ │ 'storage_account_blob_url': None, │ │
│ │ │ 'cosmosdb_account_url': None │ │
│ │ } │ │
│ │ workflows = [ │ │
│ │ │ 'create_base_text_units', │ │
│ │ │ 'create_final_documents', │ │
│ │ │ 'extract_graph', │ │
│ │ │ 'compute_communities', │ │
│ │ │ 'create_final_entities', │ │
│ │ │ 'create_final_relationships', │ │
│ │ │ 'create_final_nodes', │ │
│ │ │ 'create_final_communities', │ ││ │ │ 'create_final_text_units', │ ││ │ │ 'create_final_community_reports', │ ││ │ │ ... +1 │ ││ │ ] │ ││ ╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ │╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯ValueError: Incremental Indexing Error: No new documents to process.