Skip to content

[Bug]: Updating index twice will result in a ValueError #1822

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
3 tasks done
masies opened this issue Mar 18, 2025 · 1 comment
Open
3 tasks done

[Bug]: Updating index twice will result in a ValueError #1822

masies opened this issue Mar 18, 2025 · 1 comment
Labels
bug Something isn't working triage Default label assignment, indicates new issue needs reviewed by a maintainer

Comments

@masies
Copy link

masies commented Mar 18, 2025

Do you need to file an issue?

  • I have searched the existing issues and this bug is not already filed.
  • My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • I believe this is a legitimate bug, not just a question. If this is a question, please use the Discussions area.

Describe the bug

Running update_output twice throws
ValueError: Could not find update_output/YYYYMMDD-HHMMSS/delta/communities.parquet in storage!

this issue is related to here

key = self._keyname(key) becomes (in my case) 'output\update_output\YYYYMMDD-HHMMSS\delta\communities.parquet'
I guess this is because of the logic from the previous version

Steps to reproduce

  1. init the graphrag root dir,
  2. create the index
  3. add a file and update the index
  4. add another file and update the index

Expected Behavior

it should update normally the index once again, with both documents added in step 3 and 4 present in the index

GraphRAG Config Used

...
update_index_output:
    type: blob 
    provider: azure
    storage_account_blob_url: ${BLOB_STORAGE_URL}
    container_name: ${ROOT_DIR}
    base_dir: "update_output"
...

Logs and screenshots

No response

Additional Information

  • GraphRAG Version: 2.1.0
  • Operating System: win 11
  • Python Version: 3.14
  • Related Issues:
@masies masies added bug Something isn't working triage Default label assignment, indicates new issue needs reviewed by a maintainer labels Mar 18, 2025
@xiao-hf
Copy link

xiao-hf commented Mar 25, 2025

11

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working triage Default label assignment, indicates new issue needs reviewed by a maintainer
Projects
None yet
Development

No branches or pull requests

2 participants