Create TaskFlow API to use WeaviateHook and use taskflow API for ask-astro-load.py #132

Merged
merged 27 commits into from
Nov 23, 2023

@sunank200 (Collaborator) commented Nov 16, 2023

  • I based my implementation on the Airflow docs ingestion work, hence the commits from Michael.
  • Create AskAstroWeaviateHook, which inherits from the OSS WeaviateHook.
  • Add get_schema, check_schema, create_schema, handle_upsert_rollback and ingest_data.
  • Use the TaskFlow API for ask-astro-load.py.
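
As a rough illustration of how the hook methods named above might fit together, here is a minimal pure-Python sketch. Only the method names (get_schema, check_schema, create_schema, handle_upsert_rollback, ingest_data) come from this PR description. The signatures, the method bodies, the `SketchHook` and `InMemoryStore` classes, and the `uuid_key` parameter are all assumptions made for illustration. The sketch deliberately avoids the real Weaviate client and the real WeaviateHook so that it runs on its own, and it should not be read as the code merged in this PR.

```python
from __future__ import annotations

import copy


class InMemoryStore:
    """Hypothetical stand-in for a vector store; NOT the Weaviate client API."""

    def __init__(self) -> None:
        self.classes: dict[str, dict] = {}  # class name -> class definition
        self.objects: dict[str, dict[str, dict]] = {}  # class name -> {uuid: row}


class SketchHook:
    """Method names follow the PR description; signatures and bodies are assumed."""

    def __init__(self, store: InMemoryStore) -> None:
        self.store = store

    def get_schema(self) -> list[dict]:
        """Return the class definitions currently registered in the store."""
        return list(self.store.classes.values())

    def check_schema(self, class_objects: list[dict]) -> bool:
        """True only if every expected class exists with an identical definition."""
        existing = {c["class"]: c for c in self.get_schema()}
        return all(existing.get(c["class"]) == c for c in class_objects)

    def create_schema(self, class_objects: list[dict]) -> None:
        """Register each class and give it an empty object collection."""
        for c in class_objects:
            self.store.classes[c["class"]] = copy.deepcopy(c)
            self.store.objects.setdefault(c["class"], {})

    def handle_upsert_rollback(self, class_name: str, snapshot: dict[str, dict]) -> None:
        """Restore the objects of one class to a previously taken snapshot."""
        self.store.objects[class_name] = snapshot

    def ingest_data(self, class_name: str, rows: list[dict], uuid_key: str = "id") -> int:
        """Upsert rows keyed by uuid_key; undo the whole batch if any row fails."""
        if class_name not in self.store.classes:
            raise ValueError(f"class {class_name!r} is not in the schema")
        snapshot = copy.deepcopy(self.store.objects[class_name])
        try:
            for row in rows:
                # A row without the key raises KeyError and triggers the rollback.
                self.store.objects[class_name][row[uuid_key]] = dict(row)
        except KeyError:
            self.handle_upsert_rollback(class_name, snapshot)
            raise
        return len(rows)


# Usage: create the schema only when it is missing or different, then ingest.
store = InMemoryStore()
hook = SketchHook(store)
schema = [{"class": "Docs", "properties": [{"name": "content", "dataType": ["text"]}]}]
if not hook.check_schema(schema):
    hook.create_schema(schema)
hook.ingest_data("Docs", [{"id": "doc-1", "content": "hello"}])
```

In this sketch the rollback is all-or-nothing per batch: a snapshot is taken before the upsert and restored if any row fails. That is one possible design among several; a real hook would more likely track the identifiers it wrote and delete only those.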

closes: #134

cloudflare-pages bot commented Nov 17, 2023

Deploying with Cloudflare Pages

Latest commit: 93ea3d8
Status: ✅  Deploy successful!
Preview URL: https://3db4b985.ask-astro.pages.dev
Branch Preview URL: https://oss-weaviate-integration.ask-astro.pages.dev

airflow/Dockerfile (outdated, resolved)
airflow/README.md (outdated, resolved)
airflow/requirements.txt (outdated, resolved)
airflow/requirements.txt (outdated, resolved)
@sunank200 marked this pull request as ready for review November 20, 2023 11:20
@sunank200 changed the title from "Create TaskFlow API to use WeaviateHook." to "Create TaskFlow API to use WeaviateHook and use taskflow API for ask-astro-load.py" Nov 20, 2023
airflow/dags/ingestion/ask-astro-load-airflow-docs.py (outdated, resolved)
airflow/dags/ingestion/ask-astro-load-airflow-docs.py (outdated, resolved)
airflow/dags/ingestion/ask-astro-load.py (outdated, resolved)
airflow/include/tasks/extract/github.py (outdated, resolved)
airflow/include/tasks/extract/github.py (outdated, resolved)
airflow/include/tasks/extract/stack_overflow.py (outdated, resolved)
airflow/include/tasks/extract/stack_overflow.py (outdated, resolved)
airflow/include/tasks/extract/utils/html_helpers.py (outdated, resolved)
@mpgreg mentioned this pull request Nov 21, 2023
11 tasks

@mpgreg (Contributor) left a comment

Some logic errors but mostly I'm concerned about readability and generalizability. For the sake of the reference implementation we want to be very clear about DAG/task code and what should (eventually) go into the hook.

airflow/dags/ingestion/ask-astro-load.py (outdated, resolved)
airflow/include/data/schema.json (outdated, resolved)
airflow/include/tasks/extract/stack_overflow.py (outdated, resolved)
airflow/include/tasks/ingest.py (resolved)
airflow/dags/ingestion/ask-astro-load.py (outdated, resolved)
airflow/dags/ingestion/ask-astro-load.py (resolved)
airflow/dags/ingestion/ask-astro-load.py (resolved)
@sunank200 force-pushed the oss_weaviate_integration branch 4 times, most recently from 57af49b to 64d97f0 November 21, 2023 21:35
@sunank200 (Collaborator, Author) commented

> Some logic errors but mostly I'm concerned about readability and generalizability. For the sake of the reference implementation we want to be very clear about DAG/task code and what should (eventually) go into the hook.

@mpgreg I have addressed most of your comments; please check. Also, just to be clear: this hook is not a drop-in replacement for the LLM providers. It would require contextual changes before it could be added to the LLM provider.

@Lee-W (Collaborator) left a comment

I only have some nitpicks. The other parts look good to me.

@sunank200 force-pushed the oss_weaviate_integration branch 2 times, most recently from d15958d to c036d67 November 23, 2023 08:08
- Implement AskAstroWeaviateHook, which inherits from the OSS WeaviateHook.
- Use AskAstroWeaviateHook with the TaskFlow API in [ask-astro-load-airflow-docs.py](https://github.com/astronomer/ask-astro/blob/45ce6543d044a977d97e9314e443f629604e84a9/airflow/dags/ingestion/ask-astro-load-airflow-docs.py)


closes: #141 

Note: Merge this PR before merging [#132](#132).

@mpgreg (Contributor) left a comment

Looks great @sunank200. Just a couple of small suggestions.

@sunank200 merged commit 1067794 into main Nov 23, 2023
7 checks passed
@sunank200 deleted the oss_weaviate_integration branch November 23, 2023 11:24
Development

Successfully merging this pull request may close these issues.

Create AskAstroWeaviateHook which uses OSS Weaviate Hook in ask-astro-load.py
3 participants