docs: add Azure AI Search + Azure OpenAI RAG recipe notebook #675
base: main
Conversation
Merge Protections: Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit — Wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
@vagenas @pablocastro can you review :)
    print("MPS GPU is enabled.")
else:
    raise EnvironmentError(
        "No GPU or MPS device found. Please check your environment and ensure GPU or MPS support is configured."
Not all the other examples seem to require a GPU/MPS backend. Is this a hard requirement?
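If it isn't a hard requirement, a device-selection fallback that degrades to CPU instead of raising might look like this (a sketch, not code from the PR; it assumes the notebook uses `torch`):

```python
# Pick the best available device, falling back to CPU instead of raising.
try:
    import torch

    if torch.cuda.is_available():
        device = "cuda"
    elif getattr(torch.backends, "mps", None) and torch.backends.mps.is_available():
        device = "mps"
    else:
        device = "cpu"
except ImportError:  # torch not installed at all
    device = "cpu"

print(f"Using device: {device}")
```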
AZURE_OPENAI_ENDPOINT = os.getenv("AZURE_OPENAI_ENDPOINT")
AZURE_OPENAI_API_KEY = os.getenv("AZURE_OPENAI_API_KEY")
AZURE_OPENAI_CHAT_MODEL = os.getenv("AZURE_OPENAI_CHAT_MODEL")
AZURE_OPENAI_API_VERSION = os.getenv("AZURE_OPENAI_API_VERSION")
You could include a default version to save folks from having to go to the docs to chase a good version number.
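e.g. something along these lines (the version string here is only a placeholder; a current GA version should be taken from the Azure OpenAI docs):

```python
import os

# Fall back to a known API version when the env var is unset.
# "2024-06-01" is illustrative -- check the Azure OpenAI docs for a current GA version.
AZURE_OPENAI_API_VERSION = os.getenv("AZURE_OPENAI_API_VERSION", "2024-06-01")
```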
# Part 0: Prerequisites
- Azure AI Search resource
- Azure OpenAI resource with deployed embeddings & chat models
Further down you make the assumption that the embedding model is text-embedding-3-small, maybe we could call this out here so folks know what to deploy in their Azure OpenAI instance?
        embedding_vector = embed_text(chunk_text)
        upload_docs.append(
            {
                "chunk_id": str(uuid.uuid4()),
In a previous cell you created a list of tuples (chunk id, chunk text), but here you're not using that id and are generating a GUID instead. The previously generated chunk id doesn't seem to be used anywhere. It seems you only need one of the two: either keep the chunk ids you generated before (then you don't need the new GUID here; use the first element of the tuple instead), or don't generate chunk ids in the previous cell (and in that case it looks like you don't need to create all_chunks at all, since you could just enumerate the original doc_chunks instead).
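Roughly, reusing the ids from the earlier cell would look like this (a sketch with made-up sample data; field names follow the snippet above):

```python
# (chunk_id, chunk_text) tuples as produced by the earlier cell (sample data here).
all_chunks = [
    ("chunk-0", "First chunk of text."),
    ("chunk-1", "Second chunk of text."),
]

upload_docs = []
for chunk_id, chunk_text in all_chunks:
    upload_docs.append(
        {
            "chunk_id": chunk_id,  # reuse the existing id instead of uuid.uuid4()
            "content": chunk_text,
        }
    )
```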
    subset = upload_docs[i : i + BATCH_SIZE]
    resp = search_client.upload_documents(documents=subset)
    console.print(
        f"Uploaded batch {i} -> {i+len(subset)}; success: {resp[0].succeeded}, status code: {resp[0].status_code}"
Did you mean to check whether all documents succeeded (or any failed)? Above you're only checking the first document in the batch.
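Something like the following would surface every failure in the batch (a sketch; the dataclass is a stand-in for the SDK's per-document result object, which exposes `key`, `succeeded`, and `status_code`):

```python
from dataclasses import dataclass


@dataclass
class IndexingResult:  # stand-in for the Azure Search SDK's per-document result
    key: str
    succeeded: bool
    status_code: int


# Pretend this came back from search_client.upload_documents(...)
resp = [
    IndexingResult("doc-1", True, 201),
    IndexingResult("doc-2", False, 422),
]

failed = [r for r in resp if not r.succeeded]
if failed:
    print(f"{len(failed)} of {len(resp)} documents failed: {[r.key for r in failed]}")
else:
    print(f"All {len(resp)} documents uploaded successfully.")
```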
from azure.search.documents.models import VectorizableTextQuery

def generate_chat_response(prompt: str, system_message: str = None):
Since this method creates a new message list each time, no chat history is preserved between turns (e.g. you can't ask follow-up questions). I assume this was intentional, but calling it out just in case.
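If multi-turn chat were desired, a minimal history-preserving variant could look like this (a sketch; the actual Azure OpenAI call is stubbed out so it runs offline):

```python
# Shared message list so each turn sees the previous ones.
history = [{"role": "system", "content": "You are a helpful RAG assistant."}]


def generate_chat_response(prompt: str) -> str:
    history.append({"role": "user", "content": prompt})
    # In the notebook this would be the Azure OpenAI chat-completions call,
    # passing the full `history` list as `messages`. Stubbed here:
    reply = f"(model reply to: {prompt})"
    history.append({"role": "assistant", "content": reply})
    return reply


generate_chat_response("What is Docling?")
generate_chat_response("And how does it chunk documents?")  # sees the first turn
```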
Azure AI Search RAG Example Using Docling
This notebook demonstrates how to build a Retrieval Augmented Generation (RAG) system using Docling and Azure AI Search. The example showcases document parsing, chunking, vector search integration, and RAG query implementation.
Checklist: