mirror of
				https://github.com/hwchase17/langchain.git
				synced 2025-10-25 21:03:11 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			343 lines
		
	
	
		
			13 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
			
		
		
	
	
			343 lines
		
	
	
		
			13 KiB
		
	
	
	
		
			Plaintext
		
	
	
	
	
	
| {
 | |
|  "cells": [
 | |
|   {
 | |
|    "cell_type": "markdown",
 | |
|    "metadata": {},
 | |
|    "source": [
 | |
|     "# Azure Cognitive Services Toolkit\n",
 | |
|     "\n",
 | |
|     "This toolkit is used to interact with the `Azure Cognitive Services API` to achieve some multimodal capabilities.\n",
 | |
|     "\n",
 | |
|     "Currently There are four tools bundled in this toolkit:\n",
 | |
|     "- AzureCogsImageAnalysisTool: used to extract caption, objects, tags, and text from images. (Note: this tool is not available on Mac OS yet, due to the dependency on `azure-ai-vision` package, which is only supported on Windows and Linux currently.)\n",
 | |
|     "- AzureCogsFormRecognizerTool: used to extract text, tables, and key-value pairs from documents.\n",
 | |
|     "- AzureCogsSpeech2TextTool: used to transcribe speech to text.\n",
 | |
|     "- AzureCogsText2SpeechTool: used to synthesize text to speech.\n",
 | |
|     "- AzureCogsTextAnalyticsHealthTool: used to extract healthcare entities."
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "markdown",
 | |
|    "metadata": {},
 | |
|    "source": [
 | |
|     "First, you need to set up an Azure account and create a Cognitive Services resource. You can follow the instructions [here](https://docs.microsoft.com/en-us/azure/cognitive-services/cognitive-services-apis-create-account?tabs=multiservice%2Cwindows) to create a resource. \n",
 | |
|     "\n",
 | |
|     "Then, you need to get the endpoint, key and region of your resource, and set them as environment variables. You can find them in the \"Keys and Endpoint\" page of your resource."
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "%pip install --upgrade --quiet  azure-ai-formrecognizer > /dev/null\n",
 | |
|     "%pip install --upgrade --quiet  azure-cognitiveservices-speech > /dev/null\n",
 | |
|     "%pip install --upgrade --quiet  azure-ai-textanalytics > /dev/null\n",
 | |
|     "\n",
 | |
|     "# For Windows/Linux\n",
 | |
|     "%pip install --upgrade --quiet  azure-ai-vision > /dev/null"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "%pip install -qU langchain-community"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "import os\n",
 | |
|     "\n",
 | |
|     "os.environ[\"OPENAI_API_KEY\"] = \"sk-\"\n",
 | |
|     "os.environ[\"AZURE_COGS_KEY\"] = \"\"\n",
 | |
|     "os.environ[\"AZURE_COGS_ENDPOINT\"] = \"\"\n",
 | |
|     "os.environ[\"AZURE_COGS_REGION\"] = \"\""
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "markdown",
 | |
|    "metadata": {},
 | |
|    "source": [
 | |
|     "## Create the Toolkit"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": 19,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "from langchain_community.agent_toolkits import AzureCognitiveServicesToolkit\n",
 | |
|     "\n",
 | |
|     "toolkit = AzureCognitiveServicesToolkit()"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [
 | |
|     {
 | |
|      "data": {
 | |
|       "text/plain": [
 | |
|        "['Azure Cognitive Services Image Analysis',\n",
 | |
|        " 'Azure Cognitive Services Form Recognizer',\n",
 | |
|        " 'Azure Cognitive Services Speech2Text',\n",
 | |
|        " 'Azure Cognitive Services Text2Speech']"
 | |
|       ]
 | |
|      },
 | |
|      "execution_count": null,
 | |
|      "metadata": {},
 | |
|      "output_type": "execute_result"
 | |
|     }
 | |
|    ],
 | |
|    "source": [
 | |
|     "[tool.name for tool in toolkit.get_tools()]"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "markdown",
 | |
|    "metadata": {},
 | |
|    "source": [
 | |
|     "## Use within an Agent"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": 20,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "from langchain.agents import AgentType, initialize_agent\n",
 | |
|     "from langchain_openai import OpenAI"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": 21,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "llm = OpenAI(temperature=0)\n",
 | |
|     "agent = initialize_agent(\n",
 | |
|     "    tools=toolkit.get_tools(),\n",
 | |
|     "    llm=llm,\n",
 | |
|     "    agent=AgentType.STRUCTURED_CHAT_ZERO_SHOT_REACT_DESCRIPTION,\n",
 | |
|     "    verbose=True,\n",
 | |
|     ")"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [
 | |
|     {
 | |
|      "name": "stdout",
 | |
|      "output_type": "stream",
 | |
|      "text": [
 | |
|       "\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
 | |
|       "\u001b[32;1m\u001b[1;3m\n",
 | |
|       "Action:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"Azure Cognitive Services Image Analysis\",\n",
 | |
|       "  \"action_input\": \"https://images.openai.com/blob/9ad5a2ab-041f-475f-ad6a-b51899c50182/ingredients.png\"\n",
 | |
|       "}\n",
 | |
|       "```\n",
 | |
|       "\n",
 | |
|       "\u001b[0m\n",
 | |
|       "Observation: \u001b[36;1m\u001b[1;3mCaption: a group of eggs and flour in bowls\n",
 | |
|       "Objects: Egg, Egg, Food\n",
 | |
|       "Tags: dairy, ingredient, indoor, thickening agent, food, mixing bowl, powder, flour, egg, bowl\u001b[0m\n",
 | |
|       "Thought:\u001b[32;1m\u001b[1;3m I can use the objects and tags to suggest recipes\n",
 | |
|       "Action:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"Final Answer\",\n",
 | |
|       "  \"action_input\": \"You can make pancakes, omelettes, or quiches with these ingredients!\"\n",
 | |
|       "}\n",
 | |
|       "```\u001b[0m\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Finished chain.\u001b[0m\n"
 | |
|      ]
 | |
|     },
 | |
|     {
 | |
|      "data": {
 | |
|       "text/plain": [
 | |
|        "'You can make pancakes, omelettes, or quiches with these ingredients!'"
 | |
|       ]
 | |
|      },
 | |
|      "execution_count": null,
 | |
|      "metadata": {},
 | |
|      "output_type": "execute_result"
 | |
|     }
 | |
|    ],
 | |
|    "source": [
 | |
|     "agent.run(\n",
 | |
|     "    \"What can I make with these ingredients?\"\n",
 | |
|     "    \"https://images.openai.com/blob/9ad5a2ab-041f-475f-ad6a-b51899c50182/ingredients.png\"\n",
 | |
|     ")"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [
 | |
|     {
 | |
|      "name": "stdout",
 | |
|      "output_type": "stream",
 | |
|      "text": [
 | |
|       "\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
 | |
|       "\u001b[32;1m\u001b[1;3mAction:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"Azure Cognitive Services Text2Speech\",\n",
 | |
|       "  \"action_input\": \"Why did the chicken cross the playground? To get to the other slide!\"\n",
 | |
|       "}\n",
 | |
|       "```\n",
 | |
|       "\n",
 | |
|       "\u001b[0m\n",
 | |
|       "Observation: \u001b[31;1m\u001b[1;3m/tmp/tmpa3uu_j6b.wav\u001b[0m\n",
 | |
|       "Thought:\u001b[32;1m\u001b[1;3m I have the audio file of the joke\n",
 | |
|       "Action:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"Final Answer\",\n",
 | |
|       "  \"action_input\": \"/tmp/tmpa3uu_j6b.wav\"\n",
 | |
|       "}\n",
 | |
|       "```\u001b[0m\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Finished chain.\u001b[0m\n"
 | |
|      ]
 | |
|     },
 | |
|     {
 | |
|      "data": {
 | |
|       "text/plain": [
 | |
|        "'/tmp/tmpa3uu_j6b.wav'"
 | |
|       ]
 | |
|      },
 | |
|      "execution_count": null,
 | |
|      "metadata": {},
 | |
|      "output_type": "execute_result"
 | |
|     }
 | |
|    ],
 | |
|    "source": [
 | |
|     "audio_file = agent.run(\"Tell me a joke and read it out for me.\")"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": [
 | |
|     "from IPython import display\n",
 | |
|     "\n",
 | |
|     "audio = display.Audio(audio_file)\n",
 | |
|     "display.display(audio)"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": 22,
 | |
|    "metadata": {},
 | |
|    "outputs": [
 | |
|     {
 | |
|      "name": "stdout",
 | |
|      "output_type": "stream",
 | |
|      "text": [
 | |
|       "\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Entering new AgentExecutor chain...\u001b[0m\n",
 | |
|       "\u001b[32;1m\u001b[1;3mAction:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"azure_cognitive_services_text_analyics_health\",\n",
 | |
|       "  \"action_input\": \"The patient is a 54-year-old gentleman with a history of progressive angina over the past several months. The patient had a cardiac catheterization in July of this year revealing total occlusion of the RCA and 50% left main disease, with a strong family history of coronary artery disease with a brother dying at the age of 52 from a myocardial infarction and another brother who is status post coronary artery bypass grafting. The patient had a stress echocardiogram done on July, 2001, which showed no wall motion abnormalities, but this was a difficult study due to body habitus. The patient went for six minutes with minimal ST depressions in the anterior lateral leads, thought due to fatigue and wrist pain, his anginal equivalent. Due to the patient's increased symptoms and family history and history left main disease with total occasional of his RCA was referred for revascularization with open heart surgery.\"\n",
 | |
|       "}\n",
 | |
|       "```\n",
 | |
|       "\u001b[0m\n",
 | |
|       "Observation: \u001b[36;1m\u001b[1;3mThe text conatins the following healthcare entities: 54-year-old is a healthcare entity of type Age, gentleman is a healthcare entity of type Gender, progressive angina is a healthcare entity of type Diagnosis, past several months is a healthcare entity of type Time, cardiac catheterization is a healthcare entity of type ExaminationName, July of this year is a healthcare entity of type Time, total is a healthcare entity of type ConditionQualifier, occlusion is a healthcare entity of type SymptomOrSign, RCA is a healthcare entity of type BodyStructure, 50 is a healthcare entity of type MeasurementValue, % is a healthcare entity of type MeasurementUnit, left main is a healthcare entity of type BodyStructure, disease is a healthcare entity of type Diagnosis, family is a healthcare entity of type FamilyRelation, coronary artery disease is a healthcare entity of type Diagnosis, brother is a healthcare entity of type FamilyRelation, dying is a healthcare entity of type Diagnosis, 52 is a healthcare entity of type Age, myocardial infarction is a healthcare entity of type Diagnosis, brother is a healthcare entity of type FamilyRelation, coronary artery bypass grafting is a healthcare entity of type TreatmentName, stress echocardiogram is a healthcare entity of type ExaminationName, July, 2001 is a healthcare entity of type Time, wall motion abnormalities is a healthcare entity of type SymptomOrSign, body habitus is a healthcare entity of type SymptomOrSign, six minutes is a healthcare entity of type Time, minimal is a healthcare entity of type ConditionQualifier, ST depressions in the anterior lateral leads is a healthcare entity of type SymptomOrSign, fatigue is a healthcare entity of type SymptomOrSign, wrist pain is a healthcare entity of type SymptomOrSign, anginal equivalent is a healthcare entity of type SymptomOrSign, increased is a healthcare entity of type Course, symptoms is a healthcare entity of type SymptomOrSign, family is a healthcare entity of type FamilyRelation, left is a healthcare entity of type Direction, main is a healthcare entity of type BodyStructure, disease is a healthcare entity of type Diagnosis, occasional is a healthcare entity of type Course, RCA is a healthcare entity of type BodyStructure, revascularization is a healthcare entity of type TreatmentName, open heart surgery is a healthcare entity of type TreatmentName\u001b[0m\n",
 | |
|       "Thought:\u001b[32;1m\u001b[1;3m I know what to respond\n",
 | |
|       "Action:\n",
 | |
|       "```\n",
 | |
|       "{\n",
 | |
|       "  \"action\": \"Final Answer\",\n",
 | |
|       "  \"action_input\": \"The text contains the following diagnoses: progressive angina, coronary artery disease, myocardial infarction, and coronary artery bypass grafting.\"\n",
 | |
|       "}\n",
 | |
|       "```\u001b[0m\n",
 | |
|       "\n",
 | |
|       "\u001b[1m> Finished chain.\u001b[0m\n"
 | |
|      ]
 | |
|     },
 | |
|     {
 | |
|      "data": {
 | |
|       "text/plain": [
 | |
|        "'The text contains the following diagnoses: progressive angina, coronary artery disease, myocardial infarction, and coronary artery bypass grafting.'"
 | |
|       ]
 | |
|      },
 | |
|      "execution_count": 22,
 | |
|      "metadata": {},
 | |
|      "output_type": "execute_result"
 | |
|     }
 | |
|    ],
 | |
|    "source": [
 | |
|     "agent.run(\n",
 | |
|     "    \"\"\"The patient is a 54-year-old gentleman with a history of progressive angina over the past several months.\n",
 | |
|     "The patient had a cardiac catheterization in July of this year revealing total occlusion of the RCA and 50% left main disease ,\n",
 | |
|     "with a strong family history of coronary artery disease with a brother dying at the age of 52 from a myocardial infarction and\n",
 | |
|     "another brother who is status post coronary artery bypass grafting. The patient had a stress echocardiogram done on July , 2001 ,\n",
 | |
|     "which showed no wall motion abnormalities , but this was a difficult study due to body habitus. The patient went for six minutes with\n",
 | |
|     "minimal ST depressions in the anterior lateral leads , thought due to fatigue and wrist pain , his anginal equivalent. Due to the patient's\n",
 | |
|     "increased symptoms and family history and history left main disease with total occasional of his RCA was referred for revascularization with open heart surgery.\n",
 | |
|     "\n",
 | |
|     "List all the diagnoses.\n",
 | |
|     "\"\"\"\n",
 | |
|     ")"
 | |
|    ]
 | |
|   },
 | |
|   {
 | |
|    "cell_type": "code",
 | |
|    "execution_count": null,
 | |
|    "metadata": {},
 | |
|    "outputs": [],
 | |
|    "source": []
 | |
|   }
 | |
|  ],
 | |
|  "metadata": {
 | |
|   "kernelspec": {
 | |
|    "display_name": "Python 3 (ipykernel)",
 | |
|    "language": "python",
 | |
|    "name": "python3"
 | |
|   },
 | |
|   "language_info": {
 | |
|    "codemirror_mode": {
 | |
|     "name": "ipython",
 | |
|     "version": 3
 | |
|    },
 | |
|    "file_extension": ".py",
 | |
|    "mimetype": "text/x-python",
 | |
|    "name": "python",
 | |
|    "nbconvert_exporter": "python",
 | |
|    "pygments_lexer": "ipython3",
 | |
|    "version": "3.8.10"
 | |
|   }
 | |
|  },
 | |
|  "nbformat": 4,
 | |
|  "nbformat_minor": 4
 | |
| }
 |