Newest 'azure-openai' Questions

0 votes

1 answer

46 views

Azure Agent Retrieval (Azure AI Search) returns JSON references instead of LLM-generated answer

I’m testing Azure Agent Retrieval with Azure AI Search and Azure OpenAI integration. However, instead of getting a natural language LLM-generated answer, the response only returns raw JSON with ...

roczstar

73

asked Oct 28 at 20:50

0 votes

0 answers

45 views

Is there a way to diagnose connection issues between the Azure OpenAI platform and other services?

We're setting up an Azure OpenAI instance that needs to connect to an Azure AI Search instance (i.e. "Bring Your Own Data"), we're currently using an API key to connect to the Search ...

Zhaph - Ben Duguid

27k

asked Oct 21 at 16:49

0 votes

1 answer

59 views

'metadata_storage_name' always null: Missing or empty value '/document/blob_name'

Using Azure's AI Search Service to chunk and read textfiles, the target field values are always NULL. I tried using a simplified example, just for the fields metadata_blob_name / title and ...

Jonathan

2,107

asked Oct 8 at 8:33

0 votes

1 answer

107 views

Structured output with full response content and additional metadata

I want to create a structured output from an agent call that contains the full response content, as well as additional metadata. For example, I am making a call to find the differences between two ...

Daniel André

1,248

asked Oct 6 at 18:12

0 votes

1 answer

81 views

How can I update the capacity of a finetuned GPT model on Azure using Python?

I want to update the capacity of a finetuned GPT model on Azure. How can I do so in Python? The following code used to work a few months ago (it used to take a few seconds to update the capacity) but ...

Franck Dernoncourt

84.7k

asked Oct 2 at 10:00

2 votes

1 answer

308 views

Implementing a custom LiteLlm for Google agent development kit that supports tool calls

I'm trying to implement a custom LiteLlm wrapper to access an LLM endpoint. This is the code I have: import os import litellm import requests from dotenv import load_dotenv from litellm import ...

Al Ant

3

asked Sep 17 at 21:35

0 votes

0 answers

149 views

Configuring MCP with Microsoft.Extensions.AI

Using the following Microsoft template as my base: https://devblogs.microsoft.com/dotnet/announcing-dotnet-ai-template-preview1/ I want to be able to add my local stdio MCP server as a tool but can't ...

ChewToy

756

asked Aug 27 at 3:12

1 vote

1 answer

203 views

how to download the files generated by Azure OpenAi Assistant Code Interpreter in streaming mode

In streaming mode, there is an event "response.output_text.annotation.added", the data looks like { "type": "response.output_text.annotation.added", "...

Luk Aron

1,497

asked Aug 20 at 8:33

0 votes

0 answers

59 views

Correct input for OpenAI embeddings API?

I'm using the OpenAi text-embedding-3-small model to create embeddings for each product category in a file. In total it's about 6000 product categories and they look like this: Vehicles & Parts &...

eztam

3,873

asked Aug 16 at 18:16

1 vote

0 answers

169 views

How to authenticate Azure Web App using Managed Identity to APIM?

I have APIM with an AOAI backend. APIM authenticates to AOAI using a separate Managed Identity by way of the following API policy plus AOAI IAM roles: <authentication-managed-identity resource=&...

Angrydot

21

asked Jul 14 at 2:19

0 votes

0 answers

22 views

Feedback button not showing after updating cdn path in aoai webapp

After I Updated the cdn path so that the icons work, the other icons are showing but the ThumbsUp & ThumbsDown buttons are not showing now. What has to be done for that and why this is happening ...

Sweekriti Pant

1

asked Jul 7 at 7:49

-3 votes

1 answer

128 views

Fill vector store with CURL does not work

I want to fill the vector memory of my project in Azure AI Foundry via a CURL command. I use the way described in the documentation Uploading the file, which apparently has to be done before adding ...

HendrikD2005

1

asked Jul 3 at 11:39

0 votes

1 answer

138 views

Why is pylance not able to statically determine Literal Union type in dictionary value

When writing a to_ChatCompletionMessageParam() instance method, I have the following implementation: def to_ChatCompletionMessageParam(self) -> ChatCompletionMessageParam: author: Literal["...

feverdreme

537

asked Jul 1 at 23:04

0 votes

2 answers

349 views

Semantic Kernel plugin methods to return literal values without re-interpreting

I have a C# semantic kernel app that calls various plugin methods. Each method returns a string value. I would like for the app to return this exact string value to the user, but instead semantic ...

jc116

63

asked Jun 30 at 18:53

2 votes

1 answer

70 views

Why can't (langchain) AzureOpenAI find a model that AzureChatOpenAI can?

Invoking a request using AzureChatOpenAI returns response as expected: import os from dotenv import load_dotenv from langchain_openai import AzureChatOpenAI load_dotenv() llm = AzureChatOpenAI( ...

Sanchez333

460

asked Jun 20 at 7:17

2 votes

1 answer

1k views

Receiving a 400 error when calling Azure OpenAI Python SDK Response.create

I'm running an Azure Responses call using the Azure Python SDK: from openai import AzureOpenAI . . . self.client = AzureOpenAI( api_key=config.get("api_key"), base_url= config.get(&...

Joseph Pourtaymour

21

asked Jun 4 at 9:27

1 vote

1 answer

313 views

Azure OpenAI / Foundry: Vector Store File Attributes (Metadata) Problem

I'm encountering an issue when trying to add attributes (metadata) to files in an Azure OpenAI Vector Store. My setup: Azure OpenAI API (version 2025-04-01-preview) Python SDK for file uploads Goal: ...

Luke

385

asked May 18 at 8:14

0 votes

1 answer

534 views

Get IChatClient from AddAzureOpenAIChatCompletion

I'm trying to create a MCP (Model Context Protocol) client using C#'s SDK and Azure Open AI. This MCP clients needs to have Sampling capabilities, but I don't know how to extract the IChatClient ...

Infinity

3

asked May 6 at 10:04

0 votes

0 answers

99 views

How to loop through text chunks created using AzureOpenAI `client.vector_stores.create`

I checked Azure's documentation on this topic here but I do not see anything related to this. My goal is to create a question and answer dataset for my RAG solution based on each chunk for a good ...

Mike B

3,629

asked May 5 at 13:03

1 vote

3 answers

662 views

Issues connecting to Azure AI Foundry Models using MS Entra ID

I am trying to connect a non-Azure OpenAI embedding model in AI Foundry using Microsoft Entra ID. I don't have an issue connecting to Azure OpenAI embedding models using Microsoft Entra ID. After ...

trideau

31

asked May 3 at 17:17

2 votes

2 answers

409 views

Azure OpenAI: DeploymentNotFound: The API deployment for this resource does not exist

I am using AzureOpenAI Service, and I just encountered an issue openai.NotFoundError: Error code: 404 - {'error': {'code': 'DeploymentNotFound', 'message': 'The API deployment for this resource does ...

mengyuest

21

asked May 2 at 23:06

1 vote

2 answers

314 views

Semantic Kernel OpenAIAssistantAgent GroupChat InvokeStreamingAsync NullReferenceException in swedencentral

Maybe you have the same issue or any idea how to fix this?! I'm using Semantic Kernel (.net) to access an Azure AI Service with GPT-4o model deployed in swedencentral via an APIM. When I start a group ...

Haiko

56

asked Apr 28 at 12:31

0 votes

1 answer

71 views

Automating using Autogen AI to fetch azure migrate resport and generate PPT

Im new to autogen ai currently using autogen version 0.7 Automating using Autogen AI to fetch azure migrate assessment report and generating PPT using autogen agents group chat agent but some times ...

User

79

asked Apr 24 at 15:34

1 vote

1 answer

362 views

How to reduce response latency in Azure OpenAI GPT-3.5 Turbo & GPT-4 API?

I'm using the Azure OpenAI API with GPT-3.5 Turbo and GPT-4 models. I'm currently experiencing average response times of: GPT-3.5 Turbo: ~900ms GPT-4: ~1.3 seconds I’m already using the following ...

Shaan

11

asked Apr 21 at 20:25

0 votes

0 answers

114 views

OpenAI Azure chat.completions.create stalling without throwing exception

I have a fastapi endpoint which streams a response from Azure OpenAI: @router.post("/message", dependencies=[Depends(validate_auth)]) def message(sessionID: str = Form(...), messages: str = ...

Benjamin Gilbert

2

asked Apr 21 at 17:01

0 votes

1 answer

490 views

How do you get Azure OpenAI to return json in C#?

If I want to return json from Azure OpenAI, such as Gpt-4o, how do I get it to respond in json? I've found examples in Python, but none using the Azure.AI.OpenAI sdk library for C# .NET.

Robert Corvus

2,176

asked Apr 19 at 1:28

1 vote

1 answer

241 views

How to properly use OpenAI.RealtimeConversation.RealtimeConversationClient to start multiple RealtimeConversationSession(s)?

The Azure OpenAI SDK appears to follow a one-client-per-session model, but I want to be sure. Since it is still in beta, it is not clear if a single instance of the client returned by ...

kalabic

21

asked Apr 17 at 21:21

3 votes

1 answer

708 views

Suddenly getting 400 Bad Request error calling OpenAI.Chat.ChatClient.CompleteChat()

Late last week, our Azure-based RAG chat service started returning HTTP 400 Bad Request errors. We are running a web API on ASP.NET Core 8. Nothing in the code changed, and I have verified that our ...

WhosOnFirst

31

asked Apr 16 at 20:01

0 votes

1 answer

167 views

How can I deploy a fine-tuned GPT model in Azure via Python without using a token (e.g., using an endpoint key instead)?

I follow Azure's tutorial on fine-tuning GPT. Here is the code for the deployment phase: # Deploy fine-tuned model import json import requests token = '[redacted]' subscription = '[redacted]' ...

Franck Dernoncourt

84.7k

asked Apr 14 at 19:27

0 votes

1 answer

334 views

Using Managed Identity to call Azure OpenAI endpoint instead of Azure OpenAI key

I have the following code to call Azure OpenAI endpoint using OpenAI Key. import axios from 'axios'; const openaiApiKey = process.env.REACT_APP_OPENAI_API_KEY; const openaiApiEndpoint = process.env....

user989988

3,896

asked Apr 12 at 4:17

0 votes

1 answer

910 views

Azure OpenAI Embedding SDK (C#) – Official example uses outdated types and alternatives return 404

I have successfully deployed a model (text-embedding-3-large) using the Azure OpenAI portal, and I'm now trying to integrate it using the OpenAI .NET SDK. The Azure portal itself provides the ...

José Augustinho

110

asked Apr 7 at 13:34

0 votes

0 answers

43 views

OpenAI dotnet SDK - Assistant : How to retain the conversation context

I am using OpenAI dotnet sdk, I use assistant to post the prompt and to get the response. I would like assistant to remember the previous prompt posted (i.e. retain the context of the conversation)...

Sankaranarayanan Rajkumar

51

asked Apr 7 at 6:35

0 votes

1 answer

172 views

How to call AzureOpenAI API with PyRIT?

Im triying to do a basic PyRIT project but im not able to use the AzureOpenAI with PyRIT. I have this code woking, but not with PyRIT: from openai import AzureOpenAI client = AzureOpenAI( api_key=&...

user30129718

3

asked Apr 1 at 8:25

0 votes

1 answer

1k views

Why doesn't my Azure OpenAI model show up in Cursor?

Why doesn't my Azure OpenAI model show up in Cursor as one of the available models? I added my Azure OpenAI account show up in Cursor in the Settings window: The green toggle switch seems to ...

Franck Dernoncourt

84.7k

asked Mar 29 at 23:57

1 vote

1 answer

1k views

Connecting AzureOpenAI via LiteLLM - returning Authentication Error 401

I am using DSPy framework (v2.6.4) which uses liteLLM (v1.63.7) to connect to LLMs. While connecting Azure OpenAI via liteLLM (v1.63.7) using the below method (Azure AD Token Refresh - ...

4run4

35

asked Mar 27 at 8:08

0 votes

0 answers

66 views

PandasAI Error in VS Code but Works in Google Colab

I'm using pandasai with AzureOpenAI to analyze a DataFrame. My code works fine in Google Colab but throws an error in VS Code. Code: import pandasai as pai from pandasai_openai import AzureOpenAI ...

Sh'Jil

82

asked Mar 27 at 6:01

0 votes

1 answer

309 views

Additional headers are not being sent to the Azure OpenAI through Weaviate SDK

I am using the Python SDK to connect to Weaviate. Additionally, I am create collection that is able to send requests to Azure OpenAI, using text2vec-azure-openai module. To use that, I am sending X-...

user30039021

1

asked Mar 24 at 6:58

1 vote

1 answer

431 views

Azure OpenAI Phi-4-multimodel-instruct: 'auto' tool choice error when using runTools() method that worked with GPT-4o

I recently switched from using GPT-4o to Phi-4-multimodel-instruct in my Next.js application using Azure AI services, but I'm encountering the following error: BadRequestError: 400 {"object":...

Junaid Nazir

121

asked Mar 22 at 18:01

0 votes

2 answers

271 views

How can I resolve the 403 Forbidden error when deploying a fine-tuned GPT model in Azure via Python?

I follow Azure's tutorial on fine-tuning GPT. I'm stuck at the deployment phase. Code: # Deploy fine-tuned model import json import requests token = '[redacted]' subscription = '[redacted]' ...

Franck Dernoncourt

84.7k

asked Mar 20 at 17:31

0 votes

1 answer

403 views

The API deployment for this resource does not exist. If you created the deployment within the last 5 minutes, please wait a moment and try again

I am creating a POC for a Chat bot using Azure OpenAI and .Net Core. I have created Open AI resource and Deployment Model in Azure. But I am getting error when calling Completechat() function. Error : ...

Ak C

53

asked Mar 20 at 9:35

3 votes

1 answer

540 views

Azure AI Search - Tag Scoring profile on azureopenai extra_body

I created an index on Azure AI Search and connected it to Azure OpenAI using the extra_body. It works perfectly. However, I created a default scoring profile for my index, which boosts documents ...

R_Student

809

asked Mar 16 at 0:36

1 vote

1 answer

315 views

My AzureOpenAI batch job shows complete, but only a few rows (not all rows) are completed in the batch file without any error

I have tried uploading a batch job with unique custom_id for each row in my input file. The job gets validated but completes very quickly and once I check the job only 276/4096 (as shown in the ...

Nandan Thakur

46

asked Mar 15 at 16:59

2 votes

1 answer

309 views

Is there a way to disable content filteration in Azure OpenAi or to process the prompt before using it [closed]

I’m working on a report generation using some rubric’s and a conversation string containing questions and answers for the questions from the user which is captured through openai real-time api. But ...

EPIC PIXELS

21

asked Mar 8 at 19:03

0 votes

1 answer

381 views

How to Reduce OpenAI Azure Response Time for Structured Output Using GPT-4o Mini (Fine-Tuned Model)?

I am using an Azure OpenAI GPT-4o Mini fine-tuned model to generate structured responses (e.g., JSON format). However, the response time is higher than expected, and I am looking for ways to optimize ...

BitByBitBuilder

1

asked Mar 3 at 10:15

-1 votes

1 answer

85 views

I am not able to create the embeddings of my fields titleVector and contentVector

I used AzureOpenAI Vectorizer to create embeddings in my search index but it is not created in search index why while uploadind documents i only uploaded id,content,title and titleVector and ...

A V Saravanan

1

asked Mar 1 at 9:59

0 votes

1 answer

331 views

409 conflict while updating multiple properties of terraform azapi_resource

We are using terraform's az_api resource to create Azure openai content filter(rai_policy). While updating the contentFilters property for multiple resources , we are seeing below error **RESPONSE 409:...

G_S

115

asked Feb 28 at 15:39

0 votes

2 answers

592 views

How to use Azure AI OpenAI to generate responses from a model using a set of data inputs?

I’m trying to use Azure AI OpenAI to generate responses from a trained model based on a set of data I’ll provide as part of the prompt. My goal is to pass a set of data (e.g., metrics, statistics, or ...

imj

470

asked Feb 26 at 17:41

0 votes

0 answers

136 views

Autogen groupchat and Society of Mind - Issue in making framework with added complexity

I am trying to create 3 step framework where 1. data quality issue gets reported , 2. user reviewes the data quality and 3. approves to execute agents for DQ corrections in database. I have three ...

Carpe Diem

73

asked Feb 22 at 15:55

0 votes

1 answer

279 views

Azure Devops Pull Request Review Integration with AzureOpenAI instance

We are using Azure Devops Server 2022 on-premise and we have a private Azure OpenAI instance enabled with a private endpoint. In VSTS marketplace, we were able to find many extensions: https://...

Vowneee

1,551

asked Feb 19 at 19:13

3 votes

1 answer

2k views

Calculating total cost of OpenAI reponses with cached input tokens

Should I subtract the number of cached input tokens from the number of input tokens when I receive the 'Usage' object back in the OpenAI response? I'm trying to calculate the total cost of the ...

dmm-l-mediehus

21

asked Feb 19 at 13:23

Collectives™ on Stack Overflow