531 questions
0
votes
1
answer
46
views
Azure Agent Retrieval (Azure AI Search) returns JSON references instead of LLM-generated answer
I’m testing Azure Agent Retrieval with Azure AI Search and Azure OpenAI integration.
However, instead of getting a natural language LLM-generated answer, the response only returns raw JSON with ...
0
votes
0
answers
45
views
Is there a way to diagnose connection issues between the Azure OpenAI platform and other services?
We're setting up an Azure OpenAI instance that needs to connect to an Azure AI Search instance (i.e. "Bring Your Own Data"), we're currently using an API key to connect to the Search ...
0
votes
1
answer
59
views
'metadata_storage_name' always null: Missing or empty value '/document/blob_name'
Using Azure's AI Search Service to chunk and read textfiles, the target field values are always NULL.
I tried using a simplified example, just for the fields metadata_blob_name / title and ...
0
votes
1
answer
107
views
Structured output with full response content and additional metadata
I want to create a structured output from an agent call that contains the full response content, as well as additional metadata.
For example, I am making a call to find the differences between two ...
0
votes
1
answer
81
views
How can I update the capacity of a finetuned GPT model on Azure using Python?
I want to update the capacity of a finetuned GPT model on Azure. How can I do so in Python?
The following code used to work a few months ago (it used to take a few seconds to update the capacity) but ...
2
votes
1
answer
308
views
Implementing a custom LiteLlm for Google agent development kit that supports tool calls
I'm trying to implement a custom LiteLlm wrapper to access an LLM endpoint. This is the code I have:
import os
import litellm
import requests
from dotenv import load_dotenv
from litellm import ...
0
votes
0
answers
149
views
Configuring MCP with Microsoft.Extensions.AI
Using the following Microsoft template as my base:
https://devblogs.microsoft.com/dotnet/announcing-dotnet-ai-template-preview1/
I want to be able to add my local stdio MCP server as a tool but can't ...
1
vote
1
answer
203
views
how to download the files generated by Azure OpenAi Assistant Code Interpreter in streaming mode
In streaming mode, there is an event "response.output_text.annotation.added",
the data looks like
{
"type": "response.output_text.annotation.added",
"...
0
votes
0
answers
59
views
Correct input for OpenAI embeddings API?
I'm using the OpenAi text-embedding-3-small model to create embeddings for each product category in a file. In total it's about 6000 product categories and they look like this:
Vehicles & Parts &...
1
vote
0
answers
169
views
How to authenticate Azure Web App using Managed Identity to APIM?
I have APIM with an AOAI backend. APIM authenticates to AOAI using a separate Managed Identity by way of the following API policy plus AOAI IAM roles:
<authentication-managed-identity resource=&...
0
votes
0
answers
22
views
Feedback button not showing after updating cdn path in aoai webapp
After I Updated the cdn path so that the icons work, the other icons are showing but the ThumbsUp & ThumbsDown buttons are not showing now. What has to be done for that and why this is happening ...
-3
votes
1
answer
128
views
Fill vector store with CURL does not work
I want to fill the vector memory of my project in Azure AI Foundry via a CURL command. I use the way described in the documentation
Uploading the file, which apparently has to be done before adding ...
0
votes
1
answer
138
views
Why is pylance not able to statically determine Literal Union type in dictionary value
When writing a to_ChatCompletionMessageParam() instance method, I have the following implementation:
def to_ChatCompletionMessageParam(self) -> ChatCompletionMessageParam:
author: Literal["...
0
votes
2
answers
349
views
Semantic Kernel plugin methods to return literal values without re-interpreting
I have a C# semantic kernel app that calls various plugin methods. Each method returns a string value. I would like for the app to return this exact string value to the user, but instead semantic ...
2
votes
1
answer
70
views
Why can't (langchain) AzureOpenAI find a model that AzureChatOpenAI can?
Invoking a request using AzureChatOpenAI returns response as expected:
import os
from dotenv import load_dotenv
from langchain_openai import AzureChatOpenAI
load_dotenv()
llm = AzureChatOpenAI(
...
2
votes
1
answer
1k
views
Receiving a 400 error when calling Azure OpenAI Python SDK Response.create
I'm running an Azure Responses call using the Azure Python SDK:
from openai import AzureOpenAI
.
.
.
self.client = AzureOpenAI(
api_key=config.get("api_key"),
base_url= config.get(&...
1
vote
1
answer
313
views
Azure OpenAI / Foundry: Vector Store File Attributes (Metadata) Problem
I'm encountering an issue when trying to add attributes (metadata) to files in an Azure OpenAI Vector Store.
My setup:
Azure OpenAI API (version 2025-04-01-preview)
Python SDK for file uploads
Goal: ...
0
votes
1
answer
534
views
Get IChatClient from AddAzureOpenAIChatCompletion
I'm trying to create a MCP (Model Context Protocol) client using C#'s SDK and Azure Open AI. This MCP clients needs to have Sampling capabilities, but I don't know how to extract the IChatClient ...
0
votes
0
answers
99
views
How to loop through text chunks created using AzureOpenAI `client.vector_stores.create`
I checked Azure's documentation on this topic here but I do not see anything related to this. My goal is to create a question and answer dataset for my RAG solution based on each chunk for a good ...
1
vote
3
answers
662
views
Issues connecting to Azure AI Foundry Models using MS Entra ID
I am trying to connect a non-Azure OpenAI embedding model in AI Foundry using Microsoft Entra ID.
I don't have an issue connecting to Azure OpenAI embedding models using Microsoft Entra ID.
After ...
2
votes
2
answers
409
views
Azure OpenAI: DeploymentNotFound: The API deployment for this resource does not exist
I am using AzureOpenAI Service, and I just encountered an issue openai.NotFoundError: Error code: 404 - {'error': {'code': 'DeploymentNotFound', 'message': 'The API deployment for this resource does ...
1
vote
2
answers
314
views
Semantic Kernel OpenAIAssistantAgent GroupChat InvokeStreamingAsync NullReferenceException in swedencentral
Maybe you have the same issue or any idea how to fix this?!
I'm using Semantic Kernel (.net) to access an Azure AI Service with GPT-4o model deployed in swedencentral via an APIM. When I start a group ...
0
votes
1
answer
71
views
Automating using Autogen AI to fetch azure migrate resport and generate PPT
Im new to autogen ai currently using autogen version 0.7 Automating using Autogen AI to fetch azure migrate assessment report and generating PPT using autogen agents group chat agent but some times ...
1
vote
1
answer
362
views
How to reduce response latency in Azure OpenAI GPT-3.5 Turbo & GPT-4 API?
I'm using the Azure OpenAI API with GPT-3.5 Turbo and GPT-4 models. I'm currently experiencing average response times of:
GPT-3.5 Turbo: ~900ms
GPT-4: ~1.3 seconds
I’m already using the following ...
0
votes
0
answers
114
views
OpenAI Azure chat.completions.create stalling without throwing exception
I have a fastapi endpoint which streams a response from Azure OpenAI:
@router.post("/message", dependencies=[Depends(validate_auth)])
def message(sessionID: str = Form(...), messages: str = ...
0
votes
1
answer
490
views
How do you get Azure OpenAI to return json in C#?
If I want to return json from Azure OpenAI, such as Gpt-4o, how do I get it to respond in json? I've found examples in Python, but none using the Azure.AI.OpenAI sdk library for C# .NET.
1
vote
1
answer
241
views
How to properly use OpenAI.RealtimeConversation.RealtimeConversationClient to start multiple RealtimeConversationSession(s)?
The Azure OpenAI SDK appears to follow a one-client-per-session model, but I want to be sure. Since it is still in beta, it is not clear if a single instance of the client returned by ...
3
votes
1
answer
708
views
Suddenly getting 400 Bad Request error calling OpenAI.Chat.ChatClient.CompleteChat()
Late last week, our Azure-based RAG chat service started returning HTTP 400 Bad Request errors. We are running a web API on ASP.NET Core 8. Nothing in the code changed, and I have verified that our ...
0
votes
1
answer
167
views
How can I deploy a fine-tuned GPT model in Azure via Python without using a token (e.g., using an endpoint key instead)?
I follow Azure's tutorial on fine-tuning GPT. Here is the code for the deployment phase:
# Deploy fine-tuned model
import json
import requests
token = '[redacted]'
subscription = '[redacted]'
...
0
votes
1
answer
334
views
Using Managed Identity to call Azure OpenAI endpoint instead of Azure OpenAI key
I have the following code to call Azure OpenAI endpoint using OpenAI Key.
import axios from 'axios';
const openaiApiKey = process.env.REACT_APP_OPENAI_API_KEY;
const openaiApiEndpoint = process.env....
0
votes
1
answer
910
views
Azure OpenAI Embedding SDK (C#) – Official example uses outdated types and alternatives return 404
I have successfully deployed a model (text-embedding-3-large) using the Azure OpenAI portal, and I'm now trying to integrate it using the OpenAI .NET SDK.
The Azure portal itself provides the ...
0
votes
0
answers
43
views
OpenAI dotnet SDK - Assistant : How to retain the conversation context
I am using OpenAI dotnet sdk, I use assistant to post the prompt and to get the response. I would like assistant to remember the previous prompt posted (i.e. retain the context of the conversation)...
0
votes
1
answer
172
views
How to call AzureOpenAI API with PyRIT?
Im triying to do a basic PyRIT project but im not able to use the AzureOpenAI with PyRIT.
I have this code woking, but not with PyRIT:
from openai import AzureOpenAI
client = AzureOpenAI(
api_key=&...
0
votes
1
answer
1k
views
Why doesn't my Azure OpenAI model show up in Cursor?
Why doesn't my Azure OpenAI model show up in Cursor as one of the available models?
I added my Azure OpenAI account show up in Cursor in the Settings window:
The green toggle switch seems to ...
1
vote
1
answer
1k
views
Connecting AzureOpenAI via LiteLLM - returning Authentication Error 401
I am using DSPy framework (v2.6.4) which uses liteLLM (v1.63.7) to connect to LLMs.
While connecting Azure OpenAI via liteLLM (v1.63.7) using the below method (Azure AD Token Refresh - ...
0
votes
0
answers
66
views
PandasAI Error in VS Code but Works in Google Colab
I'm using pandasai with AzureOpenAI to analyze a DataFrame. My code works fine in Google Colab but throws an error in VS Code.
Code:
import pandasai as pai
from pandasai_openai import AzureOpenAI
...
0
votes
1
answer
309
views
Additional headers are not being sent to the Azure OpenAI through Weaviate SDK
I am using the Python SDK to connect to Weaviate. Additionally, I am create collection that is able to send requests to Azure OpenAI, using text2vec-azure-openai module. To use that, I am sending X-...
1
vote
1
answer
431
views
Azure OpenAI Phi-4-multimodel-instruct: 'auto' tool choice error when using runTools() method that worked with GPT-4o
I recently switched from using GPT-4o to Phi-4-multimodel-instruct in my Next.js application using Azure AI services, but I'm encountering the following error:
BadRequestError: 400 {"object":...
0
votes
2
answers
271
views
How can I resolve the 403 Forbidden error when deploying a fine-tuned GPT model in Azure via Python?
I follow Azure's tutorial on fine-tuning GPT. I'm stuck at the deployment phase.
Code:
# Deploy fine-tuned model
import json
import requests
token = '[redacted]'
subscription = '[redacted]'
...
0
votes
1
answer
403
views
The API deployment for this resource does not exist. If you created the deployment within the last 5 minutes, please wait a moment and try again
I am creating a POC for a Chat bot using Azure OpenAI and .Net Core.
I have created Open AI resource and Deployment Model in Azure.
But I am getting error when calling Completechat() function.
Error : ...
3
votes
1
answer
540
views
Azure AI Search - Tag Scoring profile on azureopenai extra_body
I created an index on Azure AI Search and connected it to Azure OpenAI using the extra_body. It works perfectly. However, I created a default scoring profile for my index, which boosts documents ...
1
vote
1
answer
315
views
My AzureOpenAI batch job shows complete, but only a few rows (not all rows) are completed in the batch file without any error
I have tried uploading a batch job with unique custom_id for each row in my input file. The job gets validated but completes very quickly and once I check the job only 276/4096 (as shown in the ...
2
votes
1
answer
309
views
Is there a way to disable content filteration in Azure OpenAi or to process the prompt before using it [closed]
I’m working on a report generation using some rubric’s and a conversation string containing questions and answers for the questions from the user which is captured through openai real-time api. But ...
0
votes
1
answer
381
views
How to Reduce OpenAI Azure Response Time for Structured Output Using GPT-4o Mini (Fine-Tuned Model)?
I am using an Azure OpenAI GPT-4o Mini fine-tuned model to generate structured responses (e.g., JSON format). However, the response time is higher than expected, and I am looking for ways to optimize ...
-1
votes
1
answer
85
views
I am not able to create the embeddings of my fields titleVector and contentVector
I used AzureOpenAI Vectorizer to create embeddings in my search index but it is not created in search index why while uploadind documents i only uploaded id,content,title and titleVector and ...
0
votes
1
answer
331
views
409 conflict while updating multiple properties of terraform azapi_resource
We are using terraform's az_api resource to create Azure openai content filter(rai_policy).
While updating the contentFilters property for multiple resources , we are seeing below error
**RESPONSE 409:...
0
votes
2
answers
592
views
How to use Azure AI OpenAI to generate responses from a model using a set of data inputs?
I’m trying to use Azure AI OpenAI to generate responses from a trained model based on a set of data I’ll provide as part of the prompt. My goal is to pass a set of data (e.g., metrics, statistics, or ...
0
votes
0
answers
136
views
Autogen groupchat and Society of Mind - Issue in making framework with added complexity
I am trying to create 3 step framework where 1. data quality issue gets reported , 2. user reviewes the data quality and 3. approves to execute agents for DQ corrections in database.
I have three ...
0
votes
1
answer
279
views
Azure Devops Pull Request Review Integration with AzureOpenAI instance
We are using Azure Devops Server 2022 on-premise and we have a private Azure OpenAI instance enabled with a private endpoint. In VSTS marketplace, we were able to find many extensions:
https://...
3
votes
1
answer
2k
views
Calculating total cost of OpenAI reponses with cached input tokens
Should I subtract the number of cached input tokens from the number of input tokens when I receive the 'Usage' object back in the OpenAI response?
I'm trying to calculate the total cost of the ...