标签： Python

AI 资讯

ModuleNotFoundError: No module named ‘auto_gptq’

用户在 Windows 系统上使用 oobabooga TextGen WebUI 时，尝试加载 yarn-mistral-7b-128k.Q4_K_M.gguf 模型（GGUF 格式）后出现报错。用户下载的是 CPU 版本安装包，但模型加载过程中意外调用了 auto_gptq 模块，最终因找不到

celebrityanime
2026年 6月 21日

AI 资讯

bug: instructions parameter (OpenAI Responses API) not captured/displayed properly

用户在 Langfuse 中通过以下两种方式使用 OpenAI Responses API 时触发：

celebrityanime
2026年 6月 21日

AI 资讯

Bug: OpenAI Responses API tracing does not include tools in generation input

用户通过 from langfuse.openai import openai 或 from langfuse.openai import AsyncOpenAI 创建客户端，并调用 client.responses.create(...) 时触发。在 Chat Completions 接口下 to

celebrityanime
2026年 6月 21日

AI 资讯

Session ID not present in memory store should return 404, not 400

在运行基于 MCP Python SDK 构建的 Streamable HTTP 服务器时，客户端连接时若提供的 Mcp-Session-Id 在服务器内存存储中不存在（例如服务器重启导致 session 丢失），本应返回 404 让客户端重新初始化，但服务器返回了 400。这会阻止客户端正确恢复连

celebrityanime
2026年 6月 21日

AI 资讯

Dead code path in MCPServer._handle_call_tool and incorrect call_tool return type

用户在 MCP Python SDK 中使用 MCPServer.call_tool 方法，或工具函数通过 @server.tool() 装饰器注册时触发。该问题已在源码层面被诊断并修复，但未合入前某些分支逻辑和返回类型注解仍然是错误的。

celebrityanime
2026年 6月 21日

AI 资讯

[Bug]: Hardcoded RERANK_LIMIT logic causes API failures (400) and ignores UI Top-K settings

用户在使用 RAGFlow v0.24.0 官方镜像时，配置了 Chatbot 并启用 Reranker（例如 Cohere 或 vLLM 托管的 BGE 模型）。在 UI 中将 Top-N ( page_size ) 设置为 6，Top-K 设置为较低的值（如 10）后，执行查询时触发 400 错

celebrityanime
2026年 6月 21日

AI 资讯

ValueError: Incompatible keys detected:

用户在运行 FluxLoraLoaderMixin.lora_state_dict() 时触发此问题，该函数用于将 kohya/sd-scripts 格式的 FLUX LoRA 转换为 Diffusers 兼容格式。问题在 `diffusers==0.38.0` 和当前 `main` 分支上均可复现

celebrityanime
2026年 6月 21日

AI 资讯

[Bug]: Gemma4-31B-it deployed on vLLM cannot process images in tool message

用户在 vLLM 上部署 Gemma4-31B-it 模型，通过 OpenAI 兼容 API（ /v1/chat/completions ）发送包含图片的 tool message 请求时，服务端返回 HTTP 500 Internal Server Error。环境为 Ubuntu 24.04 +