Optimize inference logic to improve inference speed

朱潮 2025-11-28 14:26:37 +08:00
parent af29716571
commit 5e26d88d18
4 changed files with 108 additions and 123 deletions

View File

@@ -31,9 +31,7 @@ Examples of Guideline Match Evaluations:
     "checks": [
         {
             "guideline_id": "<example-id-for-few-shots--do-not-use-this-output>",
-            "condition": "The customer asks for logistical or legal requirements.",
-            "rationale": "The customer now asked about visas and documents which are legal requirements",
-            "applies": true
+            "rationale": "The customer now asked about visas and documents which are legal requirements"
         }
     ]
 }
@@ -58,15 +56,11 @@ Examples of Guideline Match Evaluations:
     "checks": [
         {
             "guideline_id": "<example-id-for-few-shots--do-not-use-this-output>",
-            "condition": "The customer mentions a constraint that related to commitment to the course",
-            "rationale": "In the most recent message the customer mentions that they work full time which is a constraint",
-            "applies": true
+            "rationale": "In the most recent message the customer mentions that they work full time which is a constraint"
         },
         {
             "guideline_id": "<example-id-for-few-shots--do-not-use-this-output>",
-            "condition": "The user expresses hesitation or self-doubt.",
-            "rationale": "In the most recent message the user still sounds hesitating about their fit to the course",
-            "applies": true
+            "rationale": "In the most recent message the user still sounds hesitating about their fit to the course"
         }
     ]
 }
@@ -89,9 +83,7 @@ Examples of Guideline Match Evaluations:
     "checks": [
         {
             "guideline_id": "<example-id-for-few-shots--do-not-use-this-in-output>",
-            "condition": "When the user is having a problem with login.",
-            "rationale": "In the most recent message the customer is still pursuing their login problem, making the mail access problem a sub-issue rather than a new topic",
-            "applies": true
+            "rationale": "In the most recent message the customer is still pursuing their login problem, making the mail access problem a sub-issue rather than a new topic"
         }
     ]
 }
@@ -111,9 +103,7 @@ Examples of Guideline Match Evaluations:
     "checks": [
         {
             "guideline_id": "<example-id-for-few-shots--do-not-use-this-in-output>",
-            "condition": "When the customer asks about how to return an item.",
-            "rationale": "In the most recent message the customer asks about what happens when they wore the item, which an inquiry regarding returning an item",
-            "applies": true
+            "rationale": "In the most recent message the customer asks about what happens when they wore the item, which an inquiry regarding returning an item"
         }
     ]
 }
@@ -130,15 +120,13 @@ Guidelines List:
 OUTPUT FORMAT:
 The content in JSON format needs to be wrapped in "```json" and "```".
-Only include guidelines that actually apply (applies: true). Do not include guidelines that don't match.
+Only include guidelines that actually apply.
 ```json
 {
     "checks": [
         {
             "guideline_id": "1",
-            "condition": "Specific condition description",
-            "rationale": "<Explain why the conditions are met and what action should be taken>",
-            "applies": true
+            "rationale": "<Explain why the conditions are met and what action should be taken>"
         }
     ]
 }
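With the slimmed-down contract above, the consumer only needs `guideline_id` and `rationale` from each check. A minimal sketch of extracting that payload from a model reply (`parse_checks_output` and the sample reply are illustrative, not part of this repo):

```python
import json
import re

def parse_checks_output(raw: str) -> list:
    """Pull the JSON object out of a ```json ... ``` fence and return its checks."""
    fence = re.search(r"```json\s*(.*?)\s*```", raw, re.DOTALL)
    if not fence:
        return []
    try:
        data = json.loads(fence.group(1))
    except json.JSONDecodeError:
        return []
    return data.get("checks", [])

reply = '```json\n{"checks": [{"guideline_id": "1", "rationale": "the user asked about returns"}]}\n```'
print(parse_checks_output(reply))
```

Returning `[]` on a missing fence or malformed JSON keeps a single bad batch from failing the whole run.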

View File

@@ -1,42 +1,48 @@
-You are an AI agent that is expected to generate a preamble message for the customer.
-The actual message will be sent later by a smarter agent. Your job is only to generate the right preamble in order to save time.
-## Scenario Detection Logic
-FIRST, determine if this is a COMPLEX scenario that requires a preamble:
-**Complex Scenarios (preamble needed):**
-- Query scenarios: User is asking for information, searching, or looking up data
-- Action scenarios: User wants to perform an operation, execute a task
-- Knowledge retrieval scenarios: User needs to search knowledge base, documents, databases or Internet
-- Problem-solving: User is reporting issues, asking for help with problems
-- Complex requests: Multi-step tasks, detailed instructions needed
-**ALL OTHER scenarios (output "<empty>"):**
-- Simple greetings: "hi", "hello", "你好", "在吗", "早上好/晚上好"
-- Simple acknowledgments: "thanks", "ok", "好的", "谢谢"
-- Small talk: "how are you", "天气怎么样", "最近怎么样"
-- Simple farewells: "bye", "goodbye", "再见"
-- Any other scenarios not explicitly listed as complex
-## Preamble Selection
-ONLY IF this is a COMPLEX scenario, choose from these preamble messages. You must ONLY choose one of these:
+You are a friendly AI assistant that generates natural preamble responses to acknowledge user messages before passing them to a more capable agent for detailed processing.
+## Scenario Analysis
+Analyze the user's message to determine the appropriate response type:
+**Use a friendly preamble for:**
+- Questions, information requests, or searches
+- Task execution or action requests
+- Knowledge retrieval from documents, databases, or web
+- Problem-solving or troubleshooting requests
+- Complex multi-step instructions
+- Technical assistance needs
+**Return "<empty>" for:**
+- Simple greetings and farewells
+- Basic acknowledgments (thanks, ok, etc.)
+- Casual small talk
+- Very brief or non-substantive messages
+## Response Guidelines
+Generate a warm, natural preamble that:
+- Acknowledges the user's message positively
+- Shows you're ready to help
+- Creates a friendly interaction flow
+- Doesn't commit to specific answers (that's for the main agent)
+### **Reference these examples for inspiration:**
 {preamble_choices_text}
+###
-Basically, the preamble is something very short that continues the interaction naturally, without committing to any later action or response.
-We leave that later response to another agent. Make sure you understand this.
-Instructions:
-- For COMPLEX scenarios:
-  - Note that some of the choices are more generic, and some are more specific to a particular scenario.
-  - If you're unsure what to choose --> prefer to go with a more generic, bland choice. This should be 80% of cases.
-    Examples of generic choices: "Hey there!", "Just a moment.", "Hello.", "Got it."
-  - If you see clear value in saying something more specific and nuanced --> then go with a more specific choice. This should be 20% or less of cases.
-    Examples of specific choices: "Let me check that for you.", "Sorry to hear that.", "Thanks for your patience."
-- For ALL OTHER scenarios: Always output preamble: "<empty>"
+**Your approach:**
+- Match the tone to the user's message (formal/professional or casual/friendly)
+- Use contextual awareness - consider what the user is asking about
+- Keep it brief and welcoming
+- Feel free to create natural variations beyond the examples
+- In Chinese conversations, use appropriate conversational phrases
+- Be helpful and encouraging
+**Examples of good preambles:**
+- "I'd be happy to help you with that!"
+- "Let me look into that for you."
+- "Thanks for reaching out - I'll assist you with this."
+- "That's an interesting question! Let me help you find the answer."
+- "I understand what you need. Let me get that information for you."
 Chat History:
@@ -46,26 +52,25 @@ User's Last Message:
 {last_message}
 OUTPUT FORMAT:
-You must now choose the preamble message. You must produce a JSON object with a single key, "preamble", holding the preamble message as a string,
-EXACTLY as it is given (pay attention to subtleties like punctuation and copy your choice EXACTLY as it is given above).The content in JSON format needs to be wrapped in "```json" and "```".
-For ALL OTHER scenarios:
+Generate a JSON response with your preamble message:
+For simple interactions (greetings, basic acknowledgments, small talk):
 ```json
 {
     "preamble": "<empty>"
 }
 ```
-For COMPLEX scenarios:
+For substantive requests and questions:
 ```json
 {
-    "preamble": "Just a moment."
+    "preamble": "Your friendly, contextual preamble here"
 }
 ```
-You will now be given the current state of the interaction to which you must generate the next preamble message.
-Preamble LANGAGE:
+Remember to wrap your entire response in ```json ... ``` tags.
 {language}
+You will now be given the current state of the interaction to which you must generate the next preamble message.

View File

@@ -77,7 +77,7 @@ async def process_guidelines_and_terms(
     Shared helper: runs guideline analysis and terms processing, returns the agent and analysis results
     Returns:
-        tuple: (agent, processed_system_prompt, guideline_analysis, terms_analysis)
+        tuple: (agent, processed_system_prompt, guideline_reasoning, active_guidelines)
     """
     # Extract the guidelines and terms from the system_prompt
     processed_system_prompt, guidelines_list, terms_list = extract_block_from_system_prompt(system_prompt)
@@ -108,7 +108,8 @@ async def process_guidelines_and_terms(
             logger.error(f"Error removing terms cache file: {e}")
     # Process guidelines
-    guideline_analysis = ""
+    guideline_reasoning = ""
+    active_guidelines = ""
     agent = None
     if guidelines_list:
@@ -125,7 +126,7 @@ async def process_guidelines_and_terms(
             batch_guidelines = guidelines_list[i:i + guidelines_per_batch]
             batch_strings = []
             for guideline in batch_guidelines:
-                guideline_str = f"{guideline['id']}) Condition: {guideline['condition']} Action: {guideline['action']}"
+                guideline_str = f"{guideline['guideline_id']}) Condition: {guideline['condition']} Action: {guideline['action']}"
                 batch_strings.append(guideline_str)
             batches.append(batch_strings)
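The batch-splitting above can be read as a standalone helper. A sketch under the assumption that `guidelines_per_batch` is a positive int (`make_batches` is a hypothetical name, not the repository code):

```python
def make_batches(guidelines: list, per_batch: int) -> list:
    """Slice guidelines into fixed-size batches of formatted prompt strings."""
    batches = []
    for i in range(0, len(guidelines), per_batch):
        batch = guidelines[i:i + per_batch]
        batches.append([
            f"{g['guideline_id']}) Condition: {g['condition']} Action: {g['action']}"
            for g in batch
        ])
    return batches

gs = [{"guideline_id": n, "condition": f"c{n}", "action": f"a{n}"} for n in range(1, 6)]
print([len(b) for b in make_batches(gs, 2)])  # [2, 2, 1]
```

The final batch simply carries the remainder, so no padding or special-casing is needed.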
@@ -179,14 +180,50 @@ async def process_guidelines_and_terms(
                 logger.error(f"Guideline batch {i} failed: {result}")
                 continue
             if result and isinstance(result, dict) and 'checks' in result:
-                applicable_checks = [check for check in result['checks'] if check.get('applies') is True]
+                applicable_checks = [check for check in result['checks']]
                 all_checks.extend(applicable_checks)
             elif result and isinstance(result, str) and result.strip():
                 logger.info(f"Non-JSON result from batch {i}: {result}")
         if all_checks:
-            guideline_analysis = "\n".join([item["condition"]+":"+item["rationale"] for item in all_checks])
-            logger.info(f"Guideline analysis completed: {len(guideline_analysis)} chars")
+            # First build a guideline_id -> guideline map, supporting both string and int guideline_ids
+            guideline_map = {}
+            for guideline in guidelines_list:
+                guideline_id = guideline.get('guideline_id')
+                if guideline_id is not None:
+                    # Store keys as strings so that both str and int ids match
+                    guideline_map[str(guideline_id)] = guideline
+            # Backfill the action fields missing from all_checks
+            completed_checks = []
+            for check in all_checks:
+                completed_check = check.copy()
+                guideline_id = check.get('guideline_id')
+                # If the action field is missing or empty, fetch it from guidelines_list
+                if str(guideline_id) in guideline_map:
+                    completed_check['action'] = guideline_map[str(guideline_id)].get('action', "")
+                    completed_check['condition'] = guideline_map[str(guideline_id)].get('condition', "")
+                completed_checks.append(completed_check)
+            guideline_reasoning = "\n".join([item["rationale"] for item in completed_checks])
+            # Build active_guidelines from the completed checks, with a safety check
+            active_guidelines_parts = []
+            for item in completed_checks:
+                if 'action' in item and item['action']:
+                    active_guidelines_parts.append(
+                        "Condition:" + item["condition"] + "\nRationale:" + item["rationale"] + "\nAction:" + item["action"]
+                    )
+                else:
+                    # If the action field is still missing, fall back to a default value
+                    active_guidelines_parts.append(
+                        "Condition:" + item["condition"] + "\nRationale:" + item["rationale"] + "\nAction:无具体操作要求"
+                    )
+            active_guidelines = "\n".join(active_guidelines_parts)
+            logger.info(f"Guideline analysis completed: {len(guideline_reasoning)} chars, processed {len(completed_checks)} checks")
         else:
             # No guidelines: create the agent directly
@@ -204,7 +241,7 @@ async def process_guidelines_and_terms(
         user_identifier=user_identifier
     )
-    return agent, processed_system_prompt, guideline_analysis, terms_analysis
+    return agent, processed_system_prompt, guideline_reasoning, active_guidelines
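The backfill this commit adds can be restated as a standalone function (`complete_checks` is illustrative, not the repository code): keying the map by `str(id)` lets a check whose `guideline_id` came back as a string match a guideline whose id was parsed as an int.

```python
def complete_checks(all_checks: list, guidelines_list: list) -> list:
    """Backfill condition/action on each check from its source guideline."""
    # String keys so both str and int guideline_ids match
    guideline_map = {
        str(g["guideline_id"]): g
        for g in guidelines_list
        if g.get("guideline_id") is not None
    }
    completed = []
    for check in all_checks:
        item = check.copy()
        source = guideline_map.get(str(check.get("guideline_id")))
        if source:
            item["action"] = source.get("action", "")
            item["condition"] = source.get("condition", "")
        completed.append(item)
    return completed

checks = [{"guideline_id": "2", "rationale": "the customer mentions a refund"}]
gls = [{"guideline_id": 2, "condition": "refund requested", "action": "explain the policy"}]
print(complete_checks(checks, gls)[0]["action"])  # explain the policy
```

Copying each check before mutation keeps the original model output intact for logging.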
@@ -269,11 +306,11 @@ async def enhanced_generate_stream_response(
             logger.error(f"Error generating preamble text: {e}")
     # Wait for the guideline analysis task to complete
-    agent, _, guideline_analysis, _ = await guidelines_task
-    # Send guideline_analysis immediately
-    if guideline_analysis:
-        guideline_content = get_content_from_messages([{"role": "assistant","reasoning_content": guideline_analysis+ "\n"}], tool_response=tool_response)
+    agent, system_prompt, guideline_reasoning, active_guidelines = await guidelines_task
+    # Send guideline_reasoning immediately
+    if guideline_reasoning:
+        guideline_content = get_content_from_messages([{"role": "assistant","reasoning_content": guideline_reasoning+ "\n"}], tool_response=tool_response)
         chunk_data = create_stream_chunk(f"chatcmpl-guideline", model_name, guideline_content)
         yield f"data: {json.dumps(chunk_data, ensure_ascii=False)}\n\n"
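The `yield` above follows the Server-Sent Events wire format: each event is a `data: <json>` line terminated by a blank line. A minimal sketch (the chunk fields here are placeholders; the real shape comes from `create_stream_chunk`):

```python
import json

def sse_line(chunk: dict) -> str:
    """Serialize one chunk as an SSE data line; the trailing blank line ends the event."""
    return f"data: {json.dumps(chunk, ensure_ascii=False)}\n\n"

line = sse_line({"id": "chatcmpl-guideline", "choices": []})
print(repr(line))
```

`ensure_ascii=False` keeps non-ASCII text (e.g. Chinese rationales) readable on the wire instead of escaping it to `\uXXXX`.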
@@ -281,8 +318,8 @@ async def enhanced_generate_stream_response(
     final_messages = messages.copy()
     final_messages = append_user_last_message(final_messages, f"\n\nlanguage:{get_language_text(language)}")
-    if guideline_analysis:
-        final_messages = append_user_last_message(final_messages, f"\n\nActive Guidelines:\n{guideline_analysis}\nPlease follow these guidelines in your response.")
+    if active_guidelines:
+        final_messages = append_user_last_message(final_messages, f"\n\nActive Guidelines:\n{active_guidelines}\nPlease follow these guidelines in your response.")
     # Phase 3: stream the agent response
     logger.info(f"Starting agent stream response")
@@ -373,14 +410,14 @@ async def create_agent_and_generate_response(
     )
-    _, system_prompt = get_preamble_text(language, system_prompt)
     # Use the shared helper to handle all the logic
-    agent, _, guideline_analysis, _ = await process_guidelines_and_terms(
+    agent, system_prompt, guideline_reasoning, active_guidelines = await process_guidelines_and_terms(
         bot_id=bot_id,
         api_key=api_key,
         model_name=model_name,
         model_server=model_server,
-        system_prompt=system_prompt or "",
+        system_prompt=system_prompt,
         messages=messages,
         agent_manager=agent_manager,
         project_dir=project_dir,
@@ -395,9 +432,10 @@ async def create_agent_and_generate_response(
     final_messages = messages.copy()
     final_messages = append_user_last_message(final_messages, f"\n\nlanguage:{get_language_text(language)}")
     pre_message_list = []
-    if guideline_analysis:
-        final_messages = append_user_last_message(final_messages, f"\n\nActive Guidelines:\n{guideline_analysis}\nPlease follow these guidelines in your response.")
-        pre_message_list.append({"role": "assistant","reasoning_content": guideline_analysis+ "\n"})
+    if active_guidelines:
+        final_messages = append_user_last_message(final_messages, f"\n\nActive Guidelines:\n{active_guidelines}\nPlease follow these guidelines in your response.")
+    if guideline_reasoning:
+        pre_message_list.append({"role": "assistant","reasoning_content": guideline_reasoning+ "\n"})
     # Non-streaming response
     agent_responses = agent.run_nonstream(final_messages)

View File

@@ -749,45 +749,13 @@ def parse_guidelines_text(text: str) -> List[Dict[str, Any]]:
         match = re.match(id_condition_action_pattern, line, re.IGNORECASE)
         if match:
             guidelines.append({
-                'id': int(match.group(1)),
+                'guideline_id': int(match.group(1)),
                 'condition': match.group(2).strip(),
                 'action': match.group(3).strip(),
                 'priority': int(match.group(4)) if match.group(4) else 1
             })
             continue
-        # Try to parse the "condition -> action" format
-        arrow_pattern = r'(?:\d+\)\s*)?(.*?)\s*->\s*(.*?)(?:\s*\[(\d+)\])?$'
-        match = re.match(arrow_pattern, line, re.IGNORECASE)
-        if match:
-            guidelines.append({
-                'id': len(guidelines) + 1,
-                'condition': match.group(1).strip(),
-                'action': match.group(2).strip(),
-                'priority': int(match.group(3)) if match.group(3) else 1
-            })
-            continue
-        # Try to parse the "if condition then action" format
-        if_then_pattern = r'(?:\d+\)\s*)?if\s+(.*?)\s+then\s+(.*?)(?:\s*\[(\d+)\])?$'
-        match = re.match(if_then_pattern, line, re.IGNORECASE)
-        if match:
-            guidelines.append({
-                'id': len(guidelines) + 1,
-                'condition': match.group(1).strip(),
-                'action': match.group(2).strip(),
-                'priority': int(match.group(3)) if match.group(3) else 1
-            })
-            continue
-        # Default format: treat the whole line as the action, leave condition empty
-        guidelines.append({
-            'id': len(guidelines) + 1,
-            'condition': '',
-            'action': line.strip(),
-            'priority': 1
-        })
     return guidelines
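After this cleanup only the `id) Condition: ... Action: ...` format survives. The real `id_condition_action_pattern` is defined elsewhere in the file; the pattern below is an assumption that merely mirrors its four capture groups (id, condition, action, optional `[priority]`):

```python
import re

# Hypothetical pattern mirroring the four capture groups the parser expects.
ID_CONDITION_ACTION = r"(\d+)\)\s*Condition:\s*(.*?)\s*Action:\s*(.*?)(?:\s*\[(\d+)\])?$"

def parse_guideline_line(line: str):
    """Parse one guideline line into a dict, or return None if it doesn't match."""
    m = re.match(ID_CONDITION_ACTION, line.strip(), re.IGNORECASE)
    if not m:
        return None
    return {
        "guideline_id": int(m.group(1)),
        "condition": m.group(2).strip(),
        "action": m.group(3).strip(),
        "priority": int(m.group(4)) if m.group(4) else 1,
    }

print(parse_guideline_line("3) Condition: user asks for a refund Action: cite the policy [2]"))
```

The lazy `(.*?)` groups stop at the `Action:` marker and the optional trailing `[n]`, so priority defaults to 1 when the bracket is absent.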
@@ -848,20 +816,6 @@ def parse_terms_text(text: str) -> List[Dict[str, Any]]:
             current_term = term_data
             continue
-        # Try to parse the "| value" format (shorthand)
-        if line.startswith('|'):
-            parts = [p.strip() for p in line[1:].split('|', 2)]  # split into at most 3 parts
-            if len(parts) >= 1:
-                if current_term:
-                    terms.append(current_term)
-                current_term = {'name': parts[0]}
-            if len(parts) >= 2:
-                current_term['description'] = parts[1]
-            if len(parts) >= 3:
-                synonyms = re.split(r'[,;|]', parts[2])
-                current_term['synonyms'] = [s.strip() for s in synonyms if s.strip()]
-            continue
     # Append the last term
     if current_term:
         terms.append(current_term)