| name | description |
|---|---|
| rag-retrieve | RAG retrieval skill for querying and retrieving relevant documents from knowledge base. Use this skill when users need to search documentation, retrieve knowledge base articles, or get context from a vector database. Supports semantic search with configurable top-k results. |
# RAG Retrieve
## Skill Structure

This is a self-contained skill package that can be distributed independently. The skill includes its own scripts and configuration:

```
rag-retrieve/
├── SKILL.md             # Core instruction file (this file)
├── skill.yaml           # Skill metadata
└── scripts/             # Executable scripts
    └── rag_retrieve.py  # Main RAG retrieval script
```
## Overview
Query and retrieve relevant documents from a RAG (Retrieval-Augmented Generation) knowledge base using vector search. This skill provides semantic search capabilities with support for multiple bot instances and configurable result limits.
## Required Parameters
Before executing any retrieval, you MUST confirm the following required parameters with the user if they are not explicitly provided:
| Parameter | Description | Type |
|---|---|---|
| query | Search query content | string |
## Optional Parameters
| Parameter | Description | Type | Default |
|---|---|---|---|
| top_k | Maximum number of results | integer | 100 |
## Confirmation Template

When the required parameter is missing, ask the user:

```
I need some information to perform the RAG retrieval:
1. Query: What would you like to search for?
```
## Quick Start

Use the `scripts/rag_retrieve.py` script to execute RAG queries:

```bash
scripts/rag_retrieve.py --query "your search query"
```
## Usage Examples

### Basic Query

```bash
scripts/rag_retrieve.py --query "How to configure authentication?"
```

### Search with Specific Top-K

```bash
scripts/rag_retrieve.py --query "API error handling" --top-k 50
```
## Common Use Cases

### Scenario 1: Documentation Search

```bash
scripts/rag_retrieve.py --query "deployment guide"
```

### Scenario 2: Troubleshooting

```bash
scripts/rag_retrieve.py --query "connection timeout error"
```

### Scenario 3: Feature Information

```bash
scripts/rag_retrieve.py --query "enterprise pricing plans"
```
## Script Usage

### rag_retrieve.py

Main script for executing RAG retrieval queries.

```bash
scripts/rag_retrieve.py [OPTIONS]
```

Options:

| Option | Required | Description | Default |
|---|---|---|---|
| `--query`, `-q` | Yes | Search query content | - |
| `--top-k`, `-k` | No | Maximum number of results | 100 |
Examples:

```bash
# Basic query
scripts/rag_retrieve.py --query "authentication setup"

# Custom top-k
scripts/rag_retrieve.py --query "API reference" --top-k 20
```
## Common Workflows

### Research Mode: Comprehensive Search

```bash
scripts/rag_retrieve.py --query "machine learning algorithms" --top-k 100
```

### Quick Answer Mode: Focused Search

```bash
scripts/rag_retrieve.py --query "password reset" --top-k 10
```

### Comparison Mode: Multiple Queries

```bash
# Search for related topics
scripts/rag_retrieve.py --query "REST API" --top-k 30
scripts/rag_retrieve.py --query "GraphQL API" --top-k 30
```
## Resources

### scripts/rag_retrieve.py

Executable Python script for RAG retrieval. Handles:

- HTTP requests to the RAG API
- Authentication token generation
- Configuration file loading
- Error handling and reporting
- Markdown response parsing

The script can be executed directly without loading into context.
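The script's internals are not reproduced here; as a rough sketch only, the documented command-line interface could be expressed with `argparse` like this (`build_parser` is a hypothetical name, and the real script may parse options differently):

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented CLI options."""
    parser = argparse.ArgumentParser(description="RAG retrieval query")
    parser.add_argument("--query", "-q", required=True,
                        help="Search query content")
    parser.add_argument("--top-k", "-k", type=int, default=100,
                        help="Maximum number of results")
    return parser


# Example: parse a documented invocation
args = build_parser().parse_args(["--query", "authentication setup"])
```

With this shape, `--top-k` is optional and defaults to 100, matching the options table above.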
## Retrieval Policy (Priority & Fallback)

### 1. Retrieval Source Priority

- If earlier context does not explicitly specify a knowledge retrieval priority, the default order is: skill-enabled knowledge retrieval tools > `rag_retrieve` / `table_rag_retrieve` > local filesystem retrieval (including `datasets/` and any file browsing/search tools).
- Follow this Retrieval Policy (Priority & Fallback) section for retrieval source selection, tool selection order, query rewrite, `top_k`, result handling, fallback, and citation requirements.
- The local filesystem is the lowest-priority source. Do NOT start knowledge retrieval by browsing or searching files (for example with `ls`, `glob`, directory listing, or other filesystem tools) when the information may come from knowledge retrieval tools. Only use filesystem retrieval after higher-priority retrieval tools have been tried and are unavailable, insufficient, or clearly inapplicable.
### 2. Tool Selection

- When knowledge retrieval is needed and no higher-priority skill-enabled retrieval tool is specified or available, you MUST start with `rag_retrieve` or `table_rag_retrieve` based on the question type. Do NOT answer from model knowledge before trying the appropriate retrieval tool.
- Use `table_rag_retrieve` first for values, prices, quantities, inventory, specifications, rankings, comparisons, summaries, extraction, lists, tables, person / project / product name lookup, historical coverage, mixed questions, or any unclear case.
- Use `rag_retrieve` first only for clearly pure concept / definition / workflow / policy / explanation questions that do not need structured data.
### 3. Query Preparation
- Do NOT pass the user's raw question directly unless it already fits retrieval needs well.
- Rewrite the query to improve recall: extract the core entity, time scope, attributes, and intent.
- Add meaningful variants such as synonyms, aliases, abbreviations, related titles, historical names, and category terms.
- Expand enumeration-style, historical, roster, timeline, overview, archive, extraction, and list-style queries more aggressively.
- Preserve the original meaning and do not introduce unrelated topics. Use both the original query and rewritten variants whenever possible.
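The rewrite guidance above can be sketched as a small helper. This is illustrative only: `build_query_variants` and its synonyms table are hypothetical names, not part of the skill, and real rewriting would also extract entities, time scope, and intent.

```python
def build_query_variants(query: str, synonyms: dict[str, list[str]]) -> list[str]:
    """Return the original query first, followed by variants that swap in
    synonyms, aliases, or abbreviations without changing the meaning."""
    variants = [query]
    for term, alternatives in synonyms.items():
        if term in query:
            for alt in alternatives:
                variants.append(query.replace(term, alt))
    # Deduplicate while keeping the original query in front
    seen: set[str] = set()
    ordered = []
    for variant in variants:
        if variant not in seen:
            seen.add(variant)
            ordered.append(variant)
    return ordered
```

Passing both the original query and the generated variants to the retrieval tool follows the "use both" rule above.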
### 4. Retrieval Breadth (top_k)

- `top_k` applies to `rag_retrieve`. Use the smallest sufficient `top_k` and expand only when coverage is insufficient.
- Use `30` for simple fact lookup about one specific thing.
- Use `50` for moderate synthesis, comparison, summarization, or disambiguation.
- Use `100` for broad-recall queries needing high coverage, such as comprehensive analysis, scattered knowledge, multiple entities or periods, list / catalog / timeline / roster / overview requests, or all items / historical succession / many records.
- Raise `top_k` when query rewrite produces many useful keyword branches or when results are too few, repetitive, incomplete, sparse, or too narrow in coverage. Do not raise `top_k` just because the query is longer.
- Expansion sequence: `30 -> 50 -> 100`. If uncertain, prefer `100`.
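The expansion sequence can be sketched as a tiny helper (`next_top_k` is a hypothetical name used only for illustration):

```python
def next_top_k(current: int) -> int:
    """Step up through the documented sequence 30 -> 50 -> 100.
    100 is the cap, so it is returned once the sequence is exhausted."""
    for step in (30, 50, 100):
        if step > current:
            return step
    return 100
```

A caller would retry with `next_top_k(current)` only when the previous result's coverage was judged insufficient, per the rules above.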
### 5. Result Evaluation

- Treat the result as insufficient when it is empty, starts with `Error:`, says `no excel files found`, is off-topic, does not match the user's core entity / scope, or clearly contains no usable evidence.
- Treat the result as insufficient when it only covers part of the user's request, or when the user asked for a complete list, historical coverage, comparison, or mixed data + explanation but the result is only partial or truncated.
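The mechanical parts of these checks can be sketched as follows; `is_insufficient` is a hypothetical name, and relevance and coverage judgments (off-topic, partial answers) still require reading the result and are deliberately not handled here:

```python
def is_insufficient(result: str) -> bool:
    """Mechanical insufficiency checks from the evaluation rules:
    empty result, explicit error marker, or the known no-data message.
    Off-topic and partial-coverage cases need judgment, not string checks."""
    text = result.strip()
    if not text:
        return True
    if text.startswith("Error:"):
        return True
    if "no excel files found" in text.lower():
        return True
    return False
```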
### 6. Fallback and Sequential Retry

- If the first retrieval tool returns empty results, errors, clearly irrelevant content, or only partial coverage of the user's request, you MUST try the other retrieval tool before replying to the user.
- If the table result is empty, continue with `rag_retrieve` before concluding that no relevant data exists.
- You may say that no relevant information was found only after both `rag_retrieve` and `table_rag_retrieve` have been tried and still do not provide enough evidence to answer.
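The fallback rule amounts to a simple try-then-retry shape. This is a sketch under stated assumptions: `retrieve_with_fallback` is a hypothetical name, the tools are modeled as plain callables returning strings, and only the mechanical empty/error checks trigger the fallback here:

```python
from typing import Callable


def retrieve_with_fallback(query: str,
                           primary: Callable[[str], str],
                           secondary: Callable[[str], str]) -> str:
    """Try the preferred retrieval tool first; if its result is empty or
    an error, try the other tool before concluding nothing was found."""
    result = primary(query)
    if not result.strip() or result.startswith("Error:"):
        result = secondary(query)
    return result
```

In practice "insufficient" also covers irrelevant or partial results, which the agent must judge before deciding to retry.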
### 7. Table Result Handling

- When processing `table_rag_retrieve` results, follow all instructions in the `[INSTRUCTION]` and `[EXTRA_INSTRUCTION]` sections of the response.
- If the query result hint indicates truncation (for example, `Only the first N rows are included; the remaining M rows were omitted`), you MUST explicitly tell the user the total matches (N+M), displayed count (N), and omitted count (M).
- Cite data sources using file names from `file_ref_table` in the response.
### 8. Citation Requirements for Retrieved Knowledge

- When your answer uses learned knowledge from `rag_retrieve` or `table_rag_retrieve`, you MUST generate `<CITATION ... />` tags.
- Follow the specific citation format instructions returned by each tool.
- Citations MUST appear IMMEDIATELY AFTER the paragraph or bullet list that uses the knowledge.
- NEVER collect all citations and place them at the end of your response.
- Limit to 1-2 citations per paragraph or bullet list, combining related facts under one citation when possible.
- If your answer uses learned knowledge, you MUST generate at least one `<CITATION ... />` tag in the response.
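As an illustration of the placement rule, each citation follows the paragraph it supports rather than being collected at the end (the tag attributes are returned by each tool and elided here):

```markdown
The service supports single sign-on via the documented identity providers.
<CITATION ... />

Deployment requires the configuration file described in the setup guide.
<CITATION ... />
```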