Retrieval and RAG API

The Retrieval and RAG API allows you to query your knowledge base, retrieve relevant document chunks, and generate answers using Retrieval Augmented Generation (RAG).

Query Knowledge Base (POST)

Query the knowledge base and retrieve relevant chunks.

POST /api/retrieval/query

Request Body

ParameterTypeDescriptionDefault
querystringThe search queryRequired
vector_dbstringVector database to use”pinecone”
top_kintegerNumber of results to return3
modelstringModel to use for generating answers”gpt-4”

Response

Returns relevant document chunks and potentially a generated answer.

Query Knowledge Base (GET)

Get relevant chunks from the knowledge base without generating an answer.

GET /api/retrieval/query

Query Parameters

ParameterTypeDescriptionDefault
querystringThe search queryRequired
vector_dbstringVector database to use”pinecone”
top_kintegerNumber of results to return3

Response

Returns relevant document chunks without generating an answer.

RAG Query

Handle RAG query requests compatible with the RAG pipeline frontend.

POST /api/chatbot/retrieval/query

Request Body

ParameterTypeDescriptionDefault
querystringThe search queryRequired
vector_dbstringVector database to use”pinecone”
top_kintegerNumber of results to return3
modelstringModel to use for generating answers”gpt-4”

Response

Returns a structured response with:

  • query: The original query
  • answer: The generated answer
  • chunks: Array of retrieved chunks with:
    • id: Chunk ID
    • text: Chunk text
    • score: Relevance score
    • source: Source document
    • entities: Array of entity mentions (optional)
  • total_chunks_searched: Number of chunks searched
  • retrieval_time_ms: Time taken for retrieval in milliseconds

Simple Chatbot Query

Simple query endpoint for chatbot functionality.

GET /api/chatbot/query/

Query Parameters

ParameterTypeDescription
query_strstringThe search query
ragbooleanWhether to use RAG
topkintegerNumber of results to return

Response

Returns a response from the chatbot, optionally using RAG if specified.