Learn how to upload and process documents with Vedaya
chunk_size
: Number of characters in each chunk (default: 500)overlap_pct
: Percentage of overlap between chunks (default: 10%)pypdf2
: Fast and lightweight (default)pdfplumber
: Better for complex layoutspymupdf
: High performance with advanced featurespymupdf4llm
: Optimized for language model processing