Deeper dive
Embedding version bumps shipped behind a flag so teams could A/B perplexity vs old index without user-visible downtime.
Employees find the right clause in seconds — embeddings refresh on every policy publish, with access control at query time.
Keyword search failed across PDFs and wiki pages; legal waited on ops to forward the correct attachment.
Chunked ingestion with deterministic IDs, nightly embedding refresh pipelines, and hybrid retrieval reranked for policy freshness.
Pinecone namespaces per division; ACL metadata filters applied server-side before LLM summarization.
Connectors for SharePoint + Confluence with checksum dedupe.
Batch jobs with backoff; poison documents quarantined automatically.
Answer cards show clause IDs for audit-friendly references.
Thumbs-down routes to content owners with suggested edits.
Underwriters stopped mailing PDFs internally—findability became as reliable as the ERP for numbers.
Embedding version bumps shipped behind a flag so teams could A/B perplexity vs old index without user-visible downtime.