10 Commits

Author SHA1 Message Date
DrMelone
26524efcee base64 warn 2026-03-07 21:33:14 +01:00
Classic
db1ca1a342 docs: performance best practices, scaling, and tool calling legacy/native
- Mark Default (prompt-based) tool calling as legacy; recommend Native Mode
  - Document KV cache breaking in Default Mode, system tools only in Native
  - Updated comparison table with Status, KV Cache, System Tools rows
- Add Content Extraction Engine section (pypdf memory leak warning, Tika/Docling)
- Add Embedding Engine section (SentenceTransformers RAM warning at scale)
- Add Common Anti-Patterns section with 6 real-world scaling mistakes
- Add Redis Tuning subsection (timeout, maxclients, single instance sufficiency)
- Expand Profile 3 with content extraction, embeddings, tool calling, Redis
- New Step 6 in scaling guide: Fix Content Extraction & Embeddings
- Quick reference table: add Ext. Content Extraction and Ext. Embeddings columns
- Add CONTENT_EXTRACTION_ENGINE and RAG_EMBEDDING_ENGINE to minimum env vars
2026-02-28 21:31:11 +01:00
Claude
68c7feca10 Add cross-links between scaling guide and troubleshooting docs
- scaling.md: Link each step to relevant troubleshooting sections
  (DB corruption, WebSocket errors, login loops, worker crashes,
  file access issues, logging, OpenTelemetry)
- multi-replica.mdx: Add scaling guide link in intro and Related Docs,
  plus links to Redis and RAG troubleshooting
- performance.md: Add scaling guide link in Scaling Infrastructure section
- redis.md: Add scaling guide link in "When is Redis Required?"
- connection-error.mdx: Add links to Redis tutorial, scaling guide,
  and multi-replica WebSocket troubleshooting
- Helm.md: Add scaling guide link alongside existing HA guide link

https://claude.ai/code/session_01TPoquFdHG6dZxRrZ4Jormh
2026-02-16 13:02:57 +00:00
Claude
c270f4b934 Document ChromaDB SQLite limitation for multi-worker/multi-replica deployments
ChromaDB's default local PersistentClient uses SQLite which is not fork-safe.
When uvicorn forks multiple workers, concurrent writes crash workers instantly.
Added warnings and guidance across env config, HA/scaling, performance,
troubleshooting, Docker Swarm, Helm, Redis, RAG, and enterprise architecture docs.

https://claude.ai/code/session_01TPoquFdHG6dZxRrZ4Jormh
2026-02-16 11:04:51 +00:00
DrMelone
8edb316819 HTTPS moved 2026-02-14 11:37:32 +01:00
DrMelone
583d65e13b slugs 2026-02-14 01:19:41 +01:00
DrMelone
c10c8d15ec complete docs overhaul 2026-02-14 01:02:31 +01:00
DrMelone
6ecf9275e4 nginx 2026-02-04 18:41:45 +01:00
DrMelone
e005a46a51 NGINX CRITICAL BUFFERING CACHE PROXY 2026-01-11 18:04:38 +01:00
DrMelone
1357ee1f7a update 2026-01-11 13:33:56 +01:00