docrag

Author	SHA1	Message	Date
Z User	b811162f78	Implement tool calling loop for LLM - Pass all registered tools to LLM during chat completion - Handle tool_calls from LLM response - Execute tools and feed results back to LLM - Loop until LLM returns final response - Updated system prompt to encourage tool use - Updated streaming to handle tool calls - Increased MAX_TOOL_ITERATIONS to 5	2026-03-29 16:07:56 +00:00
Z User	eabdadfb62	Implement full DocRAG server with OpenAI-compatible API Features: - FastAPI server with OpenAI-compatible endpoints (/v1/chat/completions, /v1/models) - RAG system with document processing and vector storage - Support for multiple document formats (PDF, DOCX, HTML, text, code) - Streaming response support - Tool integration with website_downloader - Document management API endpoints - GLM-4.7-Flash integration via z-ai-web-dev-sdk - Works transparently with Open WebUI and other OpenAI clients Components: - main.py: FastAPI application with OpenAI-compatible API - rag/: RAG system (document processor, vector store, retriever) - tools/: Tool manager with website_downloader integration - .env.example: Configuration template	2026-03-29 00:57:37 +00:00
Z User	aa69b2f496	Add website downloader tool wrapper for GLM-4.7-Flash - Create website_downloader_tool.py with OpenAI function calling schema - Add comprehensive tool documentation - Update README with usage examples - Update requirements.txt with optional sdk dependency	2026-03-29 00:16:54 +00:00
butterfly	c02c032c67	Initial commit	2026-03-28 15:51:14 -07:00

4 Commits