Tag: rag
7 discussions across 5 posts tagged "rag".
AI Signal - May 05, 2026
-
Claude Code skill that builds knowledge graphs of entire codebases using Leiden community detection, giving Claude persistent memory at 71x fewer tokens per query vs reading raw files. Viral success (450k+ downloads, ~40k GitHub stars) demonstrates demand for better codebase context management. People building on top without the author's involvement.
AI Signal - April 14, 2026
-
A clear architectural distinction between traditional RAG (linear: query → search → respond) and agentic RAG (non-linear: aggregator agent plans, delegates to specialized sub-agents for local data, APIs, web search, then synthesizes). The post is practical, includes a concrete architecture diagram in prose, and is directly relevant to anyone building production retrieval systems that need to handle complex, multi-source queries.
AI Signal - April 07, 2026
-
Built from Karpathy's workflow, the Graphify tool compiles raw folders into structured knowledge graphs, achieving 71.5× token reduction. Instead of reloading raw files every session, it creates a queryable wiki structure that Claude Code can navigate efficiently.
AI Signal - March 17, 2026
- I used Obsidian as a persistent brain for Claude Code and built a full open source tool over a weekend. r/ClaudeAI Score: 622
A practical approach to giving Claude Code persistent memory using Obsidian as a knowledge base. The author built custom commands and agent personas that reference a structured vault, enabling Claude to maintain context across sessions. The setup will be open-sourced, offering a blueprint for others to implement persistent agent memory.
AI Signal - March 03, 2026
- A 16-problem RAG failure map that LlamaIndex just adopted (semantic firewall, MIT, step-by-step examples) r/LlamaIndex Score: 7
The author published a structured failure-mode checklist for RAG systems covering 16 reproducible failure categories — and LlamaIndex adopted it into their official RAG troubleshooting docs. The post walks through each failure mode with concrete LlamaIndex examples. For anyone building production RAG pipelines, this is a structured diagnostic tool worth bookmarking.
-
Onyx is a self-hostable AI chat platform supporting any LLM, with built-in support for custom agents, knowledge source connections, and hybrid search/retrieval workflows. This is squarely in the intersection of self-hosted AI and RAG interests — a production-grade platform, not a toy demo.
- I made an open source one image debug poster for RAG failures. Feel free to just take it and use it r/OpenSourceAI Score: 5
A single-image RAG debugging reference that can be uploaded directly into any LLM alongside a failing run to get structured diagnostic suggestions — no install required. The "upload to LLM" use pattern is a clever zero-friction distribution mechanism for debugging tools.