AI Strategy: Why We Moved from Local Llama to OpenAI
A pragmatic analysis of the costs, performance, and complexity of running local LLMs versus cloud providers, and why we adopted a hybrid architecture.
How we use Model Context Protocol to create a closed-loop design system where agents verify implementation against Figma specs automatically.
How to extract entities (NER) from documents using LLMs and model them effectively in FalkorDB for knowledge graphs.
Techniques for rewriting user queries (HyDE, query expansion) and reranking results to boost retrieval accuracy in Retrieval-Augmented Generation.
Implement effective text chunking strategies for RAG pipelines with semantic boundaries, overlap, and metadata preservation.
Extract named entities from documents using a spaCy NLP service to build knowledge graphs and improve search relevance.
A practical cluster setup guide for running BlueRobin-style agentic services with reliable data, messaging, and observability foundations.
Revolutionizing cluster operations by replacing kubectl with natural language interaction using the Kubernetes MCP Server.
Deploy and configure Qdrant for production semantic search with collection management, filtering, and high availability on Kubernetes.
Implement a complete Retrieval-Augmented Generation pipeline that combines semantic search with local LLM inference for intelligent document Q&A.
Build a powerful hybrid search system that combines vector embeddings with traditional keyword search for comprehensive document retrieval.
Bridging the gap between design and development by using the Model Context Protocol (MCP) to automate UI generation from Figma to Blazor.
Create a production-ready embedding pipeline in .NET that generates vector embeddings with Ollama and stores them in Qdrant for semantic search.
Learn how to integrate Docling, an AI-powered document understanding library, into your .NET application for high-quality OCR with layout preservation.