Showing 1061–1080 of 1502 insights
| Title | Episode | Published | Category | Domain | Tool Type | Preview |
|---|---|---|---|---|---|---|
| Vectorized RAG Pipeline | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | A simple vectorized Retrieval-Augmented Generation pipeline can outperform complex engineered agent solutions for certain AI tasks by efficiently retr... |
| Enterprise AI Compliance | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Guaranteeing that no code will be leaked is a systematic strategy when pitching AI coding tools to enterprises to address data-security concerns. |
| Seat-to-Consumption Model | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Transition SaaS pricing from per-seat licensing to a consumption-based, agentic model where customers pay for AI executions rather than user seats. |
| CAC Payback Calculation | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Frontend | - | Compute CAC payback by dividing total sales and marketing costs per customer by the average customer lifetime value to estimate months to recoup acqui... |
| Tool Overload Testing | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Use an agent evaluation methodology by overloading an LLM with 30–40 distinct tools to observe its decision-making and tool-selection accuracy under h... |
| Low-Rank Adaptation Use | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Apply low-rank adaptation (LoRA) for very cheap, narrow-task fine-tuning when absolutely necessary to specialize a general model. |
| Eval Set Optimization | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | If you have a high-quality evaluation set, you can iterate on prompts and inference strategies instead of fine-tuning the base model to achieve great ... |
| Domain-Specific Fine-Tuning | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Train or fine-tune large models on specialized domain corpora—like medical literature—to create agents with deep, expert-level knowledge in that field... |
| Embedded Tool Calling | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Instead of relying solely on external dev tooling, embed tool-calling capabilities directly within the base AI model so it can act as an "intellectual... |
| Tool-Use Synthetic Pipeline | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | A pipeline gathering real developer MCP examples to generate vast synthetic tool-calling data, judged by an LLM rubric and refined via reinforcement l... |
| Synthetic Data RL | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Models at Google DeepMind generate their own synthetic data via reinforcement learning to extend token limits and advance capabilities without externa... |
| Sparse MoE for Efficiency | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Moonshot’s trillion-parameter model uses a mixture-of-experts sparse attention design that activates only 32 billion parameters at once, demonstrating... |
| Agent-based architecture pivot | Ep 8 (Audio Only) | 7/18/2025 | Frameworks | Ai-development | - | Many AI stacks are "pivoted to agents," suggesting building AI systems centered around autonomous agent frameworks. |
| Multi-Layered Memory Architecture | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Apply a multi-layered approach to agent memory—drawing on MEM0 and MM OS papers—to structure long-term and short-term memory in AI agents. |
| Vector Conversion Pipelines | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Leverage specialized services like Pinetone to automate data conversion pipelines into vector space with algorithmic variations tailored to each probl... |
| Local model metadata extraction | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Use smaller local models to extract key metadata or summaries from documents to handle tasks without requiring large-scale vector storage. |
| Domain-specific vectorization | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Vectorize and store embeddings only for narrow domain data sets (e.g., per-property JSON with 500 fields) to achieve better performance than prompt en... |
| Two-Stage Retrieval Pipeline | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Implement a two-step RAG pipeline by first running an embeddings-based similarity search to get a pointer, then executing a SQL or graph query to fetc... |
| Graph-Based Memory Retrieval | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Ai-development | - | Use a graph database like Neo4J to represent LLM memory and accelerate retrieval of contextual or social relationship data for tasks such as user pref... |
| Vector vs Graph RAG | Ep 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath. | 7/18/2025 | Frameworks | Devops | - | Combine vector-based semantic clustering with graph-based relationships to leverage cosine similarity and entity connections in your augmented generat... |
© 2025 The Build. All rights reserved.
Privacy Policy