Showing 1061–1080 of 1502 insights
TitleEpisodePublishedCategoryDomainTool TypePreview
Vectorized RAG PipelineEp 8 (Audio Only)7/18/2025FrameworksAi-development-
A simple vectorized Retrieval-Augmented Generation pipeline can outperform complex engineered agent solutions for certain AI tasks by efficiently retr...
Enterprise AI ComplianceEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Guaranteeing that no code will be leaked is a systematic strategy when pitching AI coding tools to enterprises to address data-security concerns.
Seat-to-Consumption ModelEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Transition SaaS pricing from per-seat licensing to a consumption-based, agentic model where customers pay for AI executions rather than user seats.
CAC Payback CalculationEp 8 (Audio Only)7/18/2025FrameworksFrontend-
Compute CAC payback by dividing total sales and marketing costs per customer by the average customer lifetime value to estimate months to recoup acqui...
Tool Overload TestingEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Use an agent evaluation methodology by overloading an LLM with 30–40 distinct tools to observe its decision-making and tool-selection accuracy under h...
Low-Rank Adaptation UseEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Apply low-rank adaptation (LoRA) for very cheap, narrow-task fine-tuning when absolutely necessary to specialize a general model.
Eval Set OptimizationEp 8 (Audio Only)7/18/2025FrameworksAi-development-
If you have a high-quality evaluation set, you can iterate on prompts and inference strategies instead of fine-tuning the base model to achieve great ...
Domain-Specific Fine-TuningEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Train or fine-tune large models on specialized domain corpora—like medical literature—to create agents with deep, expert-level knowledge in that field...
Embedded Tool CallingEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Instead of relying solely on external dev tooling, embed tool-calling capabilities directly within the base AI model so it can act as an "intellectual...
Tool-Use Synthetic PipelineEp 8 (Audio Only)7/18/2025FrameworksAi-development-
A pipeline gathering real developer MCP examples to generate vast synthetic tool-calling data, judged by an LLM rubric and refined via reinforcement l...
Synthetic Data RLEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Models at Google DeepMind generate their own synthetic data via reinforcement learning to extend token limits and advance capabilities without externa...
Sparse MoE for EfficiencyEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Moonshot’s trillion-parameter model uses a mixture-of-experts sparse attention design that activates only 32 billion parameters at once, demonstrating...
Agent-based architecture pivotEp 8 (Audio Only)7/18/2025FrameworksAi-development-
Many AI stacks are "pivoted to agents," suggesting building AI systems centered around autonomous agent frameworks.
Multi-Layered Memory ArchitectureEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Apply a multi-layered approach to agent memory—drawing on MEM0 and MM OS papers—to structure long-term and short-term memory in AI agents.
Vector Conversion PipelinesEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Leverage specialized services like Pinetone to automate data conversion pipelines into vector space with algorithmic variations tailored to each probl...
Local model metadata extractionEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Use smaller local models to extract key metadata or summaries from documents to handle tasks without requiring large-scale vector storage.
Domain-specific vectorizationEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Vectorize and store embeddings only for narrow domain data sets (e.g., per-property JSON with 500 fields) to achieve better performance than prompt en...
Two-Stage Retrieval PipelineEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Implement a two-step RAG pipeline by first running an embeddings-based similarity search to get a pointer, then executing a SQL or graph query to fetc...
Graph-Based Memory RetrievalEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksAi-development-
Use a graph database like Neo4J to represent LLM memory and accelerate retrieval of contextual or social relationship data for tasks such as user pref...
Vector vs Graph RAGEp 8 - Kimi2, Is RAG still a thing? and the coming SaaS bloodbath.7/18/2025FrameworksDevops-
Combine vector-based semantic clustering with graph-based relationships to leverage cosine similarity and entity connections in your augmented generat...
PreviousPage 54 of 76Next