Bob Belderbos: Ask the Canon: Semantic Search Without a Vector Database

Original Article Summary
Running a semantic search over 100 classics on a droplet with embeddings, NumPy, and np.argsort. No Pinecone, no API calls at query time. Starting simple.
Read full article at Belderbos.devâ¨Our Analysis
Bob Belderbos' implementation of semantic search without a vector database, as outlined in his blog post "Ask the Canon: Semantic Search Without a Vector Database", demonstrates a significant approach to simplifying complex search queries. By utilizing embeddings, NumPy, and np.argsort to run semantic searches over 100 classics on a droplet, Belderbos showcases an innovative method that eliminates the need for vector databases like Pinecone and API calls at query time. This development has important implications for website owners, particularly those with large volumes of text-based content. By adopting similar techniques, website owners can improve their site's search functionality without incurring the costs and complexities associated with traditional vector database solutions. Moreover, this approach can help website owners better understand and manage AI bot traffic, as they can more easily analyze and optimize their site's search queries. To take advantage of this innovation, website owners can consider the following actionable tips: first, explore the use of embeddings and NumPy to enhance their site's search capabilities; second, investigate alternatives to traditional vector databases, such as the approach outlined by Belderbos; and third, review their llms.txt files to ensure they are optimized for semantic search queries, allowing them to better manage AI bot traffic and improve overall site performance.
Related Topics
Track AI Bots on Your Website
See which AI crawlers like ChatGPT, Claude, and Gemini are visiting your site. Get real-time analytics and actionable insights.
Start Tracking Free â

