Deep Dives
Tutorials and in-depth explainers on AI tools and concepts.
Pinecone Picks
Making Retrieval Augmented Generation Fast
Retrieval Augmented Generation (RAG) is the go-to method for adding external knowledge to Large Language Models (LLMs). RAG with agents can be slow, but we can make it much faster using NVIDIA NeMo Guardrails. We explain how here.
10 min read
Llama 2: AI Developers Handbook
Llama 2 is the latest Large Language Model (LLM) from Meta AI. It has been released as an open-access model, making it available to corporations and open-source hackers alike. Here we learn how to use it with Hugging Face and LangChain, and as a conversational agent.
7 min read
The Missing WHERE Clause in Vector Search
Vector similarity search makes massive datasets searchable in fractions of a second. Yet despite the brilliance and utility of this technology, the problems that seem most straightforward are often the hardest to solve. Filtering is one of them.
12 min read
Evaluation Measures in Information Retrieval
Evaluation of information retrieval (IR) systems is critical to making well-informed design decisions. From search to recommendations, evaluation measures are paramount to understanding what does and does not work in retrieval.
14 min read