In the Pinecone docs, you'll find information on using Pinecone through a client or our REST APIs. Our quickstart guide explains how you can get a production-ready similarity search service up and running in minutes. You'll also find troubleshooting help and answers to frequently asked questions.
Below, you'll find a summary of Pinecone.
We explain the key concepts, the workflow, example use cases, the reasons to use Pinecone, supported indexes, and deployment options.
Pinecone is a managed similarity search service that enables you to add vector search to your applications. This is something that traditional databases don't tend to do well.
You can quickly search for objects, such as images, audio files and documents, that are similar to each other.
Pinecone indexes and searches vector representations of data to find items that are similar to the query. You can index billions of items in real time and search for the closest matches with millisecond latency.
Our Learn section explains the basics of vector databases and similarity search as a service.
Similarity search is a new method of searching through big data. Unlike traditional search methods, it indexes and searches vector representations of data to find items in close proximity to the query.
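To make "close proximity" concrete, here is a minimal sketch of an exact (brute-force) similarity search in plain Python. It is not Pinecone code: the `nearest` function, the item ids, and the three-dimensional vectors are all made up for illustration, and a real service would avoid scanning every item.

```python
import math

def nearest(query, items, top_k=2):
    """Return the top_k (id, distance) pairs closest to the query.

    Every stored vector is compared with the query, which is exact
    but becomes slow as the collection grows.
    """
    scored = [(item_id, math.dist(query, vec)) for item_id, vec in items.items()]
    scored.sort(key=lambda pair: pair[1])  # smallest distance first
    return scored[:top_k]

# Hypothetical embeddings, invented for this example.
items = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "kitten.jpg": [0.8, 0.2, 0.1],
    "truck.jpg": [0.0, 0.1, 0.9],
}

results = nearest([0.88, 0.1, 0.0], items)
for item_id, distance in results:
    print(item_id, round(distance, 3))
```

The two cat images rank ahead of the truck because their vectors sit closer to the query vector.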
Vector embeddings, or “vectors,” are sets of floating-point numbers that represent objects, such as images and documents. They are often generated by machine learning (ML) models trained to capture the semantic similarity of objects. Deep learning models almost always use vectors.
To use Pinecone, you need vector embeddings. That means finding an embedding model and running it somewhere.
Want to start with working examples? See: Example Applications
Example use cases of similarity search include:
Build semantic text search into your applications. After converting text data into vector embeddings using an NLP transformer (e.g., a sentence embedding model), you can store, index, and search through those vectors using Pinecone.
Create an image similarity search backend service.
You can transform image data into vector embeddings and build an index with Pinecone to store these vector embeddings. This enables you to send a new image as a query and retrieve similar images from the index.
Build an audio search application.
The vector embeddings are rich, mathematical representations of the audio recordings. They make it possible to determine how similar recordings are to one another by comparing their vectors.
This enables you to:
- Find songs and metadata within a catalog, based on a sample
- Find similar sounds in an audio library
- Detect who's speaking in an audio file
- Take some new (unseen) audio recordings and search through the index to find the most similar matches, along with their YouTube links.
Build a question answering application.
You can index a set of questions and retrieve the most similar stored questions for a new (unseen) question. This enables you to link a new question to answers you might already have.
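The question-matching idea above can be sketched in a few lines. This is only an illustration under loud assumptions: `embed` is a toy word-count function standing in for a real sentence embedding model, the `faq` entries are invented, and the lookup is a plain in-memory scan rather than a Pinecone index.

```python
import math
import re
from collections import Counter

def embed(text):
    """Toy stand-in for an embedding model: a sparse word-count vector."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[word] * b[word] for word in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical stored questions with the answers you already have.
faq = {
    "How do I reset my password?": "Use the 'Forgot password' link on the sign-in page.",
    "How do I delete my account?": "Open Settings and choose 'Delete account'.",
}

# "Index" the stored questions by embedding each one once.
index = {question: embed(question) for question in faq}

def answer(new_question):
    """Return the most similar stored question and its answer."""
    query = embed(new_question)
    best = max(index, key=lambda q: cosine(query, index[q]))
    return best, faq[best]

match, reply = answer("I forgot my password, how can I reset it?")
print(match)
print(reply)
```

Word counts only match on shared vocabulary. A trained embedding model would also match questions that are phrased with different words.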
You can generate product recommendations for ecommerce customers based on previous orders and trending items.
The key steps are:
From there, you can:
- Fetch data
- Insert more data or update existing vectors
- Manage the index
- Manage data
- List namespaces
- Delete the index
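To show what these operations mean, here is a small in-memory sketch. `ToyIndex` and all of its method names are hypothetical and exist only for this example; it is not the Pinecone client and does not reflect its API.

```python
import math
from collections import defaultdict

class ToyIndex:
    """Hypothetical in-memory stand-in for a vector index."""

    def __init__(self):
        # namespace -> {item id -> vector}
        self._data = defaultdict(dict)

    def upsert(self, item_id, vector, namespace=""):
        """Insert a new vector, or overwrite the one stored under item_id."""
        self._data[namespace][item_id] = list(vector)

    def fetch(self, item_id, namespace=""):
        """Return the stored vector, or None if the id is unknown."""
        return self._data.get(namespace, {}).get(item_id)

    def delete(self, item_id, namespace=""):
        """Remove a vector; unknown ids are ignored."""
        self._data.get(namespace, {}).pop(item_id, None)

    def list_namespaces(self):
        """Return the names of all namespaces that hold at least one vector."""
        return sorted(ns for ns, items in self._data.items() if items)

    def query(self, vector, top_k=1, namespace=""):
        """Return the ids of the top_k stored vectors closest to `vector`."""
        items = self._data.get(namespace, {})
        ranked = sorted(items, key=lambda item_id: math.dist(vector, items[item_id]))
        return ranked[:top_k]

index = ToyIndex()
index.upsert("a", [0.0, 1.0])
index.upsert("b", [1.0, 0.0])
index.upsert("a", [0.1, 0.9])                      # update an existing vector
index.upsert("c", [0.5, 0.5], namespace="drafts")  # separate namespace

print(index.fetch("a"))
print(index.query([0.0, 1.0], top_k=1))
print(index.list_namespaces())
index.delete("b")
print(index.fetch("b"))
```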
Using Pinecone means no more hassles of benchmarking and tuning algorithms or building and maintaining infrastructure for vector search.
Key benefits of Pinecone:
- Managed: Launch a service and make API calls — leave infrastructure to us.
- Scalable: Each service is persistent, consistent, sharded, and replicated.
- Centralized: Load vector embeddings in streams or batches from your models, data lakes, or feature stores.
- Fresh: New and updated items become searchable in milliseconds.
- Fast: Low latency even with billions of items.
- Accurate: Our fast algorithms are more accurate than open-source options.
You must declare the index type and distance metric when you create a new index.
You can use different types of metrics in your vector index:
- Euclidean: This is used to calculate the distance between two data points in a plane. It is one of the most commonly used distance metrics. For an example, see our image similarity search example.
- Cosine: This is often used to find similarities between different documents. The advantage is that the scores are normalized to the [-1, 1] range (or [0, 1] when no vector component is negative).
- Dot product: This multiplies two vectors component by component and sums the results. You can use it to tell how similar the two vectors are. The more positive the answer is, the closer the two vectors are in terms of their directions.
Depending on your application, some metrics have better recall and precision performance than others. For more information, see: What is Vector Similarity Search?
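As a quick reference, the three metrics can be written out in plain Python. This is a minimal sketch using the standard mathematical definitions; the function names and sample vectors are ours, not part of any Pinecone API.

```python
import math

def euclidean_distance(a, b):
    """Straight-line distance; 0 means the vectors are identical."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dot_product(a, b):
    """Sum of component-wise products; grows with both alignment and length."""
    return sum(x * y for x, y in zip(a, b))

def cosine_similarity(a, b):
    """Dot product divided by both lengths, so only direction matters."""
    norm_a = math.sqrt(dot_product(a, a))
    norm_b = math.sqrt(dot_product(b, b))
    return dot_product(a, b) / (norm_a * norm_b)

a = [1.0, 0.0]
b = [2.0, 0.0]  # same direction as a, twice the length
c = [0.0, 3.0]  # perpendicular to a

for name, other in (("b", b), ("c", c)):
    print(
        name,
        euclidean_distance(a, other),
        dot_product(a, other),
        cosine_similarity(a, other),
    )
```

Note that `a` and `b` point the same way, so their cosine similarity is the maximum possible even though their Euclidean distance is not zero. Which behavior you want depends on whether vector length carries meaning in your embeddings.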
Approximate nearest neighbor search
The approximated engine uses approximate search algorithms developed by Pinecone; it is fast and highly accurate.
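To illustrate the general idea of approximate search, here is a sketch of one well-known generic technique, random-hyperplane hashing. It is not Pinecone's algorithm, which this page does not describe; the `HyperplaneLSH` class, its parameters, and the random test data are all assumptions made for the example.

```python
import math
import random
from collections import defaultdict

class HyperplaneLSH:
    """Generic random-hyperplane hashing sketch (illustrative only)."""

    def __init__(self, dim, num_planes=4, seed=0):
        rng = random.Random(seed)
        # Each plane is a random direction through the origin.
        self.planes = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_planes)]
        self.buckets = defaultdict(list)  # hash key -> [(id, vector)]

    def _key(self, vector):
        # One bit per plane: which side of the plane the vector falls on.
        return tuple(
            sum(p * v for p, v in zip(plane, vector)) >= 0 for plane in self.planes
        )

    def add(self, item_id, vector):
        self.buckets[self._key(vector)].append((item_id, vector))

    def query(self, vector, top_k=1):
        # Only vectors in the query's bucket are compared, so a true neighbor
        # that landed in another bucket can be missed. That is the
        # speed-versus-accuracy trade-off of approximate search.
        candidates = self.buckets.get(self._key(vector), [])
        ranked = sorted(candidates, key=lambda pair: math.dist(vector, pair[1]))
        return [item_id for item_id, _ in ranked[:top_k]]

rng = random.Random(1)
vectors = {i: [rng.gauss(0, 1) for _ in range(8)] for i in range(200)}

lsh = HyperplaneLSH(dim=8)
for item_id, vec in vectors.items():
    lsh.add(item_id, vec)

hits = lsh.query(vectors[7], top_k=3)
print(hits)
```

Each query is compared only with the vectors in one bucket instead of all 200. Production systems typically use several hash tables or other index structures to recover neighbors that a single bucket would miss.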
Pinecone can be deployed in these ways:
- Multi-tenant environment, an API call away. Try it for free.
- Specify regions and Availability Zones (AZs), and we’ll spin up dedicated clusters in our VPC on AWS/GCP. Connect via AWS PrivateLink or Google Private Access. Contact us for pricing.
- Run Pinecone in your own AWS/GCP VPC and grant permissions to Pinecone. Contact us for pricing.
Go to the quickstart guide to get a production-ready similarity search service up and running in minutes.