Pinecone overview

Pinecone makes it easy to provide long-term memory for high-performance AI applications. It’s a managed, cloud-native vector database with a simple API and no infrastructure hassles. Pinecone serves fresh, filtered query results with low latency at the scale of billions of vectors.

Vector embeddings provide long-term memory for AI

Applications that involve large language models, generative AI, and semantic search rely on vector embeddings, a type of data that represents semantic information. This information allows AI applications to gain understanding and maintain a long-term memory that they can draw upon when executing complex tasks.

Vector databases store and query embeddings quickly and at scale

Vector databases like Pinecone offer optimized storage and querying capabilities for embeddings. Traditional scalar-based databases can’t keep up with the complexity and scale of such data, making it difficult to extract insights and perform real-time analysis. Vector indexes like FAISS lack useful features that are present in any database. Vector databases combine the familiar features of traditional databases with the optimized performance of vector indexes.

Pinecone indexes store records with vector data

Each record in a Pinecone index contains a unique ID and an array of floats representing a dense vector embedding.

Pinecone record diagram

Indexes may also contain a sparse vector embedding for hybrid search and metadata key-value pairs for filtered queries.

Pinecone queries are fast and fresh

Pinecone returns low-latency, accurate results for indexes with billions of vectors. Queries reflect up-to-the-second updates such as upserts and deletes. Filter by namespaces and metadata to improve query performance.

Upsert and query vector embeddings with the Pinecone API

Specify the distance metric your index uses to evaluate vector similarity, along with dimensions, cloud provider, and region.

from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_API_KEY")

pc.create_index(
  name="docs-overview-index",
  dimension=8,
  metric="cosine",
  spec=ServerlessSpec(
    cloud="aws",
    region="us-east-1"
  )
)

Perform CRUD operations and query your vectors using HTTP, Python, Node.js, or Java.

index = pc.Index(index_name)

index.upsert(
    vectors=[
        {"id": "vec1", "values": [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.1]},
        {"id": "vec2", "values": [0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2]},
        {"id": "vec3", "values": [0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3]},
        {"id": "vec4", "values": [0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4]}
    ],
    namespace="ns1"
)

index.upsert(
    vectors=[
        {"id": "vec5", "values": [0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5, 0.5]},
        {"id": "vec6", "values": [0.6, 0.6, 0.6, 0.6, 0.6, 0.6, 0.6, 0.6]},
        {"id": "vec7", "values": [0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7]},
        {"id": "vec8", "values": [0.8, 0.8, 0.8, 0.8, 0.8, 0.8, 0.8, 0.8]}
    ],
    namespace="ns2"
)

Find the top k most similar vectors, or query by ID.

index.query(
    namespace="ns1",
    vector=[0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3],
    top_k=3,
    include_values=True
)

index.query(
    namespace="ns2",
    vector=[0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7],
    top_k=3,
    include_values=True
)

# Returns:
# {'matches': [{'id': 'vec3',
#               'score': 0.0,
#               'values': [0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3, 0.3]},
#              {'id': 'vec4',
#               'score': 0.0799999237,
#               'values': [0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4, 0.4]},
#              {'id': 'vec2',
#               'score': 0.0800000429,
#               'values': [0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2, 0.2]}],
#  'namespace': 'ns1',
#  'usage': {'read_units': 6}}
# {'matches': [{'id': 'vec7',
#               'score': 0.0,
#               'values': [0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7, 0.7]},
#              {'id': 'vec8',
#               'score': 0.0799999237,
#               'values': [0.8, 0.8, 0.8, 0.8, 0.8, 0.8, 0.8, 0.8]},
#              {'id': 'vec6',
#               'score': 0.0799999237,
#               'values': [0.6, 0.6, 0.6, 0.6, 0.6, 0.6, 0.6, 0.6]}],
#  'namespace': 'ns2',
#  'usage': {'read_units': 6}}

Get started

Go to the quickstart guide to get a production-ready vector search service up and running in minutes.

Getting started

Organizations

Projects

Indexes

Data

Operations

Vector embeddings provide long-term memory for AI

Vector databases store and query embeddings quickly and at scale

Pinecone indexes store records with vector data

Pinecone queries are fast and fresh

Upsert and query vector embeddings with the Pinecone API

Get started

Getting started

Organizations

Projects

Indexes

Data

Operations

​Vector embeddings provide long-term memory for AI

​Vector databases store and query embeddings quickly and at scale

​Pinecone indexes store records with vector data

​Pinecone queries are fast and fresh

​Upsert and query vector embeddings with the Pinecone API

​Get started

Vector embeddings provide long-term memory for AI

Vector databases store and query embeddings quickly and at scale

Pinecone indexes store records with vector data

Pinecone queries are fast and fresh

Upsert and query vector embeddings with the Pinecone API

Get started