
Pinecone and Amazon Web Services (AWS)

Bring scalable Gen AI applications to market cost-efficiently

Klarna · HubSpot · ClickUp · Shortwave · Bain & Company · GoDaddy · Gong · Accenture · CVS Health · Course Hero · Midjourney

Building with Pinecone and AWS

Pinecone and Amazon Web Services (AWS) empower you to build highly performant, scalable, and reliable production-ready Gen AI applications with ease.

Amazon Bedrock for Pinecone

Amazon Bedrock provides access to pre-trained foundation models through an API, allowing users to experiment with different foundation models and easily fine-tune and augment them.

With this integration, AWS customers can quickly build search and Gen AI applications using a workflow called Retrieval Augmented Generation (RAG). RAG powered by Pinecone grounds model output in your own data, delivering relevant, accurate, and fast responses to end users.
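
For illustration, here is a minimal RAG sketch in Python. It assumes a Pinecone index named "docs" already populated with Titan-embedded chunks whose text is stored under a "text" metadata field, plus Bedrock access to the Titan embedding and Claude models; it is a sketch under those assumptions, not the official integration flow.

```python
import json

import boto3
from pinecone import Pinecone

# Assumptions (hypothetical): index "docs" holds Titan-embedded chunks with
# their text in a "text" metadata field; Bedrock access is enabled for the
# Titan embedding model and Claude in this region.
bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")
index = Pinecone(api_key="YOUR_PINECONE_API_KEY").Index("docs")


def embed(text: str) -> list[float]:
    # Embed the query with the same model used at indexing time.
    resp = bedrock.invoke_model(
        modelId="amazon.titan-embed-text-v1",
        body=json.dumps({"inputText": text}),
    )
    return json.loads(resp["body"].read())["embedding"]


def answer(question: str) -> str:
    # Retrieve the most relevant chunks, then ground the generation in them.
    results = index.query(vector=embed(question), top_k=3, include_metadata=True)
    context = "\n".join(m.metadata["text"] for m in results.matches)
    prompt = (
        f"\n\nHuman: Use this context to answer.\n{context}\n\n"
        f"Question: {question}\n\nAssistant:"
    )
    resp = bedrock.invoke_model(
        modelId="anthropic.claude-v2",
        body=json.dumps({"prompt": prompt, "max_tokens_to_sample": 300}),
    )
    return json.loads(resp["body"].read())["completion"]
```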

Learn more

Amazon SageMaker for Pinecone

SageMaker is a machine learning service that lets data scientists and developers build and train models, including Large Language Models (LLMs), and then deploy them directly into a production-ready hosted environment.

The integration allows customers to use SageMaker for LLM compute and model hosting, with Pinecone as the knowledge base that keeps those LLMs up to date with the latest information and reduces hallucinations.
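
A hedged sketch of this pattern follows. The endpoint name "llm-endpoint", the {"inputs": ...} payload shape (the Hugging Face text-generation container convention), and the index name "knowledge-base" are all assumptions for illustration.

```python
import json

import boto3
from pinecone import Pinecone

# Assumptions (hypothetical): an open LLM is deployed behind a SageMaker
# endpoint named "llm-endpoint" that accepts {"inputs": ...}, and a Pinecone
# index "knowledge-base" stores chunk text under a "text" metadata field.
sm = boto3.client("sagemaker-runtime", region_name="us-east-1")
index = Pinecone(api_key="YOUR_PINECONE_API_KEY").Index("knowledge-base")


def generate(question: str, query_vector: list[float]) -> str:
    # query_vector should come from the same embedding model used to
    # populate the index (embedding step omitted here).
    results = index.query(vector=query_vector, top_k=3, include_metadata=True)
    context = "\n".join(m.metadata["text"] for m in results.matches)
    prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    resp = sm.invoke_endpoint(
        EndpointName="llm-endpoint",
        ContentType="application/json",
        Body=json.dumps({"inputs": prompt}),
    )
    # Hugging Face text-generation containers typically return
    # [{"generated_text": ...}].
    return json.loads(resp["Body"].read())[0]["generated_text"]
```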

Learn more

Pinecone on AWS Marketplace

Develop highly scalable Gen AI apps with Pinecone on AWS Marketplace. Pinecone offers usage-based pricing with no minimums or upfront commitments, and billing is managed through your AWS account.

You will be billed per minute from the moment your index is live, with monthly invoices.

Why Build with Pinecone and AWS

High Performance

  • Low latency and high throughput: search through your data in milliseconds
  • Quick, accurate results with real-time data freshness and metadata filtering
  • Support for both vector search and hybrid search (see the query sketch after this list)
  • Cost-effective scale beyond billions of vectors without compromising performance
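
As a sketch of the query sketch referenced above: a filtered vector query with the Pinecone Python SDK. The index name, query vector, and metadata fields ("year", "source") are hypothetical.

```python
from pinecone import Pinecone

# Hypothetical index and metadata schema, for illustration only.
index = Pinecone(api_key="YOUR_PINECONE_API_KEY").Index("docs")

results = index.query(
    vector=[0.1] * 1536,  # your query embedding
    top_k=5,
    include_metadata=True,
    # Metadata filtering restricts the search to matching records.
    filter={"year": {"$gte": 2023}, "source": {"$eq": "docs"}},
    # For hybrid search, a sparse component can be passed alongside the
    # dense vector via sparse_vector={"indices": [...], "values": [...]}.
)
for match in results.matches:
    print(match.id, round(match.score, 3))
```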

Developer Favorite

  • Get started in seconds with our console, which requires no AI expertise
  • Fully managed service with no infrastructure to maintain or services to monitor
  • Intuitive APIs and SDKs (a minimal quickstart follows this list)
  • Compatible with any LLM
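
The quickstart referenced above, using the Pinecone Python SDK: create a serverless index on AWS, upsert a few vectors, and query. The index name, dimension, and metadata values are placeholders.

```python
from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_PINECONE_API_KEY")

# Create a serverless index on AWS; dimension must match your embedding model.
pc.create_index(
    name="quickstart",
    dimension=8,
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)

index = pc.Index("quickstart")
index.upsert(vectors=[
    {"id": "a", "values": [0.1] * 8, "metadata": {"topic": "billing"}},
    {"id": "b", "values": [0.2] * 8, "metadata": {"topic": "search"}},
])

print(index.query(vector=[0.1] * 8, top_k=1, include_metadata=True))
```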

Enterprise Ready

  • Enterprise-grade security and compliance: GDPR-ready, SOC 2 Type II certified, and HIPAA-compliant
  • Data encryption in transit and at rest
  • Stringent access controls
  • Uptime and response time SLAs

Deploying Open Source LLMs for RAG with SageMaker

In this article, we'll learn how to build LLM + RAG pipelines using open-source models from Hugging Face deployed on AWS SageMaker.

Read Article

Build enterprise-grade Q&A at scale with Open LLMs on AWS

In this video, we'll explore how developers can build a reliable and scalable question-answering system on Amazon Web Services (AWS) using open LLMs.

Watch Video