AnnouncementPinecone serverless on AWS is now generally availableLearn more
Models

instructor-base

Small text embedding model that can generate text embeddings tailored to any task or domain via natural language instructions.
Dimension:Size of a single vector
supported by this model.
768
Distance Metric:Used to measure similarity
between vectors.
cosine or dot product
Max Seq. Length:Number of tokens the model
can process at once.
512
An instruction-finetuned text embedding model that can generate text embeddings tailored to any task (e.g., classification, retrieval, clustering, text evaluation, etc.) or domain (e.g., science, finance, etc.) by simply providing the task instruction in natural language. Embedding model trained on instructions for specific domains. Takes customized text units (e.g. paragraph, sentence, document). Smallest model of the Instructor family.
Learn more about instructor-base