Custom or fine-tuned models in Trieve Vector Inference

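The open source text models on Hugging Face may not always be what you want. The steps below show how to serve a private or custom model with Trieve Vector Inference (TVI) instead.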
1

Update embedding_models.yaml

To use a private or custom model with Trieve Vector Inference, add an entry for it to your embedding_models.yaml file. If the model is a private or gated Hugging Face model, also include your Hugging Face API token in the hfToken field.
embedding_models.yaml
...
models:
  ...
  my-custom-model:
    replicas: 1
    revision: main
    modelName: trieve/private-model-example
    hfToken: "hf_**********************************"
...
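
YAML does not allow tab characters for indentation, so a stray tab in embedding_models.yaml can make the file fail to parse. The following is a minimal sketch of a pre-flight check, not part of TVI itself: check_no_tabs is a hypothetical helper name, and it only looks for tabs in leading whitespace.

```shell
# Hypothetical helper (not part of TVI): report tab indentation in a YAML file.
# Returns 0 if no line is indented with a tab, 1 if one is, 2 if the file is missing.
check_no_tabs() {
    file="$1"
    if [ ! -f "$file" ]; then
        echo "error: $file not found" >&2
        return 2
    fi
    tab="$(printf '\t')"
    # Match lines whose leading whitespace contains a tab; print them with line numbers.
    if grep -n "^ *${tab}" "$file" >&2; then
        echo "error: tab indentation found in $file (lines listed above)" >&2
        return 1
    fi
    echo "ok: no tab indentation in $file"
}
```

It could be run as `check_no_tabs embedding_models.yaml` before upgrading the cluster.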
2

Update your TVI cluster

Upgrade the TVI Helm release so that it picks up the models defined in embedding_models.yaml:
helm upgrade -i vector-inference \
    oci://709825985650.dkr.ecr.us-east-1.amazonaws.com/trieve/trieve-embeddings  \
    -f embedding_models.yaml
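
After the upgrade, it can be useful to confirm that the release deployed and that the model pods are starting. A rough sketch, assuming the release name vector-inference from the command above and that the pods live in your current namespace; check_tvi_release is a hypothetical helper name:

```shell
# Hypothetical helper: show the Helm release status, then list pods
# in the current namespace so you can see whether the model pods are running.
check_tvi_release() {
    release="${1:-vector-inference}"   # release name taken from the helm upgrade command
    helm status "$release" || return 1
    kubectl get pods
}
```

Calling `check_tvi_release` with no arguments checks the vector-inference release. Large models may take a while to download, so pods can stay in a non-ready state for some time.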
3

Get embeddings endpoint

List the ingress resources to find the address of your model's endpoint:
kubectl get ing
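
Once the ingress has an address, you can send it a test request. The sketch below rests on two assumptions that the steps above do not confirm: that the load balancer reports a hostname (typical on AWS), and that the model is served on an /embed route accepting a JSON body with an inputs field. Both function names are hypothetical.

```shell
# Hypothetical helper: print each ingress name and its load balancer hostname.
# Assumes the address is reported as a hostname, as is typical on AWS.
list_ingress_hosts() {
    kubectl get ing -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.loadBalancer.ingress[0].hostname}{"\n"}{end}'
}

# Hypothetical helper: POST a single text to an assumed /embed route.
# The text is inserted into the JSON body without escaping, so keep it
# free of quotes and backslashes.
embed_text() {
    endpoint="$1"   # e.g. http://<ingress hostname>
    text="$2"
    curl -sS -X POST "${endpoint}/embed" \
        -H 'Content-Type: application/json' \
        -d "{\"inputs\": \"${text}\"}"
}
```

For example, `embed_text "http://<ingress hostname>" "test input"` would send one input to the model, with the placeholder replaced by an address from `list_ingress_hosts`.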