Recommended CPU / RAM / Disk to run RECAP
vespaengine/vespa-generic-intel-x86_64
image in your docker-compose.yml
. This generic image is slower but ensures compatibility. For details, see Vespa CPU Support.
To clean up these unused images, run: docker system prune --all
.m7g.xlarge
instance by default,
which has 4vCPU cores + 16GB of RAM + 200GB of disk space. We’re comfortable using 4vCPU cores in a
production setting since we have dedicated GPU instances that run the embedding / cross-encoder models.
If you do not plan on setting that up, we would recommend going with 8vCPU cores (if possible) for a
production deployment.
api_server
- 1 CPU, 2Gi Memory
background
- 2 CPU, 4Gi Memory
indexing_model_server
/ inference_model_server
- 2 CPU, 4Gi Memory
postgres
- 500m CPU, 2Gi Memory
vespa
- >=4 CPU, >= 4Gi Memory. This is the bare minimum, and we would generally recommend
higher than this. The resources required here also scales linearly with the number of documents indexed.
For reference, with 50GB of documents, we would generally recommend at least 10 CPU, 20Gi Memory +
tuning the VESPA_SEARCHER_THREADS
environment variable.
nginx
- 250m CPU, 128Mi Memory