Configure RECAP to use a Custom Model Server via requests
RECAP can also send requests to an arbitrary model server over REST. An access token can optionally be passed
in. To customize the request format or the handling of the response, it may be necessary to update and rebuild the
RECAP containers.
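As a rough illustration of what such a request might look like, the sketch below builds a JSON POST with an optional access token using only the Python standard library. The function name `build_request`, the `{"prompt": ...}` payload shape, and the use of a bearer-style `Authorization` header are all assumptions made for this example; they are not taken from RECAP's code, and your model server may expect something different.

```python
import json
import urllib.request
from typing import Optional


def build_request(
    url: str, prompt: str, access_token: Optional[str] = None
) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for a model server."""
    # Hypothetical payload shape; a real server defines its own schema.
    body = json.dumps({"prompt": prompt}).encode("utf-8")
    headers = {"Content-Type": "application/json"}
    if access_token:
        # Assumption: the server accepts the token as a bearer token.
        headers["Authorization"] = f"Bearer {access_token}"
    return urllib.request.Request(url, data=body, headers=headers, method="POST")
```

The returned object can be passed to `urllib.request.urlopen` to actually send it. Keeping construction separate from sending makes the request format easy to inspect before any container is rebuilt.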
Extending RECAP to be compatible with your custom model server
The interface to implement is minimal and can support any LLM model server.
Simply update the code here
and rebuild. The default implementation is compatible with the blog demo shown below.
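To give a sense of how small such an interface can be, here is a minimal sketch of an adapter with one method for formatting the request and one for parsing the response. The class name `CustomModelServer`, its method names, and the `inputs` / `generated_text` JSON fields are hypothetical and chosen only for illustration; the actual interface is whatever the RECAP source defines.

```python
import json
from typing import Callable


class CustomModelServer:
    """Hypothetical adapter; not RECAP's actual class."""

    def __init__(self, transport: Callable[[bytes], bytes]):
        # The transport sends raw request bytes and returns raw response bytes.
        # Injecting it keeps the format/parse logic testable without a server.
        self._transport = transport

    def format_request(self, prompt: str) -> bytes:
        # Assumed request schema: {"inputs": "<prompt>"}.
        return json.dumps({"inputs": prompt}).encode("utf-8")

    def parse_response(self, raw: bytes) -> str:
        # Assumed response schema: {"generated_text": "<completion>"}.
        return json.loads(raw)["generated_text"]

    def generate(self, prompt: str) -> str:
        return self.parse_response(self._transport(self.format_request(prompt)))


def echo_transport(body: bytes) -> bytes:
    """Stand-in for a real server: returns the prompt in upper case."""
    prompt = json.loads(body)["inputs"]
    return json.dumps({"generated_text": prompt.upper()}).encode("utf-8")
```

With the stand-in transport, `CustomModelServer(echo_transport).generate("hello")` returns the upper-cased prompt. Adapting to a different server would then mean changing only `format_request` and `parse_response`.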