LLama 7B Example
Use theEngineBuilderLLMChainlet baseclass to configure an LLM engine. The additional engine_builder_config field specifies model architecture, repository, and runtime parameters and more, the full options are detailed in the Engine Builder configuration guide.
Differences from Standard Chainlets
- No
run_remoteimplementation: Unlike regular Chainlets,EngineBuilderLLMChainletdoes not require users to implementrun_remote(). Instead, it automatically wires into the deployed engine’s API. All LLM Chainlets have the same function signature:chains.EngineBuilderLLMInputas input and a stream (AsyncIterator) of strings as output. LikewiseEngineBuilderLLMChainlets can only be used as dependencies, but not have dependencies themselves. - No
run_local(guide) andwatch(guide) Standard Chains support a local debugging mode and watch. However, when usingEngineBuilderLLMChainlet, local execution is not available, and testing must be done after deployment. For a faster dev loop of the rest of your chain (everything except the engine builder chainlet) you can substitute those chainlets with stubs like you can do for an already deployed truss model [guide].
Integrate the Engine Builder Chainlet
After defining anEngineBuilderLLMInput like Llama7BChainlet above, you can use it as a dependency in other conventional chainlets: