From model weights to API endpoint
Phi 3 Mini
.
You should see the following file structure:
model/model.py
and config.yaml
.
model/model.py
:
predict()
function:
config.yaml
, define the Python environment and compute resources:
truss
will ask for your API-key and save it for future runs.
Monitor the deployment from your Baseten dashboard.
model_id
from your deployment.
model.py
, save, and test the API again.