# Baseten ## Docs - [AI tools](https://docs.baseten.co/ai-tools.md): Connect AI tools to Baseten documentation for context-aware assistance with deploying and serving models. - [Cancel a queued async request.](https://docs.baseten.co/api-reference/cancel-a-queued-async-request.md): Cancels an async request. Only requests with `QUEUED` status may be canceled. Rate limited to 20 requests per second. - [Get the status of an async request.](https://docs.baseten.co/api-reference/get-the-status-of-an-async-request.md): Returns the current status of an async model or chain request. Rate limited to 20 requests per second. - [Asynchronously call a named environment of a chain.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-a-named-environment-of-a-chain.md) - [Asynchronously call a named environment of a model.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-a-named-environment-of-a-model.md) - [Asynchronously call a specific deployment of a chain.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-a-specific-deployment-of-a-chain.md) - [Asynchronously call a specific deployment of a model.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-a-specific-deployment-of-a-model.md) - [Asynchronously call the development deployment of a chain.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-the-development-deployment-of-a-chain.md) - [Asynchronously call the development deployment of a model.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-the-development-deployment-of-a-model.md) - [Asynchronously call the production environment of a chain.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-the-production-environment-of-a-chain.md): Enqueues an asynchronous request for the chain deployment promoted to the production environment. - [Asynchronously call the production environment of a model.](https://docs.baseten.co/api-reference/non-regional/asynchronously-call-the-production-environment-of-a-model.md): Enqueues an asynchronous predict request for the deployment promoted to the production environment. Returns a request ID that can be used to poll for status or cancel the request. - [Call a specific chain deployment by deployment ID.](https://docs.baseten.co/api-reference/non-regional/call-a-specific-chain-deployment-by-deployment-id.md) - [Call a specific deployment of a model by deployment ID.](https://docs.baseten.co/api-reference/non-regional/call-a-specific-deployment-of-a-model-by-deployment-id.md): Sends a synchronous predict request to the specified deployment. - [Call the chain deployment associated with a specified environment.](https://docs.baseten.co/api-reference/non-regional/call-the-chain-deployment-associated-with-a-specified-environment.md) - [Call the development deployment of a chain.](https://docs.baseten.co/api-reference/non-regional/call-the-development-deployment-of-a-chain.md) - [Call the development deployment of a model.](https://docs.baseten.co/api-reference/non-regional/call-the-development-deployment-of-a-model.md): Sends a synchronous predict request to the development deployment. - [Call the model deployment associated with a specified environment.](https://docs.baseten.co/api-reference/non-regional/call-the-model-deployment-associated-with-a-specified-environment.md): Sends a synchronous predict request to the deployment promoted to the specified environment. - [Call the production environment of a chain.](https://docs.baseten.co/api-reference/non-regional/call-the-production-environment-of-a-chain.md): Sends a synchronous request to the chain deployment promoted to the production environment. The request body is forwarded to the chain's `run_remote` entrypoint. - [Call the production environment of a model.](https://docs.baseten.co/api-reference/non-regional/call-the-production-environment-of-a-model.md): Sends a synchronous predict request to the deployment promoted to the production environment. The request body is forwarded directly to the model's `predict` function. - [Get async queue status for a named environment.](https://docs.baseten.co/api-reference/non-regional/get-async-queue-status-for-a-named-environment.md) - [Get async queue status for a specific deployment.](https://docs.baseten.co/api-reference/non-regional/get-async-queue-status-for-a-specific-deployment.md) - [Get async queue status for the development deployment.](https://docs.baseten.co/api-reference/non-regional/get-async-queue-status-for-the-development-deployment.md) - [Get async queue status for the production environment.](https://docs.baseten.co/api-reference/non-regional/get-async-queue-status-for-the-production-environment.md): Returns the number of queued and in-progress async requests for the deployment promoted to the production environment. Rate limited to 20 requests per second. - [Wake a named environment of a model.](https://docs.baseten.co/api-reference/non-regional/wake-a-named-environment-of-a-model.md) - [Wake a specific deployment of a model by deployment ID.](https://docs.baseten.co/api-reference/non-regional/wake-a-specific-deployment-of-a-model-by-deployment-id.md) - [Wake the development deployment of a model.](https://docs.baseten.co/api-reference/non-regional/wake-the-development-deployment-of-a-model.md) - [Wake the production environment of a model.](https://docs.baseten.co/api-reference/non-regional/wake-the-production-environment-of-a-model.md): Triggers a wake for the deployment promoted to the production environment. Returns immediately with 202 Accepted. - [Asynchronously call a regional environment of a chain.](https://docs.baseten.co/api-reference/regional/asynchronously-call-a-regional-environment-of-a-chain.md): Enqueues an asynchronous run_remote request via a regional hostname. The environment is determined by the hostname, not the path. - [Asynchronously call a regional environment of a model.](https://docs.baseten.co/api-reference/regional/asynchronously-call-a-regional-environment-of-a-model.md): Enqueues an asynchronous predict request via a regional hostname. The environment is determined by the hostname, not the path. - [Call a regional environment of a chain.](https://docs.baseten.co/api-reference/regional/call-a-regional-environment-of-a-chain.md): Sends a synchronous run_remote request via a regional hostname. The environment is determined by the hostname, not the path. - [Call a regional environment of a model.](https://docs.baseten.co/api-reference/regional/call-a-regional-environment-of-a-model.md): Sends a synchronous predict request via a regional hostname. The environment is determined by the hostname, not the path. - [Get async queue status for a regional environment.](https://docs.baseten.co/api-reference/regional/get-async-queue-status-for-a-regional-environment.md) - [Wake a regional environment of a model.](https://docs.baseten.co/api-reference/regional/wake-a-regional-environment-of-a-model.md) - [How Baseten works](https://docs.baseten.co/concepts/howbasetenworks.md): Follow a model from truss push to a running endpoint: the build pipeline, request routing, autoscaling, and deployment lifecycle. - [Why Baseten](https://docs.baseten.co/concepts/whybaseten.md): Mission-critical inference with dedicated infrastructure, global scale, and full control. - [Cold starts](https://docs.baseten.co/deployment/autoscaling/cold-starts.md): Understand cold starts and how to minimize their impact on your deployments. - [Autoscaling](https://docs.baseten.co/deployment/autoscaling/overview.md): Configure autoscaling to dynamically adjust replicas based on traffic while minimizing idle compute costs. - [Request lifecycle](https://docs.baseten.co/deployment/autoscaling/request-lifecycle.md): What happens to a request from submission to response, including routing, queuing, the 1200-second sync predict timeout, and error handling. - [Traffic patterns](https://docs.baseten.co/deployment/autoscaling/traffic-patterns.md): Identify your traffic pattern and configure autoscaling settings to match. - [CI/CD](https://docs.baseten.co/deployment/ci-cd.md): Automate Truss deployments with GitHub Actions. - [Concepts](https://docs.baseten.co/deployment/concepts.md): Deployments, environments, resources, autoscaling, and CI/CD on Baseten. - [Deployments](https://docs.baseten.co/deployment/deployments.md): Deploy, manage, and scale machine learning models with Baseten - [Environments](https://docs.baseten.co/deployment/environments.md): Manage your model's release cycles with environments. - [Regional environments](https://docs.baseten.co/deployment/regional-environments.md): Guarantee inference data stays in a specific geographic region with regional environments. - [Resources](https://docs.baseten.co/deployment/resources.md): Manage and configure model resources - [Rolling deployments](https://docs.baseten.co/deployment/rolling-deployments.md): Gradually shift traffic to a new deployment with replica-based rolling deployments. - [Binary IO](https://docs.baseten.co/development/chain/binaryio.md): Performant serialization of numeric data - [Concepts](https://docs.baseten.co/development/chain/concepts.md): Glossary of Chains concepts and terminology - [Deploy](https://docs.baseten.co/development/chain/deploy.md): Deploy your Chain on Baseten - [Architecture and design](https://docs.baseten.co/development/chain/design.md): How to structure your Chainlets - [Engine-Builder LLM Models](https://docs.baseten.co/development/chain/engine-builder-models.md): Engine-Builder LLM models are pre-trained models that are optimized for specific inference tasks. - [Error Handling](https://docs.baseten.co/development/chain/errorhandling.md): Understanding and handling Chains errors - [Your first Chain](https://docs.baseten.co/development/chain/getting-started.md): Build and deploy two example Chains - [Invocation](https://docs.baseten.co/development/chain/invocation.md): Call your deployed Chain - [Local Development](https://docs.baseten.co/development/chain/localdev.md): Iterating, Debugging, Testing, Mocking - [Overview](https://docs.baseten.co/development/chain/overview.md) - [Streaming](https://docs.baseten.co/development/chain/streaming.md): Streaming outputs, reducing latency, SSEs - [Truss Integration](https://docs.baseten.co/development/chain/stub.md): Integrate deployed Truss models with stubs - [Subclassing](https://docs.baseten.co/development/chain/subclassing.md): Modularize and re-use Chainlet implementations - [Watch](https://docs.baseten.co/development/chain/watch.md): Live-patch deployed code - [Overview](https://docs.baseten.co/development/concepts.md): Choose between self-deployed models and Chains, and learn the development cycle that applies to both. - [b10cache](https://docs.baseten.co/development/model/b10cache.md): Persist data across replicas or deployments - [Base Docker images](https://docs.baseten.co/development/model/base-images.md): A guide to configuring a base image for your truss - [Baseten Delivery Network](https://docs.baseten.co/development/model/bdn.md): Optimize cold starts with multi-tier caching and data delivery - [Custom build commands](https://docs.baseten.co/development/model/build-commands.md): How to run your own docker commands during the build stage - [Build your first model](https://docs.baseten.co/development/model/build-your-first-model.md): Deploy a model to Baseten with just a config file. Pick an open-source model from Hugging Face, choose a GPU, and get an endpoint in minutes. - [Configuration](https://docs.baseten.co/development/model/configuration.md): How to configure your model. - [Custom health checks](https://docs.baseten.co/development/model/custom-health-checks.md): Customize the health of your deployments. - [Custom model code](https://docs.baseten.co/development/model/custom-model-code.md): Deploy a model with custom Python using the Truss Model class. - [Custom Docker containers](https://docs.baseten.co/development/model/custom-server.md): Deploy custom Docker containers to run inference servers like vLLM, SGLang, Triton, or any containerized application. - [Data and storage](https://docs.baseten.co/development/model/data-directory.md): Get files into your deployment and persist files your model writes at runtime. - [Deploy and iterate](https://docs.baseten.co/development/model/deploy-and-iterate.md): Use development deployments with live patching for rapid iteration, then promote to production. - [Access model environments](https://docs.baseten.co/development/model/environments.md): Configure model behavior based on environment - [gRPC](https://docs.baseten.co/development/model/grpc.md): Invoke your model over gRPC. - [HTTP endpoints](https://docs.baseten.co/development/model/http-endpoints.md): Expose server HTTP endpoints from custom model code. - [Implementation](https://docs.baseten.co/development/model/implementation.md): How to implement your model. - [Cached weights](https://docs.baseten.co/development/model/model-cache.md): Accelerate cold starts and availability by prefetching and caching your weights. - [Developing a model on Baseten](https://docs.baseten.co/development/model/overview.md): This page introduces the key concepts and workflow you'll use to package, configure, and iterate on models using Baseten's developer tooling. - [Performance optimization](https://docs.baseten.co/development/model/performance-optimization.md): Optimize model latency, throughput, and cost with Baseten engines - [Private Docker registries](https://docs.baseten.co/development/model/private-registries.md): Pull images from private container registries in Baseten deployments. - [Using request objects / cancellation](https://docs.baseten.co/development/model/requests.md): Get more control by directly using the request object. - [Custom responses](https://docs.baseten.co/development/model/responses.md): Get more control by directly creating the response object. - [Secrets](https://docs.baseten.co/development/model/secrets.md): Use secrets securely in your models - [Streaming output](https://docs.baseten.co/development/model/streaming.md): Streaming Output for LLMs - [Torch compile caching](https://docs.baseten.co/development/model/torch-compile-cache.md): Accelerate cold starts by loading in previous compilation artifacts. - [WebSockets](https://docs.baseten.co/development/model/websockets.md): Enable real-time, streaming, bidirectional communication using WebSockets for Truss models and Chains. - [BEI-Bert](https://docs.baseten.co/engines/bei/bei-bert.md): BERT-optimized embeddings with cold-start performance - [Configuration reference](https://docs.baseten.co/engines/bei/bei-reference.md): Complete reference config for BEI and BEI-Bert engines - [Overview](https://docs.baseten.co/engines/bei/overview.md): Production-grade embeddings, reranking, and classification models - [Gated features for BIS-LLM](https://docs.baseten.co/engines/bis-llm/advanced-features.md): KV-aware routing, disaggregated serving, and other gated features - [Reference Config (BIS-LLM)](https://docs.baseten.co/engines/bis-llm/bis-llm-config.md): Complete reference config for V2 inference stack and MoE models - [Overview](https://docs.baseten.co/engines/bis-llm/overview.md): Next-generation engine for MoE models with advanced optimizations - [Custom engine builder](https://docs.baseten.co/engines/engine-builder-llm/custom-engine-builder.md): Implement custom model.py for business logic, logging, and advanced inference patterns - [Reference config (Engine-Builder-LLM)](https://docs.baseten.co/engines/engine-builder-llm/engine-builder-config.md): Complete reference config for dense text generation models - [Speculative decoding guide](https://docs.baseten.co/engines/engine-builder-llm/lookahead-decoding.md): Faster inference with speculative decoding for coding agents and text generation - [LoRA support](https://docs.baseten.co/engines/engine-builder-llm/lora-support.md): Multi-LoRA adapters for Engine-Builder-LLM engine - [Overview](https://docs.baseten.co/engines/engine-builder-llm/overview.md): Dense LLM text generation with lookahead decoding and structured outputs - [Overview](https://docs.baseten.co/engines/index.md): Engine selection guide for embeddings, dense LLMs, and MoE models - [Autoscaling engines](https://docs.baseten.co/engines/performance-concepts/autoscaling-engines.md): Engine-specific autoscaling settings for BEI and Engine-Builder-LLM - [Deploy from cloud storage](https://docs.baseten.co/engines/performance-concepts/cloud-storage-deployment.md): Connect your S3 bucket, GCS bucket, Azure container, or Hugging Face repository to Baseten's TRT-LLM inference engines and deploy without re-uploading weights. - [Quantization guide](https://docs.baseten.co/engines/performance-concepts/quantization-guide.md): FP8 and FP4 trade-offs and hardware requirements for all engines - [Serve embeddings with BEI](https://docs.baseten.co/examples/bei.md): Deploy embedding, reranking, and classification models on Baseten Embeddings Inference. - [Transcribe audio with Chains](https://docs.baseten.co/examples/chains-audio-transcription.md): Process hours of audio in seconds using efficient chunking, distributed inference, and optimized GPU resources. - [Build a RAG pipeline with Chains](https://docs.baseten.co/examples/chains-build-rag.md): Combine retrieval and generation into a single compound workflow. - [Customize a model](https://docs.baseten.co/examples/customize-a-model.md): Deploy a model with custom Python code using the Truss Model class. - [Build and deploy an LLM](https://docs.baseten.co/examples/deploy-a-llm.md): Package and deploy an LLM with Truss, from model setup to inference. - [Deploy your first model](https://docs.baseten.co/examples/deploy-your-first-model.md): Deploy an open-source LLM to Baseten with just a config file and get an OpenAI-compatible API endpoint. - [Deploy a Dockerized model](https://docs.baseten.co/examples/docker.md): Deploy any model in a pre-built Docker container. - [Generate images with Flux](https://docs.baseten.co/examples/image-generation.md): Deploy Flux Schnell as a text-to-image endpoint. - [Deploy LLMs with Ollama](https://docs.baseten.co/examples/ollama.md): Run LLMs on Ollama as a custom Docker server. - [Building with Baseten](https://docs.baseten.co/examples/overview.md) - [Deploy LLMs with SGLang](https://docs.baseten.co/examples/sglang.md): Run LLMs on SGLang's high-performance serving framework. - [Configure speculative decoding](https://docs.baseten.co/examples/speculative-decoding.md): Accelerate LLM inference with lookahead decoding. - [Stream LLM responses](https://docs.baseten.co/examples/streaming.md): Stream LLM output token by token. - [Add system packages](https://docs.baseten.co/examples/system-packages.md): Deploy a model with both Python and system dependencies. - [Deploy LLMs with TensorRT-LLM](https://docs.baseten.co/examples/tensorrt-llm.md): Optimize LLMs for low latency and high throughput. - [Generate speech with Kokoro](https://docs.baseten.co/examples/text-to-speech.md): Deploy Kokoro as a text-to-speech endpoint. - [Deploy LLMs with vLLM](https://docs.baseten.co/examples/vllm.md): Run any open-source LLM on vLLM's serving framework. - [Manage groups and API keys](https://docs.baseten.co/frontier-gateway/api-keys.md): Walk the full lifecycle: create groups, build a hierarchy, mint and revoke API keys, and delete groups when a customer churns. - [Billing webhooks](https://docs.baseten.co/frontier-gateway/billing-webhooks.md): Receive signed per-request usage events from Frontier Gateway and pipe them into your billing provider out-of-band from the inference path. - [Calling your model](https://docs.baseten.co/frontier-gateway/calling-your-model.md): Make your first inference call through Baseten Frontier Gateway with a federated API key issued by your AI lab. - [Get started](https://docs.baseten.co/frontier-gateway/get-started.md): Create a group, mint an API key, and call your model through the gateway. - [Baseten Frontier Gateway](https://docs.baseten.co/frontier-gateway/overview.md): A managed API gateway for AI labs to serve hosted models under a branded URL with hierarchical groups, inherited rate and usage limits, and billing webhooks. - [Rate and usage limits](https://docs.baseten.co/frontier-gateway/rate-limits.md): Per-group, per-model token and request limits, two inheritance modes, and how Frontier Gateway computes the effective limits the runtime enforces. - [Async inference](https://docs.baseten.co/inference/async.md): Run asynchronous inference on deployed models - [Call your model](https://docs.baseten.co/inference/calling-your-model.md): Run inference on deployed models - [Function calling](https://docs.baseten.co/inference/function-calling.md): Tool selection and structured function calls with LLMs - [Configure HTTP clients](https://docs.baseten.co/inference/http-client-configuration.md): Connection pooling, retries, and timeouts for reliable inference requests. Baseten's default request timeout is 20 minutes (1200 seconds) for sync predict and 60 minutes (3600 seconds) for async submit. - [Integrations](https://docs.baseten.co/inference/integrations.md): Integrate your models with tools and use Baseten anywhere - [JSON mode](https://docs.baseten.co/inference/json-mode.md): Constrain model output to syntactically valid JSON - [Deprecation](https://docs.baseten.co/inference/model-apis/deprecation.md): Baseten's deprecation policy for Model APIs - [Model APIs](https://docs.baseten.co/inference/model-apis/overview.md): OpenAI-compatible endpoints for high-performance LLMs - [Rate limits and budgets](https://docs.baseten.co/inference/model-apis/rate-limits-and-budgets.md): Rate limits and usage budgets for Model APIs - [Reasoning](https://docs.baseten.co/inference/model-apis/reasoning.md): Control extended thinking for reasoning-capable models - [Vision](https://docs.baseten.co/inference/model-apis/vision.md): Send images and videos alongside text to vision-capable models - [Model I/O in binary](https://docs.baseten.co/inference/output-format/binary.md): Decode and save binary model output - [Model I/O with files](https://docs.baseten.co/inference/output-format/files.md): Call models by passing a file or URL - [Overview](https://docs.baseten.co/inference/overview.md): Inference on Baseten: Model APIs, self-deployed models, how responses are delivered, structured outputs, tool calling, and client configuration. - [Performance client](https://docs.baseten.co/inference/performance-client.md): High-performance client library for embeddings, reranking, classification, and generic batch requests - [SSH access](https://docs.baseten.co/inference/ssh.md): Connect to running model deployments directly from your terminal with standard SSH. - [Streaming](https://docs.baseten.co/inference/streaming.md): How to call a model that has a streaming-capable endpoint. - [Structured outputs](https://docs.baseten.co/inference/structured-outputs.md): JSON schema validation and controlled text generation across all engines - [Concepts](https://docs.baseten.co/loops/concepts.md): How Loops sessions, trainer servers, sampling servers, and checkpoints fit together. - [Loops](https://docs.baseten.co/loops/overview.md): A training SDK that supports long sequence length, async RL, and one-click checkpoint deploys on the Baseten Inference Stack. - [Quickstart](https://docs.baseten.co/loops/quickstart.md): Train a Loops checkpoint and list its files in five minutes. - [Supported base models](https://docs.baseten.co/loops/supported-models.md): Hugging Face base models Loops accepts, with dtype and sequence-length limits. - [Tinker compatibility](https://docs.baseten.co/loops/tinker-compatibility.md): Most Tinker code runs on Loops with one install change. Three things behave differently: paginated checkpoints, auth, and cluster routing. - [Export to Datadog](https://docs.baseten.co/observability/export-metrics/datadog.md): Export metrics from Baseten to Datadog - [Export to Grafana Cloud](https://docs.baseten.co/observability/export-metrics/grafana.md): Export metrics from Baseten to Grafana Cloud - [Export to New Relic](https://docs.baseten.co/observability/export-metrics/new-relic.md): Export metrics from Baseten to New Relic - [Overview](https://docs.baseten.co/observability/export-metrics/overview.md): Export metrics from Baseten to your observability stack - [Export to Prometheus](https://docs.baseten.co/observability/export-metrics/prometheus.md): Export metrics from Baseten to Prometheus - [Metrics support matrix](https://docs.baseten.co/observability/export-metrics/supported-metrics.md): Which metrics can be exported - [Status and health](https://docs.baseten.co/observability/health.md): Every model deployment in your Baseten workspace has a status to represent its activity and health. - [Logs](https://docs.baseten.co/observability/logs.md): Scope logs by environment or deployment, then filter by request ID for individual predictions. - [Metrics](https://docs.baseten.co/observability/metrics.md): Understand the load and performance of your model - [Secure model inference](https://docs.baseten.co/observability/security.md): Keeping your models safe and private - [Tracing](https://docs.baseten.co/observability/tracing.md): Investigate the prediction flow in detail - [Billing and usage](https://docs.baseten.co/observability/usage.md): Manage payments and track overall Baseten usage - [Access control](https://docs.baseten.co/organization/access.md): Manage access to your Baseten organization with role-based access control. - [API keys](https://docs.baseten.co/organization/api-keys.md): Authenticate requests to Baseten for deployment, inference, and management. - [Audit logs](https://docs.baseten.co/organization/audit-logs.md): Track configuration and access changes across your Baseten organization, and export audit events to your SIEM. - [OpenID Connect (OIDC) authentication](https://docs.baseten.co/organization/oidc.md): Use short-lived OIDC tokens to securely authenticate to cloud resources - [Organization settings](https://docs.baseten.co/organization/overview.md): Manage your Baseten organization's access, security, and resources. - [Restricted environments](https://docs.baseten.co/organization/restricted-environments.md): Control access to sensitive environments like production with environment-level permissions. - [Secrets](https://docs.baseten.co/organization/secrets.md): Store and access sensitive credentials in your deployed models. - [SSO and SCIM](https://docs.baseten.co/organization/sso-and-scim.md): Authenticate Baseten users through your identity provider and automatically provision accounts, directory groups, and roles. - [Teams](https://docs.baseten.co/organization/teams.md): Organize your organization into multiple teams with isolated resources and granular access control. - [Overview](https://docs.baseten.co/overview.md): Baseten helps you train, deploy, and serve AI models at scale with high performance and cost efficiency. - [Quickstart](https://docs.baseten.co/quickstart.md): Start running inference on Baseten. - [Truss Push GitHub Action](https://docs.baseten.co/reference/ci/github-action.md): Deploy and validate a Truss model or chain on Baseten from GitHub Actions. - [Chains CLI reference](https://docs.baseten.co/reference/cli/chains/chains-cli.md): Deploy, manage, and develop Chains using the Truss CLI. - [Truss CLI overview](https://docs.baseten.co/reference/cli/index.md): Install and configure the Truss CLI for deploying models, chains, and training jobs. - [Training CLI reference](https://docs.baseten.co/reference/cli/training/training-cli.md): Deploy, manage, and monitor training jobs using the Truss CLI. - [truss auth](https://docs.baseten.co/reference/cli/truss/auth.md): Manage authentication with Baseten remotes. - [truss cleanup](https://docs.baseten.co/reference/cli/truss/cleanup.md): Clean up Truss data. - [truss configure](https://docs.baseten.co/reference/cli/truss/configure.md): Configure Truss settings. - [truss container](https://docs.baseten.co/reference/cli/truss/container.md): Run and manage Truss containers locally. - [truss download](https://docs.baseten.co/reference/cli/truss/download.md): Download the Truss for a deployed model. - [truss image](https://docs.baseten.co/reference/cli/truss/image.md): Build and manage Truss Docker images. - [truss init](https://docs.baseten.co/reference/cli/truss/init.md): Create a new Truss project. - [truss login](https://docs.baseten.co/reference/cli/truss/login.md): Authenticate with Baseten. - [truss migrate](https://docs.baseten.co/reference/cli/truss/migrate.md): Migrate model_cache and external_data to the unified weights API. - [truss model-config](https://docs.baseten.co/reference/cli/truss/model-config.md): Fetch the config of a deployed model. - [truss model-logs](https://docs.baseten.co/reference/cli/truss/model-logs.md): Fetch logs for a deployed model. - [Truss CLI reference](https://docs.baseten.co/reference/cli/truss/overview.md): Deploy, manage, and develop models using the Truss CLI. - [truss predict](https://docs.baseten.co/reference/cli/truss/predict.md): Call the packaged model. - [truss push](https://docs.baseten.co/reference/cli/truss/push.md): Deploy a model to Baseten. - [truss run-python](https://docs.baseten.co/reference/cli/truss/run-python.md): Run a Python script in the Truss environment. - [truss ssh](https://docs.baseten.co/reference/cli/truss/ssh.md): SSH access to Baseten workloads. - [truss upgrade](https://docs.baseten.co/reference/cli/truss/upgrade.md): Upgrade the truss package to the latest or a specified version. - [truss watch](https://docs.baseten.co/reference/cli/truss/watch.md): Live reload during development. - [truss whoami](https://docs.baseten.co/reference/cli/truss/whoami.md): Show user information. - [Create an API key](https://docs.baseten.co/reference/gateway/api-keys/create-an-api-key.md): Mint a federated API key under a Frontier Gateway group. The plaintext key is returned exactly once. - [Get an API key](https://docs.baseten.co/reference/gateway/api-keys/get-an-api-key.md): Fetch metadata for one federated API key by its prefix. The plaintext key is never returned after creation. - [List API keys for a group](https://docs.baseten.co/reference/gateway/api-keys/list-api-keys-for-a-group.md): List the federated API keys minted under a Frontier Gateway group. Cursor-paginated. - [Revoke an API key](https://docs.baseten.co/reference/gateway/api-keys/revoke-an-api-key.md): Revoke a federated API key by its prefix. Other keys under the same group are unaffected. - [Billing webhooks](https://docs.baseten.co/reference/gateway/billing-webhooks.md): Payload, header, and signature reference for Frontier Gateway billing webhooks. - [Create a group](https://docs.baseten.co/reference/gateway/groups/create-a-group.md): Create a Frontier Gateway group with its model set, per-model limits, and a place in the hierarchy. - [Delete a group](https://docs.baseten.co/reference/gateway/groups/delete-a-group.md): Delete a Frontier Gateway group, recursively remove its descendants, and revoke every key in the subtree. - [Get a group](https://docs.baseten.co/reference/gateway/groups/get-a-group.md): Fetch a single Frontier Gateway group by its internal id, including its effective limits after inheritance. - [List groups](https://docs.baseten.co/reference/gateway/groups/list-groups.md): List Frontier Gateway groups in your workspace. Cursor-paginated, with optional lookup by external identifier. - [Update a group](https://docs.baseten.co/reference/gateway/groups/update-a-group.md): Update a Frontier Gateway group's display name or model configuration. Hierarchy and enforcement mode are immutable. - [Chat Completions](https://docs.baseten.co/reference/inference-api/chat-completions.md): Create chat completions using Baseten Model APIs, an OpenAI-compatible endpoint for managed LLMs. - [Messages](https://docs.baseten.co/reference/inference-api/messages.md): Create Anthropic Messages API requests against Baseten Model APIs. - [Overview](https://docs.baseten.co/reference/inference-api/overview.md): Baseten provides two ways to call models: Model APIs for managed LLMs and deployed model endpoints for custom models and chains. - [Websocket deployment](https://docs.baseten.co/reference/inference-api/predict-endpoints/deployment-websocket.md): Connect via WebSocket to a specific deployment. - [Websocket development](https://docs.baseten.co/reference/inference-api/predict-endpoints/development-websocket.md): Connect via WebSocket to the development deployment of a model or chain. - [Websocket environment](https://docs.baseten.co/reference/inference-api/predict-endpoints/environments-websocket.md): Connect via WebSocket to the deployment associated with an environment. - [Transcribe Streaming Audio](https://docs.baseten.co/reference/inference-api/predict-endpoints/streaming-transcription-api.md): Transcribe audio in real time over a WebSocket connection. - [Transcribe Pre-Recorded Audio](https://docs.baseten.co/reference/inference-api/predict-endpoints/transcription-api.md): Transcribe a pre-recorded audio file using a deployed transcription model. - [Loops API reference](https://docs.baseten.co/reference/loops-api/overview.md): HTTP routes for Loops sessions, runs, samplers, checkpoints, and deployments. - [Create an API key](https://docs.baseten.co/reference/management-api/api-keys/creates-an-api-key.md): Creates an API key with the provided name and type. The API key is returned in the response. - [Delete an API key](https://docs.baseten.co/reference/management-api/api-keys/delete-an-api-key.md): Deletes an API key by prefix and returns info about the API key. - [Get all API keys](https://docs.baseten.co/reference/management-api/api-keys/lists-the-users-api-keys.md): Lists all API keys your account has access to. - [Get billing usage summary](https://docs.baseten.co/reference/management-api/billing/gets-billing-usage-summary-for-a-date-range.md): Returns billing usage data within the specified date range. Includes dedicated model serving, training, and model APIs usage. The date range must not exceed 31 days. - [Delete chains](https://docs.baseten.co/reference/management-api/chains/deletes-a-chain-by-id.md) - [By ID](https://docs.baseten.co/reference/management-api/chains/gets-a-chain-by-id.md) - [All chains](https://docs.baseten.co/reference/management-api/chains/gets-all-chains.md) - [Any deployment by ID](https://docs.baseten.co/reference/management-api/deployments/activate/activates-a-deployment.md): Activates an inactive deployment and returns the activation status. - [Activate environment deployment](https://docs.baseten.co/reference/management-api/deployments/activate/activates-a-deployment-associated-with-an-environment.md): Activates an inactive deployment associated with an environment and returns the activation status. - [Development deployment](https://docs.baseten.co/reference/management-api/deployments/activate/activates-a-development-deployment.md): Activates an inactive development deployment and returns the activation status. - [Activate production deployment](https://docs.baseten.co/reference/management-api/deployments/activate/activates-production-deployment.md): Activates an inactive production deployment and returns the activation status. - [Update chainlet environment's autoscaling settings](https://docs.baseten.co/reference/management-api/deployments/autoscaling/update-a-chainlet-environments-autoscaling-settings.md): Updates a chainlet environment's autoscaling settings and returns the updated chainlet environment settings. - [Any model deployment by ID](https://docs.baseten.co/reference/management-api/deployments/autoscaling/updates-a-deployments-autoscaling-settings.md): Updates a deployment's autoscaling settings and returns the update status. - [Development model deployment](https://docs.baseten.co/reference/management-api/deployments/autoscaling/updates-a-development-deployments-autoscaling-settings.md): Updates a development deployment's autoscaling settings and returns the update status. - [Update production deployment autoscaling settings](https://docs.baseten.co/reference/management-api/deployments/autoscaling/updates-production-deployment-autoscaling-settings.md): Updates a production deployment's autoscaling settings and returns the update status. - [Chain deployment](https://docs.baseten.co/reference/management-api/deployments/deactivate/deactivates-a-chain-deployment.md): Deactivates a chain deployment and returns the deactivation status. - [Any deployment by ID](https://docs.baseten.co/reference/management-api/deployments/deactivate/deactivates-a-deployment.md): Deactivates a deployment and returns the deactivation status. - [Deactivate environment deployment](https://docs.baseten.co/reference/management-api/deployments/deactivate/deactivates-a-deployment-associated-with-an-environment.md): Deactivates a deployment associated with an environment and returns the deactivation status. - [Development deployment](https://docs.baseten.co/reference/management-api/deployments/deactivate/deactivates-a-development-deployment.md): Deactivates a development deployment and returns the deactivation status. - [Deactivate production deployment](https://docs.baseten.co/reference/management-api/deployments/deactivate/deactivates-production-deployment.md): Deactivates a production deployment and returns the deactivation status. - [Delete chain deployment](https://docs.baseten.co/reference/management-api/deployments/deletes-a-chain-deployment-by-id.md) - [Delete model deployments](https://docs.baseten.co/reference/management-api/deployments/deletes-a-models-deployment-by-id.md): Deletes a model's deployment by ID and returns the tombstone of the deployment. - [Get model deployment logs](https://docs.baseten.co/reference/management-api/deployments/get-deployment-logs.md): Gets all the logs for a model deployment in the given time range. - [Any chain deployment by ID](https://docs.baseten.co/reference/management-api/deployments/gets-a-chain-deployment-by-id.md) - [Any model deployment by ID](https://docs.baseten.co/reference/management-api/deployments/gets-a-models-deployment-by-id.md): Gets a model's deployment by ID and returns the deployment. - [Development model deployment](https://docs.baseten.co/reference/management-api/deployments/gets-a-models-development-deployment.md): Gets a model's development deployment and returns the deployment. - [Production model deployment](https://docs.baseten.co/reference/management-api/deployments/gets-a-models-production-deployment.md): Gets a model's production deployment and returns the deployment. - [Get all chain deployments](https://docs.baseten.co/reference/management-api/deployments/gets-all-chain-deployments.md) - [Get all model deployments](https://docs.baseten.co/reference/management-api/deployments/gets-all-deployments-of-a-model.md) - [Cancel model promotion](https://docs.baseten.co/reference/management-api/deployments/promote/cancel-promotion.md): Cancels an ongoing promotion to an environment and returns the cancellation status. - [Force cancel rolling deployment](https://docs.baseten.co/reference/management-api/deployments/promote/force-cancel-promotion.md): Immediately cancels an in-progress rolling promotion and triggers rollback to the previous version. - [Force roll forward promotion](https://docs.baseten.co/reference/management-api/deployments/promote/force-roll-forward-promotion.md): Immediately completes the rolling promotion, shifting all traffic to the new version. This works even if the promotion is in the process of rolling back. - [Pause rolling deployment](https://docs.baseten.co/reference/management-api/deployments/promote/pause-promotion.md): Pauses an in-progress rolling promotion after the current step completes. No further scaling changes are made until resumed. - [Promote to chain environment](https://docs.baseten.co/reference/management-api/deployments/promote/promotes-a-chain-deployment-to-an-environment.md): Promotes an existing chain deployment to an environment and returns the promoted chain deployment. - [Promote to model environment](https://docs.baseten.co/reference/management-api/deployments/promote/promotes-a-deployment-to-an-environment.md): Promotes an existing deployment to an environment and returns the promoted deployment. - [Any model deployment by ID](https://docs.baseten.co/reference/management-api/deployments/promote/promotes-a-deployment-to-production.md): Promotes an existing deployment to production and returns the same deployment. - [Development model deployment](https://docs.baseten.co/reference/management-api/deployments/promote/promotes-a-development-deployment-to-production.md): Creates a new production deployment from the development deployment, the currently building deployment is returned. - [Resume rolling deployment](https://docs.baseten.co/reference/management-api/deployments/promote/resume-promotion.md): Resumes a paused rolling promotion, continuing from where it was paused. - [Any deployment by ID](https://docs.baseten.co/reference/management-api/deployments/retry/retries-a-deployment.md): Retries a failed deployment and returns the retry status and updated deployment. - [Development deployment](https://docs.baseten.co/reference/management-api/deployments/retry/retries-a-development-deployment.md): Retries a failed development deployment and returns the retry status and updated deployment. - [Production deployment](https://docs.baseten.co/reference/management-api/deployments/retry/retries-production-deployment.md): Retries a failed production deployment and returns the retry status and updated deployment. - [Terminate deployment replica](https://docs.baseten.co/reference/management-api/deployments/terminates-deployment-replica.md): Terminates a deployment replica and returns the termination status. - [Create Chain environment](https://docs.baseten.co/reference/management-api/environments/create-a-chain-environment.md): Create a chain environment. Returns the resulting environment. - [Create environment](https://docs.baseten.co/reference/management-api/environments/create-an-environment.md): Creates an environment for the specified model and returns the environment. - [Get Chain environment](https://docs.baseten.co/reference/management-api/environments/get-a-chain-environments-details.md): Gets a chain environment's details and returns the chain environment. - [Get all Chain environments](https://docs.baseten.co/reference/management-api/environments/get-all-chain-environments.md): Gets all chain environments for a given chain - [Get all environments](https://docs.baseten.co/reference/management-api/environments/get-all-environments.md): Gets all environments for a given model - [Get environment](https://docs.baseten.co/reference/management-api/environments/get-an-environments-details.md): Gets an environment's details and returns the environment. - [Update Chain environment](https://docs.baseten.co/reference/management-api/environments/update-a-chain-environments-settings.md): Update a chain environment's settings and returns the chain environment. - [Update chainlet environment's instance type](https://docs.baseten.co/reference/management-api/environments/update-a-chainlet-environments-instance-type-settings.md): Updates a chainlet environment's instance type settings. The chainlet environment setting must exist. When updated, a new chain deployment is created and deployed. It is promoted to the chain environment according to promotion settings on the environment. - [Update model environment](https://docs.baseten.co/reference/management-api/environments/update-an-environments-settings.md): Asynchronously updates an environment's settings. Poll the GET endpoint for the applied state. - [All instance types](https://docs.baseten.co/reference/management-api/instance-types/gets-all-instance-types.md) - [Instance type prices](https://docs.baseten.co/reference/management-api/instance-types/gets-instance-type-prices.md) - [Delete models](https://docs.baseten.co/reference/management-api/models/deletes-a-model-by-id.md) - [By ID](https://docs.baseten.co/reference/management-api/models/gets-a-model-by-id.md) - [All models](https://docs.baseten.co/reference/management-api/models/gets-all-models.md) - [Overview](https://docs.baseten.co/reference/management-api/overview.md): Manage models and deployments with the Baseten management API. It supports monitoring, CI/CD, and automation at both the model and workspace levels. - [Rate limits](https://docs.baseten.co/reference/management-api/rate-limits.md): Rate limits, response shape, and retry handling for the Baseten management API. - [Get all secrets](https://docs.baseten.co/reference/management-api/secrets/gets-all-secrets.md) - [Upsert a secret](https://docs.baseten.co/reference/management-api/secrets/upserts-a-secret.md): Creates or updates a secret by name. Scoped to the caller's primary team — use the team-scoped variant to target a specific team. - [Create a team API key](https://docs.baseten.co/reference/management-api/teams/creates-a-team-api-key.md): Creates a team API key with the provided name and type. The API key is returned in the response. - [Create a team training project](https://docs.baseten.co/reference/management-api/teams/creates-a-team-training-project.md): Upserts a training project with the specified metadata for a team. - [Get all team secrets](https://docs.baseten.co/reference/management-api/teams/gets-all-team-secrets.md) - [List all teams](https://docs.baseten.co/reference/management-api/teams/lists-all-teams.md): Returns a list of all teams the authenticated user has access to. - [Upsert a team secret](https://docs.baseten.co/reference/management-api/teams/upserts-a-team-secret.md): Creates a new secret or updates an existing secret if one with the provided name already exists. The name and creation date of the created or updated secret is returned. This secret belongs to the specified team - [Reference documentation](https://docs.baseten.co/reference/overview.md): For deploying, managing, and interacting with machine learning models on Baseten. - [Chains SDK Reference](https://docs.baseten.co/reference/sdk/chains.md): Python SDK Reference for Chains - [Loops SDK](https://docs.baseten.co/reference/sdk/loops.md): Python client for Loops: ServiceClient, TrainingClient, SamplingClient, and the Tinker compatibility shim. - [Training SDK](https://docs.baseten.co/reference/sdk/training.md): API reference for the Baseten training SDK. - [Truss SDK Reference](https://docs.baseten.co/reference/sdk/truss.md): Python SDK for deploying and managing models with Truss. - [Create training job](https://docs.baseten.co/reference/training-api/create-training-job.md): Creates a training job with the specified configuration. - [Create training project](https://docs.baseten.co/reference/training-api/create-training-project.md): Upserts a training project with the specified metadata. - [Delete training job](https://docs.baseten.co/reference/training-api/delete-training-job.md): Deletes a training job. Stops it first if still running. - [Delete training project](https://docs.baseten.co/reference/training-api/delete-training-project.md): Deletes a training project and all associated training jobs. - [Download training job source code](https://docs.baseten.co/reference/training-api/download-training-job.md): Get the uploaded training job as a S3 Artifact - [Get auth codes for training job](https://docs.baseten.co/reference/training-api/get-auth-codes-for-training-job.md): Get authentication codes for all nodes of a training job's interactive sessions. - [Get training job](https://docs.baseten.co/reference/training-api/get-training-job.md): Get the details of an existing training job. - [Get training job checkpoint files](https://docs.baseten.co/reference/training-api/get-training-job-checkpoint-files.md): Get presigned URLs for all checkpoint files for a training job. - [List training job checkpoints](https://docs.baseten.co/reference/training-api/get-training-job-checkpoints.md): Get the checkpoints for a training job. - [Get training job logs](https://docs.baseten.co/reference/training-api/get-training-job-logs.md): Get the logs for a training job with the provided filters. - [Get training job metrics](https://docs.baseten.co/reference/training-api/get-training-job-metrics.md): Get the metrics for a training job. - [Get training project](https://docs.baseten.co/reference/training-api/get-training-project.md): Get the details of an existing training project. - [Get training project cache summary](https://docs.baseten.co/reference/training-api/get-training-project-cache-summary.md): Get the cache summary for the most recent training job in the project. - [List training projects](https://docs.baseten.co/reference/training-api/get-training-projects.md): List all training projects for the organization. - [List training jobs](https://docs.baseten.co/reference/training-api/list-training-jobs.md): List all training jobs for the training project. - [Overview](https://docs.baseten.co/reference/training-api/overview.md): Programmatically manage Baseten Training resources. - [Recreate training job](https://docs.baseten.co/reference/training-api/recreate-training-job.md): Create a new training job with the same configuration as an existing training job. - [Search training jobs](https://docs.baseten.co/reference/training-api/search-training-jobs.md): Search training jobs for the organization. - [Stop training job](https://docs.baseten.co/reference/training-api/stop-training-job.md): Stops a training job. - [Truss configuration](https://docs.baseten.co/reference/truss-configuration.md): Set your model resources, dependencies, and more - [Baseten platform status](https://docs.baseten.co/status/status.md): Current operational status of Baseten's services. - [Building blocks](https://docs.baseten.co/training/concepts/basics.md): Learn how to get up and running on Baseten Training - [Cache](https://docs.baseten.co/training/concepts/cache.md): Learn how to use the training cache to speed up your training iterations by persisting data between jobs. - [Checkpoints](https://docs.baseten.co/training/concepts/checkpoints.md): Learn how to use Baseten's checkpointing feature to manage model checkpoints and avoid disk errors during training. - [Multinode training](https://docs.baseten.co/training/concepts/multinode.md): Learn how to configure and run multinode training jobs with Baseten Training. - [Storage and data ingestion](https://docs.baseten.co/training/concepts/storage.md): Load model weights and training data into Baseten training containers through BDN, S3, Hugging Face, and GCS. - [Deploy with optimized inference engines](https://docs.baseten.co/training/deploy-with-engine-builder.md): Deploy model checkpoints from Baseten Training directly to an inference engine without downloading or re-uploading weights. - [Serving your trained model](https://docs.baseten.co/training/deployment.md): How to deploy checkpoints from Baseten Training jobs as usable models. - [Get started](https://docs.baseten.co/training/getting-started.md): Run your first training job and deploy it to production. - [VS Code and Cursor remote tunnels](https://docs.baseten.co/training/interactive-sessions.md): Connect to training containers for remote debugging and development via VS Code or Cursor Remote Tunnels. - [Lifecycle](https://docs.baseten.co/training/lifecycle.md): Understanding the different states and transitions in a Baseten training job's lifecycle. - [Loading checkpoints](https://docs.baseten.co/training/loading.md): Resume training from existing checkpoints to continue where you left off. - [Management](https://docs.baseten.co/training/management.md): How to monitor, manage, and interact with your Baseten Training projects and jobs. - [Training on Baseten](https://docs.baseten.co/training/overview.md): Train custom models with developer-first training infrastructure on Baseten. - [Remote access](https://docs.baseten.co/training/remote-access.md): Connect to running training jobs from your local machine to debug, inspect state, and develop interactively. - [SSH access](https://docs.baseten.co/training/ssh.md): Connect to training containers directly from your terminal with standard SSH. - [Deployments](https://docs.baseten.co/troubleshooting/deployments.md): Troubleshoot common problems during model deployment - [Inference](https://docs.baseten.co/troubleshooting/inference.md): Troubleshoot common problems during model inference ## OpenAPI Specs - [spec](https://docs.baseten.co/spec) - [messages-openapi-spec](https://docs.baseten.co/reference/inference-api/messages-openapi-spec.json) - [llm-openapi-spec](https://docs.baseten.co/reference/inference-api/llm-openapi-spec.json) - [inference-api-spec](https://docs.baseten.co/reference/inference-api/inference-api-spec.json) - [meta](https://docs.baseten.co/styles/proselint/meta.json) - [Very](https://docs.baseten.co/styles/proselint/Very.yml) - [Uncomparables](https://docs.baseten.co/styles/proselint/Uncomparables.yml) - [Typography](https://docs.baseten.co/styles/proselint/Typography.yml) - [Spelling](https://docs.baseten.co/styles/proselint/Spelling.yml) - [Skunked](https://docs.baseten.co/styles/proselint/Skunked.yml) - [RASSyndrome](https://docs.baseten.co/styles/proselint/RASSyndrome.yml) - [P-Value](https://docs.baseten.co/styles/proselint/P-Value.yml) - [Oxymorons](https://docs.baseten.co/styles/proselint/Oxymorons.yml) - [Nonwords](https://docs.baseten.co/styles/proselint/Nonwords.yml) - [Needless](https://docs.baseten.co/styles/proselint/Needless.yml) - [Malapropisms](https://docs.baseten.co/styles/proselint/Malapropisms.yml) - [LGBTTerms](https://docs.baseten.co/styles/proselint/LGBTTerms.yml) - [LGBTOffensive](https://docs.baseten.co/styles/proselint/LGBTOffensive.yml) - [Jargon](https://docs.baseten.co/styles/proselint/Jargon.yml) - [Hyperbole](https://docs.baseten.co/styles/proselint/Hyperbole.yml) - [Hedging](https://docs.baseten.co/styles/proselint/Hedging.yml) - [GroupTerms](https://docs.baseten.co/styles/proselint/GroupTerms.yml) - [GenderBias](https://docs.baseten.co/styles/proselint/GenderBias.yml) - [Diacritical](https://docs.baseten.co/styles/proselint/Diacritical.yml) - [DenizenLabels](https://docs.baseten.co/styles/proselint/DenizenLabels.yml) - [DateSpacing](https://docs.baseten.co/styles/proselint/DateSpacing.yml) - [DateRedundancy](https://docs.baseten.co/styles/proselint/DateRedundancy.yml) - [DateMidnight](https://docs.baseten.co/styles/proselint/DateMidnight.yml) - [DateCase](https://docs.baseten.co/styles/proselint/DateCase.yml) - [Cursing](https://docs.baseten.co/styles/proselint/Cursing.yml) - [Currency](https://docs.baseten.co/styles/proselint/Currency.yml) - [CorporateSpeak](https://docs.baseten.co/styles/proselint/CorporateSpeak.yml) - [Cliches](https://docs.baseten.co/styles/proselint/Cliches.yml) - [But](https://docs.baseten.co/styles/proselint/But.yml) - [Archaisms](https://docs.baseten.co/styles/proselint/Archaisms.yml) - [Apologizing](https://docs.baseten.co/styles/proselint/Apologizing.yml) - [Annotations](https://docs.baseten.co/styles/proselint/Annotations.yml) - [AnimalLabels](https://docs.baseten.co/styles/proselint/AnimalLabels.yml) - [Airlinese](https://docs.baseten.co/styles/proselint/Airlinese.yml) - [WordList](https://docs.baseten.co/styles/Google/WordList.yml) - [Will](https://docs.baseten.co/styles/Google/Will.yml) - [We](https://docs.baseten.co/styles/Google/We.yml) - [Units](https://docs.baseten.co/styles/Google/Units.yml) - [Spacing](https://docs.baseten.co/styles/Google/Spacing.yml) - [Slang](https://docs.baseten.co/styles/Google/Slang.yml) - [Semicolons](https://docs.baseten.co/styles/Google/Semicolons.yml) - [Ranges](https://docs.baseten.co/styles/Google/Ranges.yml) - [Quotes](https://docs.baseten.co/styles/Google/Quotes.yml) - [Periods](https://docs.baseten.co/styles/Google/Periods.yml) - [Passive](https://docs.baseten.co/styles/Google/Passive.yml) - [Parens](https://docs.baseten.co/styles/Google/Parens.yml) - [OxfordComma](https://docs.baseten.co/styles/Google/OxfordComma.yml) - [Ordinal](https://docs.baseten.co/styles/Google/Ordinal.yml) - [OptionalPlurals](https://docs.baseten.co/styles/Google/OptionalPlurals.yml) - [LyHyphens](https://docs.baseten.co/styles/Google/LyHyphens.yml) - [Latin](https://docs.baseten.co/styles/Google/Latin.yml) - [Headings](https://docs.baseten.co/styles/Google/Headings.yml) - [HeadingPunctuation](https://docs.baseten.co/styles/Google/HeadingPunctuation.yml) - [Gender](https://docs.baseten.co/styles/Google/Gender.yml) - [FirstPerson](https://docs.baseten.co/styles/Google/FirstPerson.yml) - [Exclamation](https://docs.baseten.co/styles/Google/Exclamation.yml) - [EmDash](https://docs.baseten.co/styles/Google/EmDash.yml) - [Ellipses](https://docs.baseten.co/styles/Google/Ellipses.yml) - [DateFormat](https://docs.baseten.co/styles/Google/DateFormat.yml) - [Contractions](https://docs.baseten.co/styles/Google/Contractions.yml) - [Colons](https://docs.baseten.co/styles/Google/Colons.yml) - [Acronyms](https://docs.baseten.co/styles/Google/Acronyms.yml) - [AMPM](https://docs.baseten.co/styles/Google/AMPM.yml) - [package](https://docs.baseten.co/package.json) - [package-lock](https://docs.baseten.co/package-lock.json) - [settings](https://docs.baseten.co/.vscode/settings.json)