When running LangSmith on Amazon Web Services (AWS), you can set up in either self-hosted or hybrid mode. In both cases, your workloads run on AWS infrastructure within your account, allowing you to use AWS managed services while maintaining control over your data and compute resources. This page provides AWS-specific architecture patterns, service recommendations, and best practices for deploying and operating LangSmith on AWS.
LangChain provides Terraform modules for AWS to help provision infrastructure for LangSmith. These modules can quickly set up EKS clusters, RDS, ElastiCache, S3, and networking resources. View the AWS Terraform modules for documentation and examples.

Reference architecture

LangSmith on AWS uses managed services to provide a scalable, secure, and resilient platform. The following architecture applies to both self-hosted and hybrid deployments and aligns with the AWS Well-Architected Framework:

[Figure: Architecture diagram showing AWS relations to LangSmith services]

LangSmith self-hosted models

You can host LangSmith on AWS using any of the three self-hosted models, or in hybrid mode:
  • LangSmith Observability and Evaluation: Deploy the UI and API services (frontend, backend, platform backend, playground, queue workers, and ACE). Use external AWS managed services for RDS PostgreSQL, ElastiCache, and S3.
  • Full LangSmith Platform (Observability, Evaluation, and Agent Deployment): In addition to the application services, run the Agent Server control plane and data plane in your EKS cluster. The control plane is installed via Helm; the data plane consists of Agent Server pods.
  • Standalone Agent Server: Deploy one or a few Agent Servers on EKS or Docker with external RDS PostgreSQL and ElastiCache. Optionally integrate with the LangSmith UI for tracing. This model offers maximum flexibility and suits microservice architectures.
  • Hybrid: Run your data plane (Agent Servers and backing services) on AWS infrastructure while using LangChain’s managed control plane for the UI and APIs. The data plane uses the same AWS services (EKS, RDS PostgreSQL, ElastiCache) as the self-hosted models.
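The differences between these models can be summarized as data. The following is a minimal sketch, not an official configuration format: the dictionary keys and the `models_using` helper are hypothetical names chosen for illustration, and the service lists only restate what the bullets above say.

```python
# Illustrative summary of the deployment models described above.
# All key names are hypothetical; they are not LangSmith configuration keys.
MODELS = {
    "observability_and_evaluation": {
        "ui_and_apis": "self-hosted",
        "agent_servers": False,
        "backing_services": ("RDS PostgreSQL", "ElastiCache", "S3"),
    },
    "full_platform": {
        "ui_and_apis": "self-hosted",
        "agent_servers": True,
        # Assumption: the full platform keeps the backing services of the
        # first model, since it is described as an addition to it.
        "backing_services": ("RDS PostgreSQL", "ElastiCache", "S3"),
    },
    "standalone_agent_server": {
        "ui_and_apis": "optional",
        "agent_servers": True,
        "backing_services": ("RDS PostgreSQL", "ElastiCache"),
    },
    "hybrid": {
        "ui_and_apis": "langchain-managed",
        "agent_servers": True,
        "backing_services": ("RDS PostgreSQL", "ElastiCache"),
    },
}


def models_using(service: str) -> list[str]:
    """Return the names of the models that list the given backing service."""
    return sorted(
        name for name, spec in MODELS.items()
        if service in spec["backing_services"]
    )


print(models_using("S3"))
```

A table like this can help when deciding which AWS resources to provision for a given model before reaching for the Terraform modules.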

Compute options

LangSmith supports multiple compute options depending on your requirements:
Compute option | Description | Suitable for
Elastic Kubernetes Service (preferred) | Advanced scaling and multi-tenant support | Large enterprises
EC2-based | Full control, BYO-infra | Regulated or air-gapped environments

AWS Well-Architected best practices

This reference is designed to align with the six pillars of the AWS Well-Architected Framework:

Operational excellence

Security

  • Use IAM roles with least-privilege policies.
  • Enable encryption at rest (RDS, S3, ClickHouse volumes) and in transit (TLS 1.2+).
  • Integrate with AWS Secrets Manager for credentials.
  • Use Amazon Cognito as an IDP in conjunction with LangSmith’s built-in authentication and authorization features to secure access to agents and their tools.
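As an illustration of the least-privilege recommendation, the sketch below builds an IAM policy document scoped to a single S3 bucket. The bucket name, the function name, and the specific set of S3 actions are assumptions made for the example; check which actions your LangSmith deployment actually needs before using anything like it.

```python
import json


def s3_least_privilege_policy(bucket_name: str, prefix: str = "") -> dict:
    """Build an IAM policy document limited to one bucket and optional key prefix."""
    bucket_arn = f"arn:aws:s3:::{bucket_name}"
    object_arn = f"{bucket_arn}/{prefix}*"
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Object-level access only within the chosen bucket/prefix.
                # The action list is an assumption for this example.
                "Sid": "ObjectReadWrite",
                "Effect": "Allow",
                "Action": ["s3:GetObject", "s3:PutObject", "s3:DeleteObject"],
                "Resource": object_arn,
            },
            {
                # Listing is a bucket-level action, so it targets the bucket ARN.
                "Sid": "ListBucket",
                "Effect": "Allow",
                "Action": "s3:ListBucket",
                "Resource": bucket_arn,
            },
        ],
    }


# "example-langsmith-blobs" is a hypothetical bucket name.
policy = s3_least_privilege_policy("example-langsmith-blobs")
print(json.dumps(policy, indent=2))
```

The point of the sketch is the shape: explicit actions and explicit resource ARNs, with no wildcard actions such as `s3:*` and no `"Resource": "*"`.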

Reliability

  • Replicate the LangSmith data plane across regions: Deploy identical data planes to Kubernetes clusters in different regions for LangSmith Deployment. Deploy RDS and ECS services across multiple Availability Zones (Multi-AZ).
  • Implement auto-scaling for backend workers.
  • Use Amazon Route 53 health checks and failover policies.
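To make the Route 53 failover recommendation concrete, here is a hedged sketch that builds a primary/secondary pair of failover records in the shape accepted by the Route 53 `ChangeResourceRecordSets` API. The hostnames, health check ID, and helper names are hypothetical, and the code only constructs the payload; it does not call AWS.

```python
def failover_record(name, target, role, health_check_id=None, ttl=60):
    """Build one Route 53 failover record set (CNAME) as a plain dict."""
    if role not in ("PRIMARY", "SECONDARY"):
        raise ValueError("role must be 'PRIMARY' or 'SECONDARY'")
    record = {
        "Name": name,
        "Type": "CNAME",
        "SetIdentifier": f"{role.lower()}-{target}",
        "Failover": role,
        "TTL": ttl,
        "ResourceRecords": [{"Value": target}],
    }
    if health_check_id is not None:
        # Route 53 uses the health check to decide when to fail over.
        record["HealthCheckId"] = health_check_id
    return record


def change_batch(records):
    """Wrap record sets in an UPSERT change batch."""
    return {
        "Changes": [
            {"Action": "UPSERT", "ResourceRecordSet": r} for r in records
        ]
    }


# All names and IDs below are hypothetical placeholders.
batch = change_batch([
    failover_record("langsmith.example.com.", "lb-us-east-1.example.com",
                    "PRIMARY", health_check_id="hc-primary-placeholder"),
    failover_record("langsmith.example.com.", "lb-us-west-2.example.com",
                    "SECONDARY"),
])
print(len(batch["Changes"]))
```

With boto3 installed and credentials configured, a batch like this could be passed as the `ChangeBatch` argument of the Route 53 client's `change_resource_record_sets` call, together with the `HostedZoneId` of your zone.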

Performance efficiency

Cost optimization

Sustainability

  • Minimize idle workloads with on-demand compute.
  • Store telemetry in low-latency, low-cost tiers.
  • Enable auto-shutdown for non-prod environments.
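One way to approach auto-shutdown for non-production environments is a scheduled job that decides whether an environment should be running. The sketch below shows only that decision; the environment names, the weekday 08:00 to 19:00 window, and the function name are assumptions, and the actual scale-down step (for example, scaling node groups to zero) is not shown.

```python
from datetime import datetime, time


def should_be_running(
    environment: str,
    now: datetime,
    start: time = time(8, 0),
    stop: time = time(19, 0),
) -> bool:
    """Return True if the environment should be up at the given local time."""
    if environment == "prod":
        # Production is never shut down by this schedule.
        return True
    is_weekday = now.weekday() < 5  # Monday=0 ... Friday=4
    return is_weekday and start <= now.time() < stop


# A scheduler (cron, or a scheduled function) could call this periodically
# and scale the environment down whenever the result is False.
print(should_be_running("staging", datetime.now()))
```

Keeping the schedule in one small pure function makes it easy to test and to adjust per team or time zone.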

Security and compliance

LangSmith can be configured for:

Customers can deploy in GovCloud, ISO, or HIPAA regions as needed.

Monitoring and evals

Use LangSmith to:

Integrate with: