Spaces:
Sleeping
Sleeping
| # AWS EC2 Basic Deployment | |
| This is an example deployment to AWS EC2 Compute using [terraform](https://www.terraform.io/). | |
| This deployment will do the following: | |
| - Create a security group with required ports open (22 and 8000) | |
| - Create EC2 instance with Ubuntu 22 and deploy Chroma using docker compose | |
| - Create a data volume for Chroma data | |
| - Mount the data volume to the EC2 instance | |
| - Format the data volume with ext4 | |
| - Start Chroma | |
| ## Requirements | |
| - [Terraform CLI v1.3.4+](https://developer.hashicorp.com/terraform/tutorials/gcp-get-started/install-cli) | |
| ## Deployment with terraform | |
| This deployment uses Ubuntu 22 as foundation, but you'd like to use a different AMI (non-Debian based image) you may have to adjust the startup script. | |
| To find AWS EC2 AMIs you can use: | |
| ```bash | |
| # 099720109477 is Canonical | |
| aws ec2 describe-images \ | |
| --owners 099720109477 \ | |
| --filters 'Name=name,Values=ubuntu/images/*/ubuntu-jammy*' \ | |
| --query 'sort_by(Images,&CreationDate)[-1].ImageId' | |
| ``` | |
| ### 2. Init your terraform state | |
| ```bash | |
| terraform init | |
| ``` | |
| ### 3. Deploy your application | |
| Generate SSH key to use with your chroma instance (so you can login to the EC2): | |
| > Note: This is optional. You can use your own existing SSH key if you prefer. | |
| ```bash | |
| ssh-keygen -t RSA -b 4096 -C "Chroma AWS Key" -N "" -f ./chroma-aws && chmod 400 ./chroma-aws | |
| ``` | |
| Set up your Terraform variables and deploy your instance: | |
| ```bash | |
| #AWS access key | |
| export TF_VAR_AWS_ACCESS_KEY=<AWS_ACCESS_KEY> | |
| #AWS secret access key | |
| export TF_VAR_AWS_SECRET_ACCESS_KEY=<AWS_SECRET_ACCESS_KEY> | |
| #path to the public key you generated above (or can be different if you want to use your own key) | |
| export TF_ssh_public_key="./chroma-aws.pub" | |
| #path to the private key you generated above (or can be different if you want to use your own key) - used for formatting the Chroma data volume | |
| export TF_ssh_private_key="./chroma-aws" | |
| #set the chroma release to deploy | |
| export TF_VAR_chroma_release=0.4.12 | |
| # AWS region to deploy the chroma instance to | |
| export TF_VAR_region="us-west-1" | |
| #enable public access to the chroma instance on port 8000 | |
| export TF_VAR_public_access="true" | |
| #enable basic auth for the chroma instance | |
| export TF_VAR_enable_auth="true" | |
| #The auth type to use for the chroma instance (token or basic) | |
| export TF_VAR_auth_type="token" | |
| #optional - if you want to restore from a snapshot | |
| export TF_VAR_chroma_data_restore_from_snapshot_id="" | |
| #optional - if you want to snapshot the data volume before destroying the instance | |
| export TF_VAR_chroma_data_volume_snapshot_before_destroy="true" | |
| terraform apply -auto-approve | |
| ``` | |
| > Note: Basic Auth is supported by Chroma v0.4.7+ | |
| ### 4. Check your public IP and that Chroma is running | |
| Get the public IP of your instance | |
| ```bash | |
| terraform output instance_public_ip | |
| ``` | |
| Check that chroma is running (It should take up several minutes for the instance to be ready) | |
| ```bash | |
| export instance_public_ip=$(terraform output instance_public_ip | sed 's/"//g') | |
| curl -v http://$instance_public_ip:8000/api/v1/heartbeat | |
| ``` | |
| #### 4.1 Checking Auth | |
| ##### Token | |
| When token auth is enabled you can check the get the credentials from Terraform state by running: | |
| ```bash | |
| terraform output chroma_auth_token | |
| ``` | |
| You should see something of the form: | |
| ```bash | |
| PVcQ4qUUnmahXwUgAf3UuYZoMlos6MnF | |
| ``` | |
| You can then export these credentials: | |
| ```bash | |
| export CHROMA_AUTH=$(terraform output chroma_auth_token | sed 's/"//g') | |
| ``` | |
| Using the credentials: | |
| ```bash | |
| curl -v http://$instance_public_ip:8000/api/v1/collections -H "Authorization: Bearer ${CHROMA_AUTH}" | |
| ``` | |
| ##### Basic | |
| When basic auth is enabled you can check the get the credentials from Terraform state by running: | |
| ```bash | |
| terraform output chroma_auth_basic | |
| ``` | |
| You should see something of the form: | |
| ```bash | |
| chroma:VuA8I}QyNrm0@QLq | |
| ``` | |
| You can then export these credentials: | |
| ```bash | |
| export CHROMA_AUTH=$(terraform output chroma_auth_basic | sed 's/"//g') | |
| ``` | |
| Using the credentials: | |
| ```bash | |
| curl -v http://$instance_public_ip:8000/api/v1/collections -u "${CHROMA_AUTH}" | |
| ``` | |
| > Note: Without `-u` you should be getting 401 Unauthorized response | |
| #### 4.2 Connect (ssh) to your instance | |
| To SSH to your instance: | |
| ```bash | |
| ssh -i ./chroma-aws ubuntu@$instance_public_ip | |
| ``` | |
| ### 5. Destroy your Chroma instance | |
| You will need to change `prevent_destroy` to `false` in the `aws_ebs_volume` in `chroma.tf`. | |
| ```bash | |
| terraform destroy -auto-approve | |
| ``` | |
| ## Extras | |
| You can visualize your infrastructure with: | |
| ```bash | |
| terraform graph | dot -Tsvg > graph.svg | |
| ``` | |
| >Note: You will need graphviz installed for this to work | |