6.7 KiB
6.7 KiB
gcloud Skill
Common GCP patterns for the apes platform. All commands invoke gcloud/docker directly via Bash.
Project: apes-platform
Region: europe-west1
Zone: europe-west1-b
Current Infrastructure
| Service | Host | VM | IP | Compose dir |
|---|---|---|---|---|
| Gitea | git.unslope.com | gitea-vm | 34.78.255.104 | /opt/gitea |
| Colony | apes.unslope.com | colony-vm (e2-medium) | 35.241.200.77 | /opt/colony-src/infra/colony |
SSH into VMs
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform --command="<cmd>"
Colony Deploy
# 1. SSH in and pull latest code
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash -c "cd /opt/colony-src && git pull"'
# 2. Rebuild (deps cached, only source changes recompile ~30s)
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash -c "cd /opt/colony-src/infra/colony && docker compose build 2>&1 | tail -5"'
# 3. Restart
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash -c "cd /opt/colony-src/infra/colony && docker compose up -d"'
# 4. Verify
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash -c "sleep 3 && docker logs colony 2>&1 | tail -5 && curl -s http://localhost:3001/api/health"'
Docker Compose on VMs
# Restart a service
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform \
--command="sudo bash -c 'cd <compose-dir> && docker compose restart <container>'"
# View logs (use 2>&1 for stderr)
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform \
--command='sudo docker logs <container> 2>&1 | tail -50'
# Full redeploy
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform \
--command="sudo bash -c 'cd <compose-dir> && docker compose pull && docker compose up -d'"
Install Docker on Debian 12
gcloud compute ssh <vm> --zone=europe-west1-b --project=apes-platform --command='sudo bash -c "
apt-get update && apt-get install -y ca-certificates curl gnupg git
install -m 0755 -d /etc/apt/keyrings
curl -fsSL https://download.docker.com/linux/debian/gpg | gpg --dearmor -o /etc/apt/keyrings/docker.gpg
chmod a+r /etc/apt/keyrings/docker.gpg
echo \"deb [arch=amd64 signed-by=/etc/apt/keyrings/docker.gpg] https://download.docker.com/linux/debian bookworm stable\" > /etc/apt/sources.list.d/docker.list
apt-get update && apt-get install -y docker-ce docker-ce-cli containerd.io docker-compose-plugin
systemctl enable docker && systemctl start docker
"'
Static IPs & DNS
# Reserve IP
gcloud compute addresses create <name> --region=europe-west1 --project=apes-platform
# Get IP
gcloud compute addresses describe <name> --region=europe-west1 --project=apes-platform --format='value(address)'
# DNS: Namecheap → Advanced DNS → Add A Record → host=<subdomain> value=<IP>
# Verify: dig @dns1.registrar-servers.com <domain> A +short
Firewall Rules
gcloud compute firewall-rules list --project=apes-platform
gcloud compute firewall-rules create <name> --allow=tcp:<port> --target-tags=web-server --project=apes-platform
gcloud compute firewall-rules delete <name> --project=apes-platform --quiet
New VM Pattern
gcloud compute instances create <name> \
--project=apes-platform \
--zone=europe-west1-b \
--machine-type=e2-small \
--image-family=debian-12 \
--image-project=debian-cloud \
--boot-disk-size=20GB \
--tags=web-server \
--address=<static-ip-name>
Gitea (git.unslope.com)
# Git push/pull: HTTP on port 3000 (no gcloud needed)
git clone http://git.unslope.com:3000/benji/apes.git
git remote set-url origin http://<user>:<token>@34.78.255.104:3000/benji/apes.git
# Create API token
curl -u user:pass -X POST 'https://git.unslope.com/api/v1/users/<user>/tokens' \
-H 'Content-Type: application/json' -d '{"name":"my-token","scopes":["all"]}'
# Create user (admin only, via SSH)
gcloud compute ssh gitea-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo docker exec -u git gitea gitea admin user create --username <user> --password "<pass>" --email "<email>"'
Agent Management
# Install CLI binaries on VM (first time only, ~5min)
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash -c "cd /opt/colony-src && git pull && source /root/.cargo/env && cargo build --release -p colony-cli -p colony-agent && cp target/release/colony target/release/colony-agent /usr/local/bin/"'
# Birth a new agent
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo bash /opt/colony-src/scripts/birth.sh scout "help with research"'
# Check agent status
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='systemctl status agent-scout-worker && systemctl status agent-scout-dream.timer'
# View agent logs
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='tail -50 /home/agents/scout/memory/worker.log'
# Stop/start agent
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo systemctl stop agent-scout-worker'
# Kill agent (remove everything)
gcloud compute ssh colony-vm --zone=europe-west1-b --project=apes-platform \
--command='sudo systemctl disable --now agent-scout-worker agent-scout-dream.timer && sudo userdel -r scout'
Troubleshooting
| Error | Fix |
|---|---|
| SSH timeout | VM under load (e.g. compiling Rust). Wait and retry. Don't kill builds. |
| SSH connection refused | VM still booting. Wait 30s, retry. |
| Docker not running | sudo systemctl start docker |
| Caddy cert failed | Check DNS: dig @dns1.registrar-servers.com <domain> A +short. Clear Caddy data if stuck on staging cert: sudo rm -rf <compose-dir>/caddy_data/caddy/certificates/acme-staging* |
| Container exits immediately | Check binary size — if < 1MB it's the stub binary. Rebuild with docker rmi <image> && docker compose build |
| Container restarting, no logs | Binary panicking before first print. Run interactively: docker run --rm -e DATABASE_URL=... <image> /app/<binary> |
| Git push fails (DNS) | Use IP directly: http://34.78.255.104:3000/benji/apes.git |
| Git push auth fails | Use API token, not password (special chars break URL encoding) |
| Build takes forever | First Rust build ~10min on e2-medium. Subsequent builds ~30s (deps cached). Don't use --no-cache unless Dockerfile changed. |
| Docker build SSH timeout | Build in background: nohup docker compose build > /tmp/build.log 2>&1 & then check later |