Introducing AI Infrastructure Architect

Think. Design. Simulate. Deploy.

The AI workspace for cloud infrastructure. Design AWS, Azure, and GCP architectures on a visual canvas, simulate traffic at up to 100M RPS, see your projected monthly cost, and catch the bottleneck before your users do. Then deploy in one click.

Start Building Free
★★★★★ Loved by cloud architects · SOC 2 · No credit card · Deploy to AWS · Open-source templates
image-processing-platform · production
Simulation live
API Gateway: REST · us-east-1
Lambda: resize-worker
SQS Queue: jobs.fifo
S3 Bucket: media-uploads
DynamoDB: metadata
CloudFront: global CDN
AI Architect

“Build an AI customer support platform.”

Generating architecture
✓ 27 resources planned ✓ VPC & networking → Wiring event bus…
Simulation Live
Current RPS: 0
Latency p50: 0 ms
Monthly cost: $0
No errors detected
Connected: GitHub · AWS · Terraform · Slack
AWS Deployment ready · 27 resources · No errors · Est. $164/mo
How it works

From idea to production
in four moves.

The standard workflow makes you deploy first and find out later. Pinpole compresses design, cost estimation, load testing, and architecture review into one session — before a single dollar is spent.

01

Describe what you want.

Describe your system in plain English. Pinpole’s AI Architect designs the full architecture automatically — services, networking, permissions, everything.

“Build an event-driven image processing platform.”
AI Architect: Designed 12 services across 3 availability zones. S3 → EventBridge → Lambda fan-out with DLQ & retries.
02

Edit visually.

Drag services, connect APIs, modify resources, and collaborate with your team in real time — on an infinite multi-cloud canvas.

  • Drag & drop 400+ cloud services
  • Live multiplayer cursors
  • Every edit stays deploy-ready
03

Simulate before you deploy.

Run production-grade traffic simulations against your design. Predict bottlenecks, estimate cloud costs, and identify failures — before they ever reach production.

  • Model RPS, latency & error rates
  • Forecast monthly cloud spend
  • Find the bottleneck before your users do
Traffic simulation: 10,000 RPS burst
⚠ Bottleneck: Lambda concurrency at 92% · Est. cost $182/mo
04

Deploy in one click.

Sync GitHub, generate Terraform, deploy directly to your cloud, and monitor the rollout — without leaving the canvas.

  • Sync GitHub & generate Terraform
  • Deploy directly to AWS, Azure, GCP
  • Monitor & roll back safely
pinpole deploy
$ pinpole deploy --env production
Terraform plan generated (27 resources)
GitHub synced pinpole/infra@a4f21c
VPC, subnets, security groups
Lambda functions deployed
Deployment complete in 94s — no errors
$
The platform

Four pillars.
One workspace.

Your diagram knows your cost model. Your cost model knows your traffic pattern. Your load test runs before the infrastructure exists. That's the point.

Inside the dashboard

The bridge between cloud architecture
and deployment.

Everything shipped in the Pinpole dashboard today, organized as the workflow you actually follow — design on canvas, simulate what it costs with thousands of users, connect your cloud and deploy, then manage the live stack.

Visual architecture design on an infinite canvas. No cloud account required to start.

Multi-cloud drag & drop palette

Drag services from the AWS, GCP, Azure, and Cloudflare catalogs — 13 categories from Compute to AI/ML, with provider-accurate icons and full-text search.

Production-grade templates

32 reference architectures — Stripe-like Payments, Netflix-like Streaming, RAG Chatbot, Kubernetes Platform — preview, then replace or merge into your canvas.

AI Infra Architect

Describe a workload in plain English and watch nodes and edges stream onto the canvas. Or upload a diagram photo — vision AI extracts services, connections, and groups.

Connection rule engine

Invalid source→target wiring is blocked per provider registry before it's created, with suggestions. Valid cross-cloud edges — like Cloudflare in front of AWS — are supported.
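
As a rough illustration of how a connection rule check like this could work, here is a minimal sketch. The service names, the `ALLOWED_TARGETS` table, and `checkConnection` are hypothetical and stand in for whatever registry Pinpole actually uses.

```typescript
// Hypothetical rule registry: which target types each source type may connect to.
// These rules are illustrative assumptions, not Pinpole's actual provider registry.
type ServiceType = "api-gateway" | "lambda" | "sqs" | "dynamodb" | "s3" | "cloudflare-cdn";

const ALLOWED_TARGETS: Record<ServiceType, ServiceType[]> = {
  "api-gateway": ["lambda", "sqs"],
  "lambda": ["sqs", "dynamodb", "s3"],
  "sqs": ["lambda"],
  "dynamodb": ["lambda"],
  "s3": ["lambda"],
  "cloudflare-cdn": ["api-gateway", "s3"], // cross-cloud edge: Cloudflare in front of AWS
};

interface ConnectionCheck {
  allowed: boolean;
  suggestions: ServiceType[]; // valid targets to offer when the edge is blocked
}

// Check an edge before it is created; blocked edges come back with alternatives.
function checkConnection(source: ServiceType, target: ServiceType): ConnectionCheck {
  const targets = ALLOWED_TARGETS[source];
  const allowed = targets.indexOf(target) !== -1;
  return { allowed: allowed, suggestions: allowed ? [] : targets.slice() };
}

const crossCloud = checkConnection("cloudflare-cdn", "api-gateway"); // allowed
const blockedEdge = checkConnection("s3", "dynamodb"); // blocked, suggests "lambda"
```

Keeping the rules in a plain lookup table means a blocked edge can always return the list of valid targets as its suggestion.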

Per-service configuration

Click any node to configure VPC, region, instance type, runtime, and capacity — JSON-driven schemas per provider, with draft-before-save and per-node insights.

Groups, shapes & annotations

VPC-style group frames, text labels, zone boxes, and 18 architecture actors — users, mobile, SaaS, IoT, bank — for diagrams that read like the real system.

Personal & team workspaces

Autosave with revision tracking for solo work; team workspaces with role-based edit access and live canvas sync. Share read-only embeds without login.

A real editor

60-step undo/redo, copy/paste, right-click context menus, Cmd+K command palette, auto-organize layout, and a guided first-run tour.

Export anything

PNG for decks, JSON for tooling, animated GIF of simulations — and IaC as Terraform, CloudFormation, CDK, or ARM.

Explore the Visual Canvas →

Run traffic through your design and watch cost and behaviour tick live — before spending on real infrastructure.

Real-time tick engine

A 100ms loop updates every node and edge while the simulation runs. Pause, edit traffic config, and resume mid-run.

10 → 100M RPS, 4 patterns

Logarithmic traffic slider with Constant, Ramp, Spike, and Wave patterns — or express load as total requests per day, week, or month.
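
To make the slider and the four patterns concrete, the sketch below shows one way the math could look. The curve definitions (for example, a spike as a burst in the middle tenth of the run) and all function names are assumptions for illustration, not Pinpole's engine.

```typescript
// Illustrative traffic-shape math; the curve definitions are assumptions.
type Pattern = "constant" | "ramp" | "spike" | "wave";

// Map a slider position in [0, 1] onto 10 .. 100,000,000 RPS on a log scale.
function sliderToRps(position: number): number {
  const clamped = Math.min(1, Math.max(0, position));
  return Math.round(Math.pow(10, 1 + 7 * clamped)); // 10^1 .. 10^8
}

// Convert a total request count over a period into average requests per second.
function totalToRps(total: number, periodSeconds: number): number {
  return total / periodSeconds;
}

// Target RPS at time t (seconds) for a run of the given duration and peak.
function rpsAt(pattern: Pattern, peak: number, t: number, duration: number): number {
  const progress = Math.min(1, Math.max(0, t / duration));
  switch (pattern) {
    case "constant":
      return peak;
    case "ramp":
      return peak * progress; // linear climb from 0 to peak
    case "spike":
      // 10% baseline, with a burst to peak in the middle tenth of the run
      return progress >= 0.45 && progress <= 0.55 ? peak : peak / 10;
    case "wave":
      // one full sine cycle over the run, oscillating between 0 and peak
      return peak * (0.5 + 0.5 * Math.sin(2 * Math.PI * progress));
  }
}

// 86.4 million requests per day averages out to 1,000 RPS.
const dailyAverageRps = totalToRps(86400000, 24 * 60 * 60);
```

A log scale is what lets one slider cover seven orders of magnitude: each equal step multiplies the load by the same factor.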

Stateful service models

Per-tick state for Lambda concurrency and cold starts, Kinesis shards, DynamoDB capacity, and SQS queue depth — 30+ dedicated processors.
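
Two of these models can be sketched with simple arithmetic. The example assumes steady arrivals and a fixed average duration; the function names, the 92 ms duration, and the 100 ms tick are illustrative choices rather than Pinpole's processors.

```typescript
// A sketch of per-tick state under simplifying assumptions (steady arrivals, fixed durations).

// Concurrent Lambda executions needed = requests per second x average duration in seconds.
function lambdaConcurrency(rps: number, avgDurationMs: number): number {
  return (rps * avgDurationMs) / 1000;
}

interface QueueState {
  depth: number; // messages currently waiting in the queue
}

// Advance a queue by one tick: add arrivals, then drain up to the consumer's capacity.
function tickQueue(state: QueueState, arrivalRps: number, drainRps: number, tickMs: number): QueueState {
  const arrived = (arrivalRps * tickMs) / 1000;
  const drained = (drainRps * tickMs) / 1000;
  return { depth: Math.max(0, state.depth + arrived - drained) };
}

// 10,000 RPS at an assumed 92 ms average duration needs 920 concurrent executions,
// which is 92% of a 1,000-execution concurrency limit.
const neededConcurrency = lambdaConcurrency(10000, 92);
const concurrencyUtilization = neededConcurrency / 1000;

// A queue fed at 1,000 RPS but drained at 800 RPS grows by 20 messages per 100 ms tick.
let backlog: QueueState = { depth: 0 };
for (let i = 0; i < 10; i++) {
  backlog = tickQueue(backlog, 1000, 800, 100);
}
```

Because each tick takes the previous state as input, a backlog that builds during a spike carries over into later ticks instead of resetting.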

Live cost dashboard

$/sec, $/hour, $/day, and $/month projections update as the run progresses, with a per-node breakdown sorted by load.
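
The projections are straightforward scaling of a per-second rate. The sketch below assumes a 30-day month and uses hypothetical names (`NodeCost`, `project`, `dashboard`); the actual billing model behind the dashboard is not specified here.

```typescript
// Illustrative projection of a per-second cost rate; assumes a 30-day month.
interface NodeCost {
  label: string;
  loadPercent: number;
  dollarsPerSecond: number;
}

interface Projection {
  perSecond: number;
  perHour: number;
  perDay: number;
  perMonth: number;
}

function project(dollarsPerSecond: number): Projection {
  const perHour = dollarsPerSecond * 3600;
  const perDay = perHour * 24;
  return { perSecond: dollarsPerSecond, perHour: perHour, perDay: perDay, perMonth: perDay * 30 };
}

// Total projection plus a per-node breakdown, busiest node first.
function dashboard(nodes: NodeCost[]): { total: Projection; breakdown: NodeCost[] } {
  const totalRate = nodes.reduce((sum, n) => sum + n.dollarsPerSecond, 0);
  const breakdown = nodes.slice().sort((a, b) => b.loadPercent - a.loadPercent);
  return { total: project(totalRate), breakdown: breakdown };
}

const costView = dashboard([
  { label: "api-gateway", loadPercent: 40, dollarsPerSecond: 0.00002 },
  { label: "lambda", loadPercent: 92, dollarsPerSecond: 0.00005 },
]);
```

With these sample rates the total is $0.00007 per second, or roughly $181 over a 30-day month.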

AI workload pricing

Bedrock and AgentCore modelled with token and unit pricing — your GenAI stack costed like any other service.

Overlays, alerts & percentiles

Load %, sparklines, latency, and health per node; animated edges with throughput labels; P50/P95/P99 across the graph; alerts with actionable fixes.
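
For readers unfamiliar with P50/P95/P99, here is a minimal sketch using the nearest-rank method. Which percentile method Pinpole uses is not stated, so treat the choice of nearest-rank as an assumption.

```typescript
// Nearest-rank percentile over latency samples in milliseconds.
function percentile(samples: number[], p: number): number {
  if (samples.length === 0) {
    throw new Error("no samples");
  }
  const sorted = samples.slice().sort((a, b) => a - b);
  const rank = Math.ceil((p * sorted.length) / 100); // 1-based rank
  return sorted[Math.max(1, rank) - 1];
}

// Latencies of 1..100 ms, one sample each.
const latencies: number[] = [];
for (let ms = 1; ms <= 100; ms++) {
  latencies.push(ms);
}

const p50 = percentile(latencies, 50);
const p95 = percentile(latencies, 95);
const p99 = percentile(latencies, 99);
```

P99 is usually the number to watch: a healthy-looking median can hide a slow tail that only the higher percentiles reveal.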

AI recommendations

A rule engine plus an AI pass suggests cost, latency, and resilience improvements. One click applies the fix to the canvas and wires the edges.

History & sharing

Every run persists with config, peak RPS, cost, and recommendations. Share tokenized interactive simulation embeds — no login needed.

MCP simulation

AI agents run batch simulations through the API and get an inline simulation canvas back in chat.

Explore Simulation →

Go from canvas design to live infrastructure in your own cloud account — with zero long-lived keys stored.

Pinpole as OIDC issuer

Your cloud trusts Pinpole via standard OIDC/WIF federation. Every deploy uses short-lived credentials minted on demand — no stored secrets.

AWS OIDC connect & deploy

One-click CloudFormation Quick Create sets up the IAM OIDC provider and deploy role. Canvas → CloudFormation → AssumeRoleWithWebIdentity → live stack.

GCP Workload Identity Federation

Provision the WIF pool via Terraform ZIP + Cloud Shell or Google OAuth, then deploy through Cloud Build in your own project.

Cloudflare realtime deploy

Connect with an API token and apply Workers, Pages, KV, D1, Queues, R2, and DNS in real time — or import existing resources onto the canvas.

Azure quick deploy

Generate an ARM template from the canvas and deploy through the Azure Portal link. Full OIDC connect is next on the roadmap.

Quick deploy — no connect

No credentials at all: get the generated template plus a console Quick Create URL and run it from your own cloud console.

GitHub → Lambda code sync

Link a repo, branch, and handler path to any Lambda node. Pinpole discovers SAM/Serverless functions and auto-syncs code on every push — no stack redeploy.

IaC generation

Terraform, CloudFormation, CDK, and SAM for AWS; Terraform and ARM for Azure; Terraform for GCP — with optional AI refinement of the output.

Team cloud accounts

Shared AWS/GCP connections at team level, with role-gated deploy permissions and plan-based deploy quotas.

Explore Deploy →

The canvas stays in charge after the deploy — sync changes, detect drift, and manage the live stack.

Deploy becomes Sync

After connect, one button pushes saved canvas changes to the live stack — CloudFormation on AWS, Cloud Build on GCP, realtime on Cloudflare.

Incremental apply

A diff engine compares the canvas against the last deploy and routes small changes to direct API updates instead of full stack updates.
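
One way to picture this is a diff over two snapshots followed by a routing decision. The sketch assumes each node is a flat map of config fields, and the routing rule (field-level edits go direct, added or removed nodes trigger a full update) is a hypothetical policy, not Pinpole's documented behaviour.

```typescript
// A sketch of a canvas diff, assuming each node is a flat map of config fields.
type NodeConfig = Record<string, string | number | boolean>;
type Snapshot = Record<string, NodeConfig>; // node id -> config

interface Diff {
  added: string[];
  removed: string[];
  changed: Record<string, string[]>; // node id -> names of fields that differ
}

function hasNode(snapshot: Snapshot, id: string): boolean {
  return Object.prototype.hasOwnProperty.call(snapshot, id);
}

function diffSnapshots(lastDeploy: Snapshot, canvas: Snapshot): Diff {
  const diff: Diff = { added: [], removed: [], changed: {} };
  for (const id of Object.keys(canvas)) {
    if (!hasNode(lastDeploy, id)) {
      diff.added.push(id);
      continue;
    }
    const before = lastDeploy[id];
    const after = canvas[id];
    const fields = Object.keys(before)
      .concat(Object.keys(after))
      .filter((f, i, all) => all.indexOf(f) === i) // unique field names
      .filter((f) => before[f] !== after[f]);
    if (fields.length > 0) {
      diff.changed[id] = fields;
    }
  }
  for (const id of Object.keys(lastDeploy)) {
    if (!hasNode(canvas, id)) {
      diff.removed.push(id);
    }
  }
  return diff;
}

// Hypothetical routing rule: only field-level edits to existing nodes take the direct path.
function route(diff: Diff): "none" | "direct-api-update" | "full-stack-update" {
  if (diff.added.length + diff.removed.length > 0) {
    return "full-stack-update";
  }
  return Object.keys(diff.changed).length > 0 ? "direct-api-update" : "none";
}
```

Under this rule, bumping a Lambda's memory from 128 to 256 produces a one-field diff and takes the direct path, while adding a new queue node falls back to a full stack update.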

Per-service & per-field push

Push a single service's config — or one field — to a live AWS resource without redeploying anything else.

Drift detection

Compare the canvas against live CloudFormation and Lambda state on AWS, the last deployed snapshot on GCP, or resource fingerprints on Cloudflare.
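
The fingerprint approach can be sketched as follows: hash a canonical form of each resource's config at deploy time, then compare against a hash of the live config. The FNV-1a hash (applied to UTF-16 code units), the flat `Config` shape, and the function names are all assumptions for illustration.

```typescript
// Fingerprint-based drift check; the hash choice and config shape are illustrative.
type Config = Record<string, string | number | boolean>;

// Serialize with sorted keys so field order does not affect the fingerprint.
function canonical(config: Config): string {
  return Object.keys(config)
    .sort()
    .map((k) => k + "=" + JSON.stringify(config[k]))
    .join("&");
}

// 32-bit FNV-1a over UTF-16 code units, returned as 8 hex characters.
function fingerprint(config: Config): string {
  const text = canonical(config);
  let hash = 0x811c9dc5;
  for (let i = 0; i < text.length; i++) {
    hash ^= text.charCodeAt(i);
    // Multiply by the FNV prime 16777619 using shifts, keeping the result in 32 bits.
    hash = (hash + (hash << 1) + (hash << 4) + (hash << 7) + (hash << 8) + (hash << 24)) >>> 0;
  }
  return ("0000000" + hash.toString(16)).slice(-8);
}

// Return the ids of resources that are missing or whose live config no longer matches.
function findDrift(deployed: Record<string, string>, live: Record<string, Config>): string[] {
  return Object.keys(deployed).filter((id) => {
    const current = live[id];
    return current === undefined || fingerprint(current) !== deployed[id];
  });
}

const deployedFingerprints = {
  worker: fingerprint({ memory: 128, runtime: "nodejs20.x" }),
  queue: fingerprint({ fifo: true }),
};
const drifted = findDrift(deployedFingerprints, {
  worker: { runtime: "nodejs20.x", memory: 256 }, // memory changed outside the canvas
  queue: { fifo: true },
});
```

Storing only fingerprints keeps the comparison cheap, at the cost of knowing that a resource drifted without knowing which field changed.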

Deployment history & detail

Versioned per-workspace history with status, trigger, resources, and cost — plus live status polling, timeline, stack outputs, and console deep links.

Live resource cards

Physical name, ARN, and endpoint URL per node, with "Open in AWS Console" links and a resource info API.

Notifications & events

Per-deployment event streams, in-app toasts and inbox, and email preferences for deploy and simulation outcomes.

Teams, audit & security

Role-based collaboration (owner, admin, dev, viewer), an immutable MFA-gated audit log, API/MCP tokens, and session management.

Cloud Terminal

A CLI-style surface to query simulated state and explore quotas without leaving the session.

Explore post-deploy management →
End to end — the GitHub + Lambda workflow
1. Design API Gateway → Lambda → DynamoDB on the canvas
2. Simulate at 50,000 RPS, then apply the AI recommendation to add an SQS buffer
3. Connect AWS via OIDC, deploy, link the GitHub repo to the Lambda node
4. Push to GitHub: code auto-syncs, canvas Sync pushes infra changes, drift check confirms live matches design

One workspace, every provider. Pinpole speaks your stack’s language.

For developers

An API-first platform.
Automate everything.

REST API & SDKs

Everything in the UI,
available as an API.

Create architectures, run simulations, and trigger deployments programmatically. Type-safe SDKs for TypeScript and Python, webhooks for everything else.

  • Architectures, simulations & deployments as resources
  • Webhooks for deploy & simulation events
  • Fine-grained API keys with scoped permissions
Explore the API reference →
$ curl -X POST https://api.pinpole.cloud/v1/architectures \
  -H "Authorization: Bearer $PINPOLE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Event-driven image pipeline on AWS",
    "simulate": { "rps": 1000 }
  }'

// 201 Created
{
  "id": "arch_8fk2m",
  "resources": 27,
  "simulation": { "p50_ms": 26, "monthly_cost": 182 }
}

import { Pinpole } from "@pinpole/sdk";

const pinpole = new Pinpole();

const arch = await pinpole.architectures.create({
  prompt: "Event-driven image pipeline on AWS",
});

const sim = await arch.simulate({ rps: 1_000 });

console.log(sim.monthlyCost); // $182
console.log(sim.bottlenecks); // []

await arch.deploy({ env: "production" });

$ pinpole export --format terraform

 Generated 27 resources
  ├─ main.tf
  ├─ variables.tf
  ├─ networking.tf
  └─ outputs.tf

$ terraform plan
Plan: 27 to add, 0 to change, 0 to destroy.

MCP Gateway

Your infrastructure,
available to every AI agent.

The Pinpole MCP server exposes your canvases, simulations and deployments as tools any MCP-compatible agent can call — Claude, Cursor, or your own.

pinpole_create_project · pinpole_build_architecture · pinpole_draw_on_canvas · pinpole_simulate_cost · pinpole_open_canvas
@pinpole/mcp on npm →
agent session
$ npm install -g @pinpole/mcp
 pinpole-mcp connected (9 tools)

 Design a RAG pipeline and estimate its cost

 pinpole_build_architecture("RAG pipeline")
  └─ 14 resources on canvas
 pinpole_simulate_cost(rps: 500)
  └─ $96/month · no bottlenecks

 Done — canvas ready to review.
Model Hub

Bring any model
into your workflows.

Choose which intelligence powers your AI Architect and agent workflows. Swap models per project, per workflow — or bring your own keys.

  • Frontier models from every major lab
  • BYO API keys — your data never trains anyone’s model
  • Per-workflow routing & fallbacks
Explore the Model Hub →
Claude · Anthropic
GPT-5 · OpenAI
Gemini · Google
Llama · Meta
Mistral · Mistral AI
+ BYOK · Your own endpoint
Explore all models in the Model Hub →
Why Pinpole

Less guessing. More shipping.

Infrastructure decisions made on guesswork are a systems failure, not a personal one. Pinpole gives engineers the data to make decisions they can defend.

Design

Visual cloud architecture that stays perfectly in sync with what actually runs.

Validate

Run production traffic simulations and know your costs and limits before launch.

Deploy

Ship with confidence — one click from validated design to running infrastructure.

Build with AI

What would you like to build?

Type it. Pinpole architects it.

Stop guessing.
Start building infrastructure
that works.

Start Free

Free forever for individuals · No credit card required