Show Your Work
The first public benchmark for AI Code Remediation
Open methodology. Reproduce it. Beat it.
15
Scenarios
43
Merge-Ready Fixes
100%
Loop Closure
12
Mins
$17
In Tokens
PR review time is up by 91%. Developers are merging 98% more code.
The generation problem is solved.
The trust problem isn’t.
Gomboc sits above Cursor, Claude Code and Copilot to make every fix
accurate, governed, and cost-optimized. The benchmark is the proof.
What every fix should be
First public benchmark for AI Code Remediation. Real numbers, repeatable methodology.
Every fix is tied to the policy that triggered it. Idempotent, auditable and merge-ready.
Idempotent fixes mean no re-spending tokens on the same problem.
The 15 Scenarios
- AWS
- SECURITY
AWS S3 SSE-KMS Encryption
Remediates S3 buckets missing server-side encryption so data at rest is protected with customer-managed KMS keys.
VIEW FIX ON GITHUB
- GCP
- SECURITY
GCS Uniform Bucket-Level Access
Enforces uniform bucket-level access on GCS buckets to eliminate object-level ACLs that bypass central IAM policies.
VIEW FIX ON GITHUB
- Kubernetes
- Reliability
Kubernetes Resource Limits
Adds CPU and memory requests and limits to containers so workloads cannot exhaust shared cluster resources.
VIEW FIX ON GITHUB
- AWS
- SECURITY
AWS Security Group SSH Ingress Restriction
Restricts security group SSH ingress from the open internet to approved corporate CIDR ranges.
VIEW FIX ON GITHUB
- GCP
- SECURITY
GCP GKE Workload Identity + Shielded Nodes
Enables Workload Identity and Shielded Nodes on GKE clusters for per-pod GCP identity and hardware-rooted node trust.
VIEW FIX ON GITHUB
- GCP
- Reliability
AWS RDS Multi-AZ + Automated Backups
Configures Multi-AZ deployment and automated backups on RDS instances to survive AZ failures and enable point-in-time recovery.
VIEW FIX ON GITHUB
- Azure
- Security
Azure Subnet NSG Association
Associates Network Security Groups with subnets so inbound and outbound traffic is filtered at the network layer.
VIEW FIX ON GITHUB
- GCP
- Security
GCP Cloud SQL SSL + Authorized Networks
Requires SSL for Cloud SQL connections and restricts authorized networks so databases are not exposed on the public internet.
VIEW FIX ON GITHUB
- Kubernetes
- Security
Kubernetes StatefulSet Image Update
Updates a StatefulSet container image to patch known Redis CVEs in production caching workloads.
VIEW FIX ON GITHUB
- GCP
- Cost
GCE Disk Type Migration (Hyperdisk Balanced)
Migrates overprovisioned pd-ssd disks to Hyperdisk Balanced to decouple capacity from IOPS and reduce storage waste.
VIEW FIX ON GITHUB
- AWS
- Reliability
AWS EKS Multi-AZ Expansion
Expands an EKS cluster to a third Availability Zone with coordinated subnet, node pool, and autoscaler changes.
VIEW FIX ON GITHUB
- AWS
- Cost
AWS EC2 Right-Sizing + Scheduling
Right-sizes underutilized dev instances and schedules batch ETL instances to cut EC2 spend without breaking workload patterns.
VIEW FIX ON GITHUB
- AWS
- Security
AWS S3 Legacy Provider Encryption (Inline Syntax)
Adds encryption to S3 buckets defined with legacy inline provider syntax that modern scanners often miss.
VIEW FIX ON GITHUB
- KUBERNETES
- Cost
AWS CloudTrail Organizational Consolidation
Consolidates redundant per-account CloudTrail trails into a single organizational trail to cut cost and unify audit logs.
VIEW FIX ON GITHUB
- AWS
- SECURITY
- Cost
AWS VPC Endpoints (NAT Gateway Traffic Bypass)
Adds VPC endpoints so private traffic to AWS services bypasses NAT gateways, reducing data transfer cost and public egress.
VIEW FIX ON GITHUB
- AWS
- SECURITY
AWS S3 SSE-KMS Encryption
Remediates S3 buckets missing server-side encryption so data at rest is protected with customer-managed KMS keys.
VIEW FIX ON GITHUB
- GCP
- SECURITY
GCS Uniform Bucket-Level Access
Enforces uniform bucket-level access on GCS buckets to eliminate object-level ACLs that bypass central IAM policies.
VIEW FIX ON GITHUB
- AWS
- SECURITY
AWS Security Group SSH Ingress Restriction
Restricts security group SSH ingress from the open internet to approved corporate CIDR ranges.
VIEW FIX ON GITHUB
- GCP
- SECURITY
GCP GKE Workload Identity + Shielded Nodes
Enables Workload Identity and Shielded Nodes on GKE clusters for per-pod GCP identity and hardware-rooted node trust.
VIEW FIX ON GITHUB
- Azure
- Security
Azure Subnet NSG Association
Associates Network Security Groups with subnets so inbound and outbound traffic is filtered at the network layer.
VIEW FIX ON GITHUB
- GCP
- Security
GCP Cloud SQL SSL + Authorized Networks
Requires SSL for Cloud SQL connections and restricts authorized networks so databases are not exposed on the public internet.
VIEW FIX ON GITHUB
- Kubernetes
- Security
Kubernetes StatefulSet Image Update
Updates a StatefulSet container image to patch known Redis CVEs in production caching workloads.
VIEW FIX ON GITHUB
- AWS
- Security
AWS S3 Legacy Provider Encryption (Inline Syntax)
Adds encryption to S3 buckets defined with legacy inline provider syntax that modern scanners often miss.
VIEW FIX ON GITHUB
- AWS
- SECURITY
- Cost
AWS VPC Endpoints (NAT Gateway Traffic Bypass)
Adds VPC endpoints so private traffic to AWS services bypasses NAT gateways, reducing data transfer cost and public egress.
VIEW FIX ON GITHUB
- Kubernetes
- Reliability
Kubernetes Resource Limits
Adds CPU and memory requests and limits to containers so workloads cannot exhaust shared cluster resources.
VIEW FIX ON GITHUB
- GCP
- Reliability
AWS RDS Multi-AZ + Automated Backups
Configures Multi-AZ deployment and automated backups on RDS instances to survive AZ failures and enable point-in-time recovery.
VIEW FIX ON GITHUB
- AWS
- Reliability
AWS EKS Multi-AZ Expansion
Expands an EKS cluster to a third Availability Zone with coordinated subnet, node pool, and autoscaler changes.
VIEW FIX ON GITHUB
- GCP
- Cost
GCE Disk Type Migration (Hyperdisk Balanced)
Migrates overprovisioned pd-ssd disks to Hyperdisk Balanced to decouple capacity from IOPS and reduce storage waste.
VIEW FIX ON GITHUB
- AWS
- Cost
AWS EC2 Right-Sizing + Scheduling
Right-sizes underutilized dev instances and schedules batch ETL instances to cut EC2 spend without breaking workload patterns.
VIEW FIX ON GITHUB
- KUBERNETES
- Cost
AWS CloudTrail Organizational Consolidation
Consolidates redundant per-account CloudTrail trails into a single organizational trail to cut cost and unify audit logs.
VIEW FIX ON GITHUB
- AWS
- SECURITY
- Cost
AWS VPC Endpoints (NAT Gateway Traffic Bypass)
This scenario illustrates a billing anti-pattern that is extremely common but rarely detected because it requires combining network topology knowledge
VIEW FIX ON GITHUB
How we measure a fix
IDEMPOTENT
Apply it twice. Same result. No side effects.
POLICY-ALIGNED
Every fix is tied to the standard that triggered it.
AUDITABLE
Full trail from finding to merged PR.
TESTED
Validated against the actual environment config.
MERGE-READY
No human rewriting required to ship.
The Challenge
We published ours. It’s time to
Show Your Work
The era of vibes is over.