Show Your Work

The first public benchmark for AI Code Remediation

Open methodology. Reproduce it. Beat it.

15

Scenarios

43

Merge-Ready Fixes

100%

Loop Closure

12

Mins

$17

In Tokens

PR review time is up by 91%. Developers are merging 98% more code.
The generation problem is solved.
The trust problem isn’t. 

Gomboc sits above Cursor, Claude Code and Copilot to make every fix
accurate, governed, and cost-optimized. The benchmark is the proof.

What every fix should be

First public benchmark for AI Code Remediation. Real numbers, repeatable methodology. 

Every fix is tied to the policy that triggered it. Idempotent, auditable and merge-ready.

Idempotent fixes mean no re-spending tokens on the same problem. 

The 15 Scenarios

  • AWS
  • SECURITY

AWS S3 SSE-KMS Encryption

Remediates S3 buckets missing server-side encryption so data at rest is protected with customer-managed KMS keys.

VIEW FIX ON GITHUB

  • GCP
  • SECURITY

GCS Uniform Bucket-Level Access

Enforces uniform bucket-level access on GCS buckets to eliminate object-level ACLs that bypass central IAM policies.

VIEW FIX ON GITHUB

  • Kubernetes
  • Reliability

Kubernetes Resource Limits

Adds CPU and memory requests and limits to containers so workloads cannot exhaust shared cluster resources.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY

AWS Security Group SSH Ingress Restriction

Restricts security group SSH ingress from the open internet to approved corporate CIDR ranges.

VIEW FIX ON GITHUB

  • GCP
  • SECURITY

GCP GKE Workload Identity + Shielded Nodes

Enables Workload Identity and Shielded Nodes on GKE clusters for per-pod GCP identity and hardware-rooted node trust.

VIEW FIX ON GITHUB

  • GCP
  • Reliability

AWS RDS Multi-AZ + Automated Backups

Configures Multi-AZ deployment and automated backups on RDS instances to survive AZ failures and enable point-in-time recovery.

VIEW FIX ON GITHUB

  • Azure
  • Security

Azure Subnet NSG Association

Associates Network Security Groups with subnets so inbound and outbound traffic is filtered at the network layer.

VIEW FIX ON GITHUB

  • GCP
  • Security

GCP Cloud SQL SSL + Authorized Networks

Requires SSL for Cloud SQL connections and restricts authorized networks so databases are not exposed on the public internet.

VIEW FIX ON GITHUB

  • Kubernetes
  • Security

Kubernetes StatefulSet Image Update

Updates a StatefulSet container image to patch known Redis CVEs in production caching workloads.

VIEW FIX ON GITHUB

  • GCP
  • Cost

GCE Disk Type Migration (Hyperdisk Balanced)

Migrates overprovisioned pd-ssd disks to Hyperdisk Balanced to decouple capacity from IOPS and reduce storage waste.

VIEW FIX ON GITHUB

  • AWS
  • Reliability

AWS EKS Multi-AZ Expansion

Expands an EKS cluster to a third Availability Zone with coordinated subnet, node pool, and autoscaler changes.

VIEW FIX ON GITHUB

  • AWS
  • Cost

AWS EC2 Right-Sizing + Scheduling

Right-sizes underutilized dev instances and schedules batch ETL instances to cut EC2 spend without breaking workload patterns.

VIEW FIX ON GITHUB

  • AWS
  • Security

AWS S3 Legacy Provider Encryption (Inline Syntax)

Adds encryption to S3 buckets defined with legacy inline provider syntax that modern scanners often miss.

VIEW FIX ON GITHUB

  • KUBERNETES
  • Cost

AWS CloudTrail Organizational Consolidation

Consolidates redundant per-account CloudTrail trails into a single organizational trail to cut cost and unify audit logs.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY
  • Cost

AWS VPC Endpoints (NAT Gateway Traffic Bypass)

Adds VPC endpoints so private traffic to AWS services bypasses NAT gateways, reducing data transfer cost and public egress.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY

AWS S3 SSE-KMS Encryption

Remediates S3 buckets missing server-side encryption so data at rest is protected with customer-managed KMS keys.

VIEW FIX ON GITHUB

  • GCP
  • SECURITY

GCS Uniform Bucket-Level Access

Enforces uniform bucket-level access on GCS buckets to eliminate object-level ACLs that bypass central IAM policies.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY

AWS Security Group SSH Ingress Restriction

Restricts security  group SSH ingress from the open internet to approved corporate CIDR ranges.

VIEW FIX ON GITHUB

  • GCP
  • SECURITY

GCP GKE Workload Identity + Shielded Nodes

Enables Workload  Identity and Shielded Nodes on GKE clusters for per-pod GCP identity and  hardware-rooted node trust.

VIEW FIX ON GITHUB

  • Azure
  • Security

Azure Subnet NSG Association

Associates Network  Security Groups with subnets so inbound and outbound traffic is filtered at  the network layer.

VIEW FIX ON GITHUB

  • GCP
  • Security

GCP Cloud SQL SSL + Authorized Networks

Requires SSL for  Cloud SQL connections and restricts authorized networks so databases are not  exposed on the public internet.

VIEW FIX ON GITHUB

  • Kubernetes
  • Security

Kubernetes StatefulSet Image Update

Updates a StatefulSet  container image to patch known Redis CVEs in production caching workloads.

VIEW FIX ON GITHUB

  • AWS
  • Security

AWS S3 Legacy Provider Encryption (Inline Syntax)

Adds encryption to S3  buckets defined with legacy inline provider syntax that modern scanners often  miss.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY
  • Cost

AWS VPC Endpoints (NAT Gateway Traffic Bypass)

Adds VPC endpoints so  private traffic to AWS services bypasses NAT gateways, reducing data transfer  cost and public egress.

VIEW FIX ON GITHUB

  • Kubernetes
  • Reliability

Kubernetes Resource Limits

Adds CPU and memory  requests and limits to containers so workloads cannot exhaust shared cluster  resources.

VIEW FIX ON GITHUB

  • GCP
  • Reliability

AWS RDS Multi-AZ + Automated Backups

Configures Multi-AZ  deployment and automated backups on RDS instances to survive AZ failures and  enable point-in-time recovery.

VIEW FIX ON GITHUB

  • AWS
  • Reliability

AWS EKS Multi-AZ Expansion

Expands an EKS  cluster to a third Availability Zone with coordinated subnet, node pool, and  autoscaler changes.

VIEW FIX ON GITHUB

  • GCP
  • Cost

GCE Disk Type Migration (Hyperdisk Balanced)

Migrates  overprovisioned pd-ssd disks to Hyperdisk Balanced to decouple capacity from  IOPS and reduce storage waste.

VIEW FIX ON GITHUB

  • AWS
  • Cost

AWS EC2 Right-Sizing + Scheduling

Right-sizes  underutilized dev instances and schedules batch ETL instances to cut EC2  spend without breaking workload patterns.

VIEW FIX ON GITHUB

  • KUBERNETES
  • Cost

AWS CloudTrail Organizational Consolidation

Consolidates  redundant per-account CloudTrail trails into a single organizational trail to  cut cost and unify audit logs.

VIEW FIX ON GITHUB

  • AWS
  • SECURITY
  • Cost

AWS VPC Endpoints (NAT Gateway Traffic Bypass)

This scenario illustrates a billing anti-pattern that is extremely common but rarely detected because it requires combining network topology knowledge

VIEW FIX ON GITHUB

How we measure a fix

IDEMPOTENT

Apply it twice. Same result. No side effects. 

POLICY-ALIGNED

Every fix is tied to the standard that triggered it. 

AUDITABLE

Full trail from finding to merged PR. 

TESTED

Validated against the actual environment config. 

MERGE-READY

No human rewriting required to ship.

The Challenge

We published ours. It’s time to

Show Your
Work

The era of vibes is over.