Blitzy Fixed and Enhanced Claude's C Compiler

Mar 19, 2026 • Murph Vandervelde • 5 min read

370 Engineering Hours in 4 Days. 226,000 LoC. 1,260 Tests. Here's How.

Our first two Blitzy Open Source Initiative posts focused on Great Refactor projects in dnsmasq and curl. For this blog, given all the attention it has garnered, we set Blitzy on a mission to fix existing issues with Claude's C Compiler (CCC) and enhance the functionality of CCC in 1 run.

Claude's Very Public Compiler Release

On February 6, 2026, Anthropic announced that 16 Claude Opus 4.6 agents had built a C compiler from scratch in 14 days. A run of this length is quite impressive. Claude Code is an amazing platform for accelerating developer velocity on smaller software engineering tasks, as Blitzy's internal team are large users ourselves. The compiler project also illustrates the gap between research and production standards. A senior Anthropic researcher still had to act as full-time conductor, logging over 2,000 interactive turns with the Claude agents and writing many of the test cases.

The project generated enormous excitement: a working compiler, written primarily by agents, capable of compiling the Linux kernel. Headlines called it "a new era of autonomous coding". The GitHub repo racked up thousands of stars overnight, but when engineers tried to use it, they were met with a variety of unexpected problems.

The very first issue opened on the GitHub repo was titled, "Hello world does not compile." The compiler couldn't find its own system headers. John Regehr, one of the world's foremost experts on compiler correctness, ran his fuzzing tools against CCC and found 14 miscompilations out of 101 Csmith programs and 5 out of 101 YARPGen programs. His conclusion: "From the point of view of people working with production compilers, CCC isn't useful."

Deep Codebase Understanding: Tech Spec

Before modifying a single line of Rust, Blitzy ingested and analyzed the entire Claude C Compiler codebase and created a detailed technical specification of 351 source files totaling 186,696 lines across 17 C header files and 6 shell stubs.

The requirements mapped the full scope of what needed to be fixed across 9 audit areas including:

13 active bugs causing incorrect codegen or blocking real-world compilation, from ARM assembler CASPAL instruction encoding failures to macro parameter prefix-matching substitution errors.
C11 language feature gaps with incomplete _Atomic support, missing VLAs, absent _Generic selection, and a _Pragma operator explicitly skipped in the source code.
Zero external Rust crate dependencies, meaning every fix had to be implemented from scratch using only the Rust standard library.
Single-tier optimization pipeline where every -O level ran the same passes, confirming what benchmarkers had already discovered: optimization flags did nothing.

The analysis also surfaced the structural issues identified from the outside: optimization toward test-suite passage rather than general correctness, silent acceptance of invalid C programs that GCC would reject, and register spilling patterns that inflated binary sizes by 2.7–3x compared to GCC output.

What Blitzy Built

Blitzy generated the entire run from this prompt. See the pull request and project guide that recaps the work. The PR includes detailed descriptions for all relevant commits to walk through the changes and decisions made.

Here are the numbers:

92% of the entire project (370 engineering hours) completed autonomously in 4 days by parallelizing development work
10,785 autonomous agent turns (communication points between Blitzy agents, akin to human and agent interaction in iterative tools like Claude Code) compared to Claude Code's 2,000 human/agent turns
39,455 net new lines of code expanding the codebase from 186,696 to 226,151 lines, with 527 new files and 134 modified files
1,260 tests (753 unit, 507 integration) with a 100% pass rate across all four architectures. Zero compilation errors.
All 13 active bugs resolved with dedicated regression tests for each, stabilizing the compilation baseline
Full C11 language conformance including _Atomic with architecture-specific codegen across all 4 backends, _Generic, _Static_assert, _Alignas, _Noreturn, VLAs, _Complex Annex G fixes, restrict qualifier propagation, and inline linkage semantics
6-tier optimization pipeline replacing the single-tier system: -O0 through -Oz with distinct pass configurations, a new 1,893-line loop unrolling pass at -O3, and configurable inlining thresholds
Tail call optimization extended from x86-64-only to all 4 architectures
1,883-line linker script parser supporting SECTIONS, MEMORY, ENTRY, PROVIDE, and KEEP directives
62 ABI compliance tests cross-linking CCC caller and GCC callee across all 4 architectures

The architectural decisions matter as much as the volume.

The original CCC ran every optimization pass at every level. Passing -O3 did nothing different from -O0. The binaries were byte-identical. That's why independent benchmarks showed CCC producing executables 737–158,000x slower than GCC on complex operations.

Blitzy replaced this with six distinct tiers, each with a documented pass configuration. The difference is not cosmetic. The tiers change what the compiler actually does with your code.

Performance Improvements

After Blitzy's work, CCC's benchmarking profile shifted significantly. Compilation speed is 1.4–4.5x faster than GCC across all optimization levels. The gap widens at -O3, where GCC's heavier optimizer passes cost more time. Binary size is consistently 10% smaller than GCC. Sequential integer codegen is production-quality: FNV-1a hashing achieves full parity with GCC at -O3, proving the scalar code generation pipeline is solid. Loop-depth-aware spill weight in the register allocator reduces the stack frame bloat that benchmarkers had flagged as a major issue. The compiler frontend and scalar codegen are now solid.

Remaining Work

With 92% completion executed by Blitzy, the remaining 8% is well-understood. Auto-vectorization is absent, and floating-point optimization shows zero scaling from -O0 to -O3, leaving Blitzy's CCC 3–12x slower on data-parallel workloads like matrix multiply and float arithmetic. Blitzy understands the gaps and has provided a clear plan to human engineers to work with copilots (perhaps Claude Code) to finalize the build.

We have tested this across 1,260 different tests, but have not tested for every edge case. If you find any gaps, thank you for helping improve this project. Please report your findings to [email protected], and we can show you how to refine our PR in Blitzy. Review how to use Blitzy in our dev docs.

What's Next

CCC was the most high-profile AI coding demonstration of the year. Claude Code is a revolutionary addition to the arsenal of developer tools globally, but when building a large codebase autonomously, the results revealed that under real-world scrutiny, assembling known techniques and passing test suites is not the same thing as building production software.

Blitzy's fixes and enhancements signal what is possible with autonomous software development. The platform completed 370 engineering hours of autonomous work on this project. Every bug fix, C11 feature, optimization tier, and ABI test got the same level of attention. Claude Code, designed to accelerate developer productivity, is not equipped with the level of system understanding and agentic orchestration that Blitzy possesses.

Claude C Compiler was our third open source project, demonstrating Blitzy's technical maturity and our complementary nature to Claude Code. Blitzy is the only platform that generates code that can stand up to the scrutiny of real-world edge cases in production.

To stay current on Blitzy's Open Source Enhancement Initiative, subscribe to our newsletter and follow us on LinkedIn.

Frequently asked questions

What is Blitzy?

Blitzy enables development teams to transform six-month software projects into six-day turnarounds using Blitzy OS, an agentic platform that enables thousands of AI Agents to 'think' and cooperate for hours to bulk build software with precision. The platform builds everything AI can deliver in a precise manner, around 80% of any roadmap or new product, supplemented with a human engineering guide to complete the remaining 20% needed for production. With over 27 patents and counting, Blitzy is actively hiring PhDs and senior developers in Cambridge, MA who have a passion for building AI that leverages 'System 2 Thinking' to solve problems at inference.

Who is Blitzy for?

Enterprises that aim to dramatically accelerate their software development velocity, development agencies with enterprise clients, development teams with complex existing products, and individuals looking to accelerate their own velocity on complex builds.

How does Blitzy's technology work?

Our patent-pending code ingestion framework maps a curated selection of robust, reliable, and secure open source software libraries that we track by version and update frequently. Combined with our proprietary code generation technology that specializes on enforcing enterprise-class software policies, Blitzy far exceeds the utility of typical chatbots and co-pilots in creating production-ready software at scale.

Is Blitzy a coding co-pilot?

Nope. Blitzy surpasses traditional co-pilots with its ability to autonomously generate nearly-complete code repositories, not just snippets. It features a daily-refreshed knowledge base, avoiding the pitfalls of outdated information. Blitzy's proprietary codebase representation system enables deep understanding of generated code, offering highly contextual and relevant suggestions for your entire repository.

What's my role in Blitzy's development process?

Your team is responsible for bringing the requirements, and as an approver during the technical specification stage. We ask you to edit/approve the Technical Specification. The document is editable, so you can edit and approve to get exactly what you had in mind.

How does Blitzy decide which tasks to delegate to human developers?

Blitzy's multi-agent system is meticulously and rigorously trained to know what it can accomplish, and what needs to be left for the human engineers. This ensures you only receive quality code and have a clear picture of remaining tasks.

Does Blitzy do more than just autonomous code generation?

Yes. Blitzy is a comprehensive platform that provides end-to-end development assistance. We support the entire development lifecycle by taking descriptive inputs and generating software requirements documents, technical design, code structure, and generative code within repos for your product.

Is this high quality and secure?

Quality and security matter deeply to us — and they were our biggest frustration with the copilots already on the market. That frustration is what led us to build something different: a system designed to meet enterprise standards from the start. Every piece of work passes through multiple QA agents that review each other's output before any code reaches you, so what you receive is held to a consistent quality bar rather than the variable output typical of single-pass code generation. We deliver production-grade code repositories. As with any code entering your environment — written by humans or AI — your team should still run its own QA, QC, and security testing before deployment. We build to a high standard and give your reviewers a strong starting point; final validation stays with the team that owns the production environment.

What is the typical cost of your solution?

Blitzy uses a two-phase pricing model: evaluation followed by deployment. This structure lets enterprises validate ROI at their preferred scale before committing to organization-wide implementation. The evaluation phase provides three options. Reverse Engineer ($0) offers an initial assessment with complete codebase reverse engineering and understanding up to 100K lines of code; Proof of Concept ($50K for a 2-month term), where Blitzy delivers a guided POC to demonstrate value; or Structured Pilot ($250K for a 6-month term), which fully deploys Blitzy in your environment with 5M lines onboarding and 1.25M lines generation to prove production readiness. Following successful evaluation, organizations choose between three deployment paths. Commercial ($500K typical investment per year) adopts Blitzy on one team to accelerate a defined initiative: the first 20M lines onboarded are included, with additional onboarding at $0.10 per line and generation at $0.20 per line starting at 2.5M lines, plus dedicated infrastructure and SAML-SSO. Enterprise ($5M typical investment per year) rolls Blitzy out across your engineering organization, with onboarding billed at $0.10 per line across the full codebase — a typical engagement onboards 50M lines — and generation at $0.20 per line as needed, adding a Dedicated AI Solutions Consultant, 2 Forward Deployed Engineers, org-wide onboarding and certification, and priority support. Transformation ($50M typical investment per year) supports your largest codebases, with a typical engagement onboarding 500M lines at the same per-line rates, custom deployment, and embedded teams including a Field CTO, a Dedicated AI Solutions Consultant, 6 Forward Deployed Engineers, and 2 Forward Deployed Designers for complete digital transformation. All tiers maintain SOC 2 Type II compliance, ISO 27001 certification, and guarantee no training on your code. Pricing follows a transparent two-rate model: $0.10 per line onboarded for reverse engineering and $0.20 per line generated for forward engineering. Because reverse engineering also produces complete technical documentation of your codebase, onboarding-only engagements are fully supported, and in every tier costs align directly with the value delivered.

After submitting my prompt, Blitzy added functionality in my tech spec that I did not expect. What do I do?

The system defaults to taking advantage of all technology upgrades when modernizing or upgrading to the latest technology stack. For example, if you specify an upgrade to Java 21, the system will by default implement virtual threads, as it's generally seen as a superior technical approach. If you do not want this, you must simply tell the system to 'make as few changes as possible to achieve the desired request'. Being as specific as possible about what functionality is (and is not) desired helps yield results that will align with expectations.

What do Blitzy agents rely on as a source of truth to represent my existing codebase?

Blitzy agents rely on the actual source code of your existing codebase—not the Tech Spec documentation—when performing refactors or extending functionality. However, an accurate Tech Spec significantly aids the system's efficiency in querying the underlying representation of the code. Therefore, investing time to ensure the Tech Spec reflects the core features of the application will yield expectation-aligned results and will save time with last-mile development.

Can Blitzy work with existing products and code bases?

Yes! Blitzy excels at working with existing codebases, using them as a foundation to ensure consistent, high-quality development. The platform enables you to add new features to existing products, generate comprehensive documentation, and tackle technical debt by upgrading legacy systems to state-of-the-art technologies or refactoring complex codebases. Our platform deploys dedicated AI agents that map and understand your codebase before generation, ensuring intelligent, contextualized development that aligns with your existing patterns and standards.

What programming languages does Blitzy support?

Blitzy's AI platform works with all programming languages.

How should I structure my prompts for Blitzy?

Structure and organization are crucial when prompting Blitzy. The most effective prompts follow our prompting template with clear sections for WHY (vision & purpose), WHAT (core requirements), and HOW (technical details, user experience & implementation priorities). Each section should be detailed but concise, focusing on essential information while providing relevant context. Including structured frameworks and concrete examples - like data models, user stories, or feature templates - helps Blitzy deliver more precise and purposeful solutions.

What information does Blitzy need to compile and run my code?

During code generation, Blitzy compiles your codebase and performs runtime validation to ensure the generated code works correctly. To enable this, we require: (1) Internal dependencies - any private packages, libraries, or binaries not publicly available that your code needs to build and run, (2) Environment variables and secrets - API keys, credentials, and configuration values required for compilation and runtime (shared securely through our encrypted UI, never exposed to AI agents), and (3) Build instructions - the specific steps or scripts needed to compile your code, typically found in your README or setup documentation. This information allows Blitzy to replicate your development environment and verify that all generated code functions properly before delivery.

How can I exclude certain files or folders from Blitzy's code generation?

Create a .blitzyignore file in your repository's root directory to specify which files or paths Blitzy should exclude during tech-spec generation and code generation. This works similarly to .gitignore - simply list the file patterns, directories, or specific files you want Blitzy to skip, using standard gitignore syntax like *.log, /build/, or config/secrets.json. To ensure Blitzy respects these exclusions, mention in both your codebase context prompt and target state prompt that Blitzy should reference the .blitzyignore file and exclude those paths from processing.

Can I cancel my project/job (code gen) once in progress?

At this time, jobs are not cancelable. Once you submit, it consumes the assigned quota.

Build enterprise software in days, not months.

Start building Talk to an expert

Blitzy Fixed and Enhanced Claude's C Compiler

Claude's Very Public Compiler Release

Deep Codebase Understanding: Tech Spec

What Blitzy Built

Performance Improvements

Remaining Work

What's Next

More from the blog

Blitzy's Blitz: Adventures in Chess

Dynamic Discourse: Security, AI & Open Source

Frequently asked questions

Build enterprise software in days, not months.

Product

Company

Support

Resources

Social

Legal