Skip to content

Beyond story points: how to measure developer velocity the right way

Traditional velocity metrics measure activity, not outcomes. Learn why developer experience is the real driver of productivity and how the DX Core 4 framework provides a better path forward.

Taylor Bruneaux

Analyst

As AI adoption accelerates, software is expected to ship faster, while engineering work becomes increasingly complex. In response, leaders are feeling increasing pressure to demonstrate returns on their investments in developer productivity. Most reach for velocity metrics, assuming they can measure team performance and optimize delivery speed.

But here’s the uncomfortable truth: traditional developer velocity measurement often makes teams slower, not faster. The problem isn’t velocity itself—it’s how most organizations measure it. Developer experience has emerged as a key lever for sustainable productivity improvement, yet traditional velocity metrics often overlook this critical dimension.

Counting story points, lines of code, or other activity-based outputs can create a false sense of progress while masking the fundamental drivers of engineering productivity. At DX, we take a different approach. In this article, we’ll explore a more nuanced approach to understanding development velocity and demonstrate its connection to broader developer productivity metrics.

How organizations typically measure developer velocity

Most organizations measure velocity through activity metrics. Practically, that usually means tracking things like story points completed, commits merged, or pull requests closed. Here’s what that means:

Story points completed

Teams assign point values during sprint planning and then total the completed points at the end of the sprint. Leadership compares numbers across sprints or teams, treating it as “throughput.” Rising points are interpreted as an indication of improved productivity.

Commits merged

Engineering managers track the number of commits each developer or team merges into the main branch per week. Dashboards visualize this as trend lines, assuming higher commit numbers indicate that more work is being done.

Pull requests closed

Organizations track the number of PRs reviewed and merged within given periods. Some set targets, expecting teams to close a minimum PRs each sprint, reasoning that more closed PRs equals faster progress.

Deployment frequency

Leaders track how often teams release changes into production. A higher deployment frequency becomes a proxy for higher velocity, with executives comparing numbers against industry benchmarks to gauge their competitiveness.

These measures are attractive because they’re easy to capture with existing systems (issue trackers, version control, CI/CD pipelines). They provide concrete numbers for slide decks, allowing executives to track progress over time. The simplicity gives an impression of objectivity and progress, even when the underlying reality is more complex.

The limits of developer velocity metrics

Here’s the core problem: velocity metrics reflect activity, not outcomes.

They measure how much developers are doing, not whether the work is valuable or sustainable. This is why organizations relying on velocity metrics fall into predictable dysfunctions:

Friction that compounds daily

Developers lose an average of 2.5 hours each day to poor tooling, context switching, and workflow friction. For a 50-person team, that’s more than a million dollars in lost capacity each year—waste that velocity metrics never reveal.

Quality eroded under pressure

When story points become the goal, teams cut corners. They rush reviews, skip tests, and take on technical debt. The short-term speed quickly disappears as rework and quality issues pile up.

Work detached from outcomes

Research indicates that up to 40% of engineering time is spent on tasks that do not align with business goals. Teams can hit velocity targets while shipping features no one uses or solving problems that do not matter. This misalignment between engineering metrics and real value creation is a core flaw in traditional measurement.

Burnout and disengagement

When developers are measured in terms of units of output, they disengage. Frustration grows as metrics are manipulated, fueling the retention crisis in engineering. The human cost is just as real. Developers choose this profession to solve complex problems and build products that matter. Reducing their work to points or lines of code undermines the motivation that drives their best contributions.

Why traditional approaches fail

Organizations often rely on standard metrics, such as story points, lines of code, or fragmented tool data, to measure developer velocity. However, these approaches mislead more than they help, creating unintended incentives while overlooking key work dimensions and disconnecting measurement from actual outcomes.

  • Story points per sprint push teams to game the system by inflating estimates or avoiding complex but valuable work. They also overlook quality. Teams can hit point targets while accumulating technical debt or degrading reliability. Most importantly, story point velocity has no connection to business outcomes. Teams shipping unused features can still appear productive on paper.
  • Activity metrics, such as lines of code or commits per day, seem objective, but they reward the wrong behaviors. They prioritize volume over value, encouraging verbose code over elegant and maintainable solutions. They also have a narrow focus on individual output, which undermines collaboration and the long-term health of the system. The result is activity that creates motion without meaningful progress.
  • Fragmented tool-based metrics combine data from deployment trackers, quality scanners, and occasional surveys. Without integration, these signals are incomplete and misleading. Speed gains may come at the expense of quality, or quality may slow down delivery, and no one clearly sees the trade-offs. Optimizing isolated metrics often results in degrading the overall developer experience.

Traditional velocity approaches fail because they measure activities instead of outcomes, symptoms instead of causes, and isolated signals instead of the complete experience. A modern alternative requires frameworks that account for multiple productivity dimensions in a systematic way, such as the DX Core 4, which balances delivery, quality, efficiency, and satisfaction.

How to increase developer velocity by improving developer productivity

The solution isn’t abandoning measurement. Instead, it’s measuring comprehensively. The most effective organizations use research-backed frameworks that capture the complete picture of engineering effectiveness.

Developer experience as the foundation

Modern productivity measurement starts with understanding developer experience as the foundational layer. Our research identifies three core dimensions that determine whether developers can do their best work:

  • Feedback loops: How quickly can developers get information about their code’s correctness, performance, and business impact? Shorter, clearer feedback loops accelerate learning and reduce wasted effort.
  • Cognitive load: How much mental overhead do tools, processes, and organizational dysfunction add to core software development work? A lower cognitive load means more mental energy is available for problem-solving.
  • Flow state: How often can developers achieve deep, uninterrupted focus on complex problems? Protecting flow state is crucial for tackling the most valuable and challenging work.

These experience factors predict delivery outcomes better than traditional activity metrics. Happy, unblocked developers naturally move faster and produce higher-quality work.

Balanced measurement through the DX Core 4 framework

The Core 4 operationalizes our developer productivity research into a practical measurement system that addresses every limitation of traditional velocity measurement.

The Core 4 tracks four counterbalanced dimensions:

  • Speed: How quickly teams deliver working software to customers (deployment frequency, lead time, cycle time)
  • Effectiveness: Developer experience and productivity drivers (Developer Experience Index, workflow efficiency, time to value)
  • Quality: Reliability and maintainability of deliverables (change failure rate, recovery time, technical debt)
  • Business Impact: Value creation and strategic alignment (time on new capabilities, initiative ROI, customer outcomes)

This framework prevents single-metric optimization, which can destroy traditional velocity measurement. Teams can’t game speed at the expense of quality or boost activity metrics while ignoring the business impact. This comprehensive approach is why organizations implementing the DX Core 4 see a 3%-12% overall increase in engineering efficiency, 14% increase in R&D time spent on feature development, and 15% improvement in employee engagement scores.

Outcome-based measurement

Modern agile software development velocity requires moving beyond story point tracking toward outcome-based measurement. The Core 4 framework enables agile teams to:

  • Track actual delivery speed (speed dimension) rather than estimated story points
  • Measure developer satisfaction and workflow efficiency (effectiveness dimension)
  • Ensure sustainable pace through quality metrics (quality dimension)
  • Connect sprint work to business value (business impact dimension)

This approach eliminates the gaming and dysfunction plaguing story point measurement while enabling accurate capacity planning and continuous improvement.

Automation velocity as a force multiplier

Automation velocity refers to the acceleration achieved by systematically eliminating manual work from development workflows. The DX Core 4 framework helps organizations optimize automation velocity by measuring its impact across all four dimensions:

  • Speed improvements from CI/CD automation and deployment streamlining
  • Effectiveness gains through reduced cognitive load and context switching
  • Quality enhancements via automated testing and deployment gates
  • Business impact from faster feature delivery and reduced operational overhead

Organizations that track automation velocity through the DX Core 4 framework typically see productivity improvements as manual processes are systematically eliminated.

Modern measurements that capture complexity

Measuring velocity with activity metrics leaves leaders blind to the forces that really shape developer effectiveness. To see the whole picture, you need to combine different perspectives on the work.

Experience sampling captures what developers are feeling in the moment, surfacing friction and tool issues that data alone cannot show. AI recommendations connect those signals to concrete opportunities for improvement across multiple dimensions of productivity.

Workflow analysis reveals where work actually slows down, the types of bottlenecks that story points often fail to expose. And TrueThroughput™ shifts the focus from activity completed to value delivered.

When these perspectives are combined, leaders move beyond isolated metrics. They can see how speed, quality, and effectiveness align, and make decisions that improve both developer experience and business outcomes simultaneously.

How comprehensive measurement changes the outcomes

The theoretical benefits of better measurement translate into concrete business outcomes. Organizations that implement comprehensive frameworks experience dramatic improvements across multiple dimensions.

Pfizer: $5M+ annual savings through systematic measurement

Pfizer transformed their software delivery during the pandemic, scaling from 100 to over 1,000 developers while improving productivity. Instead of relying on story point velocity, they implemented the DX Core 4 framework for comprehensive measurement and tracking.

Their systematic approach revealed bottlenecks invisible to traditional velocity tracking: incident resolution improved by 33%, deployment frequency increased 22%, and developer effectiveness freed up significant engineering capacity. The result: over $5 million annually in productivity gains with clear ROI attribution across all four DX Core 4 dimensions.

John Lewis: 40% more coding time through experience optimization

John Lewis moved beyond story point velocity to measure actual developer effectiveness. Their comprehensive approach focused on reducing friction through workflow automation, process standardization, tool optimization, and quality integration. Business impact: 40% increase in time spent coding and 10x+ ROI on developer experience investments.

Vercel: 43% cycle time reduction through comprehensive insights

Vercel used systematic measurement to identify and eliminate delivery bottlenecks. Rather than optimizing story point completion, they focused on actual customer value delivery speed. Their data-driven approach revealed process bottlenecks, tool inefficiencies, and quality gates that balanced speed improvements with reliability requirements. Business impact: 43% reduction in pull request cycle times (from 4.6 to 2.7 hours).

These organizations succeeded by measuring what actually drives results rather than optimizing metrics that can be easily gamed or manipulated.

Getting started with effective measurement

Moving beyond traditional velocity metrics requires a systematic approach. Engineering leaders can begin with focused measurement initiatives that provide immediate insights while building toward comprehensive frameworks.

Start with a developer experience baseline

The first step in any measurement transformation is establishing a baseline. The Developer Experience Index (DXI) provides that foundation. DXI is a proprietary, research-backed framework that quantifies engineering efficiency through a composite score of 14 key drivers. It is predictive, benchmarked, and tied directly to business outcomes.

Organizations can begin by surveying developers to understand satisfaction and pain points across the software development lifecycle. DXI takes this further by standardizing those signals, benchmarking them against industry peers, and translating friction into measurable dollar impact.

With DXI, leaders see what traditional velocity metrics miss: workflow efficiency, tool effectiveness, code review and deployment friction, retention risk factors, cognitive load, and context switching overhead. Instead of just tracking activity, DXI shows where productivity is being lost and which improvements will deliver the most significant ROI.

Implement targeted improvement initiatives

Use baseline insights to prioritize high-impact improvements. Our playbooks provide proven strategies for addressing common productivity bottlenecks. Workflow optimization playbooks help eliminate manual processes and reduce context switching, while tool consolidation playbooks guide platform engineering investments that reduce cognitive overhead. Quality integration playbooks ensure that speed improvements don’t compromise system reliability.

Deploy comprehensive measurement infrastructure

Expand from baseline assessment to continuous measurement using integrated tools. Self-service dashboards provide teams with real-time insights into their own productivity patterns. Scorecards track progress across speed, effectiveness, quality, and business impact dimensions, preventing single-metric optimization while enabling data-driven improvement decisions.

Scale successful practices across teams

As measurement maturity grows, successful practices can be scaled across the engineering organization. The Core 4 framework provides structure for organization-wide measurement standardization while allowing team-level customization based on specific contexts and challenges.

Engineering leaders who start with developer experience measurement and expand to comprehensive frameworks typically see measurable improvements within weeks and substantial ROI within months.

The future of engineering measurement

Traditional velocity metrics are not just limited; they are becoming harmful in today’s environment. As software becomes increasingly complex and AI reshapes engineering workflows, activity counts such as story points or commits reveal less and less about actual effectiveness.

How AI changes measurement

AI coding tools hold real promise for accelerating productivity, but simple activity metrics cannot measure their impact. For example, Workhuman improved the ROI of GitHub Copilot by 21 percent when measuring how AI affected overall developer effectiveness, rather than just tracking code completion.

To accurately measure AI’s real impact, organizations need frameworks that account for multiple dimensions simultaneously. This includes workflow efficiency, code quality, learning velocity, and developer satisfaction. Traditional velocity metrics overlook all of these factors, leaving leaders blind to where AI is helping and where it may be creating new friction.

Frequently asked questions about developer velocity measurement

What is developer velocity?

Developer velocity is the speed and efficiency with which engineering teams deliver valuable software to customers. Unlike activity metrics such as story points or commits, true velocity is about outcomes: how quickly teams ship features that solve real problems and create business impact. The DX Core 4 framework captures this through balanced measurement across delivery speed, quality, efficiency, and business value.

How should developer velocity be measured?

Measuring velocity effectively means moving beyond story points and other narrow indicators. The DX Core 4 framework considers multiple dimensions simultaneously: delivery speed, developer experience, code quality, and business outcomes. This approach reduces gaming, provides a more accurate picture of effectiveness, and gives leaders actionable insights for improvement.

Key takeaways for moving beyond traditional velocity measurement

Traditional approaches to measuring developer velocity create more problems than they solve. Story points, activity counts, and fragmented tools often optimize for the wrong behaviors while ignoring the fundamental drivers of engineering effectiveness.

The real question is not whether velocity metrics are imperfect. The question is whether leaders can afford to rely on measures that waste millions in capacity and frustrate their most valuable technical talent. Measuring what truly drives effectiveness is the first step to unlocking sustainable productivity.

Published
September 23, 2025