Skip to main content

Story Points

By Janna Bastow

Updated: March 24th, 2026

Reviewed by: Simon Cast

Fact checked by: Julie Hammers

Story points are one of the most widely adopted estimation practices in software development, and one of the most widely misunderstood. Teams use them daily without knowing where they came from or why they exist. Managers build dashboards around them without understanding what the numbers actually represent. And entire organizations have turned a deliberately abstract concept into a rigid unit of measurement, complete with conversion tables that translate points into hours. That defeats the original purpose entirely. The debate around story points is genuinely polarizing. Their inventor has publicly expressed regret about creating them. The #NoEstimates movement has called for abandoning them altogether. And yet, most agile teams still use them in some form because, when applied correctly, they solve a real and specific problem: how do you get a team with different skill levels and speeds to agree on the relative size of work?

What Are Story Points?

Agile teams use story points as an abstract unit of measure to express the relative effort required to complete a user story or product backlog item. A story point estimate accounts for the combined weight of effort, complexity, risk, and uncertainty involved in delivering a piece of work. Story points are deliberately divorced from clock time. A 5-point story is not “5 hours” or “5 days.” It means a team considers that item roughly five times the effort of their smallest reference story.

The reason story points use relative sizing rather than absolute time estimates is rooted in a well-documented human cognitive pattern: people are significantly better at comparing things to each other than they are at guessing how long something will take in hours. You can look at two backlog items and confidently say one is about twice as big as the other. You cannot look at either one and confidently predict it will take exactly 14 hours and 20 minutes. Relative estimation leverages a comparative strength and avoids a consistent weakness.

The numbers used in story point estimation typically follow a modified Fibonacci sequence: 1, 2, 3, 5, 8, 13, 20, 40, 100. The increasing gaps between values reflect a principle from psychophysics known as Weber’s Law, which describes how humans perceive differences. Small tasks are distinguishable at fine grain (the difference between a 1 and a 2 is noticeable). Large tasks carry so much uncertainty that trying to distinguish a 14 from a 15 is meaningless. The Fibonacci-like spacing forces estimators to make meaningful distinctions and avoids false precision on large items.

tory points Fibonacci sequence showing relative estimation scale used in ProdPad Product Management software

Where Did Story Points Come From?

Story points originated in the late 1990s as part of Extreme Programming (XP), one of the original agile methodologies. Ron Jeffries, one of the three founders of XP alongside Kent Beck and Ward Cunningham, invented the concept.

The original XP teams at the Chrysler Comprehensive Compensation System project estimated work in “ideal days,” a theoretical unit describing how long a task would take if the developer could work without any interruption. In practice, as Jeffries has recounted, it took roughly three real days to accomplish one ideal day’s worth of work. The problem was that managers and stakeholders heard “3 days” and interpreted it as a commitment, regardless of the “ideal” qualifier. The team started stripping away the time label and simply using the raw number. A 3-ideal-day task became a “3.” When people asked “3 what?”, the answer eventually became “3 points.”

The abstraction was intentional. By removing the word “days,” the team hoped to prevent stakeholders from converting estimates into calendar commitments. Mike Cohn, co-founder of the Scrum Alliance and author of Agile Estimating and Planning, later popularized story points widely through his writing and training. Cohn framed story points as measures of effort that incorporate complexity, risk, and uncertainty, and paired them with Planning Poker as the primary estimation technique.

Jeffries has since publicly stated that he regrets inventing story points, citing the widespread confusion and misuse they have caused. The tool Jeffries designed to protect teams from unrealistic commitments has, in many organizations, become the very mechanism that enforces those commitments.

How Do Story Points Work in Practice?

Story point estimation is a team activity. The whole development team participates, because the goal is to arrive at a shared understanding of the work involved, not to extract a number from the most senior engineer.

Planning Poker

The most common method for assigning story points is Planning Poker, a consensus-based estimation technique first described by James Grenning in 2002 and later popularized by Cohn. Each team member holds a set of cards showing values from the modified Fibonacci sequence. The Product Manager or Product Owner describes a backlog item. The team discusses it, asks clarifying questions, and then everyone simultaneously reveals a card representing their estimate.

If all cards match, that value becomes the estimate. If there is disagreement, the team discusses the reasoning behind the highest and lowest estimates.

This conversation is where most of the real value of story points lives. Disagreements surface hidden assumptions, unrecognized dependencies, and gaps in the team’s understanding of what “done” actually means for that item. After discussion, the team re-estimates until they reach consensus.

The simultaneous reveal is a deliberate design choice. It prevents anchoring bias, a cognitive pattern where the first number spoken aloud disproportionately influences everyone else’s thinking. By forcing independent estimates before any discussion begins, Planning Poker produces more honest initial data points to work from.

Reference Stories

Effective story point estimation requires at least one calibration point. Teams typically designate a “reference story,” a previously completed piece of work that everyone agrees represents a specific point value (often a 3 or a 5). he team makes every subsequent estimate relative to that anchor. Nobody estimates a new backlog item in isolation. The question is always: compared to our reference story, is this bigger, smaller, or about the same?

This is what makes story points a relative measure. A senior developer might complete a 5-point story in four hours. A more junior developer might need ten hours for the same story. Both can agree it is a 5, because the number does not represent individual speed. It represents the team’s shared judgment about the size of the work.

Velocity

Once a team has been estimating with story points across several sprints, a useful metric emerges: velocity. Velocity is the average number of story points a team completes per sprint. If a team consistently delivers around 30 points per two-week sprint, that number becomes a planning input. The team (and Product Manager) can use velocity to forecast roughly how many sprints the remaining backlog will take, or how much of the backlog is realistic to expect in a given timeframe.

Velocity is a lagging indicator specific to one team. Comparing velocity across teams is meaningless, because a “5” on one team does not represent the same amount of work as a “5” on another.

Estimates only matter if they connect to a strategy. ProdPad’s guide to lean roadmapping covers how to plan around outcomes instead of delivery dates, and why the Now-Next-Later format replaces the false certainty of timeline roadmaps

What Do Story Points Actually Measure?

This is the question that causes the most confusion. Mike Cohn has consistently defined story points as a measure of effort, where effort incorporates three factors:

Complexity

Work that requires significant cognitive load, unfamiliar technology, or intricate logic is more complex. A straightforward CRUD form is less complex than a multi-step workflow involving third-party API integrations, conditional business rules, and edge case handling.

Risk and Uncertainty

If the team has never worked with a particular technology, or the requirements are ambiguous, or there are external dependencies that could cause delays, those unknowns add to the estimate. More unknowns mean a higher point value, because the team is factoring in the probability that things will not go smoothly.

Volume of Work

Even straightforward, low-risk work can be large. Migrating 200 database fields is not complex or risky, but it is a lot of work. Volume matters in story point estimation alongside complexity and uncertainty.

The critical point is that all three factors collapse into a single number. Story points are not a measure of complexity alone, nor a measure of time, nor a measure of lines of code. They are a holistic judgment about how much effort the team expects to expend getting this item to done, relative to other items they have estimated.

Story points estimation factors showing complexity risk and effort in ProdPad Product Management software

What Scales Do Teams Use for Story Points?

Teams use several scales in practice. The choice of scale matters less than consistent application within a team.

Modified Fibonacci Sequence

The most widely used scale: 1, 2, 3, 5, 8, 13, 20, 40, 100. The increasing gaps between values force teams to make meaningful sizing decisions. An item that falls between a 5 and an 8 forces the team to round in one direction, which triggers a useful conversation about what is driving the size up or down.

Powers of Two

Some teams use 1, 2, 4, 8, 16, 32. This scale doubles at each step, which simplifies the mental model (“is this twice as big?”) but can encourage a narrower type of comparison conversation.

T-Shirt Sizing

XS, S, M, L, XL. This works well for high-level backlog grooming or roadmap-level estimation where precision is unnecessary. Many teams use T-shirt sizing for initial triage and convert to Fibonacci for sprint-level planning.

Story Counting (No Points)

Some teams skip points entirely and simply count stories, relying on the discipline of keeping stories small and roughly uniform in size. This approach is central to the #NoEstimates movement (more on that below).

The scale a team chooses is far less important than the conversation the estimation process generates. Any scale that surfaces assumptions, clarifies scope, and builds shared understanding is doing its job.

ProdPad helps product teams prioritize using Impact vs Effort scoring , so you can weigh what matters at the strategic level before stories ever get pointed at the sprint level

How Should Story Points Be Used?

Story points serve two primary purposes when used well.

Sprint Planning

The team’s historical velocity tells them roughly how many points of work they can reliably take on in a sprint. During sprint planning, they pull items from the backlog until they approach that capacity. This is capacity-based planning, not commitment-based planning. The velocity average is a guide, not a guarantee.

Long-Range Forecasting

If the remaining backlog totals 200 points and the team’s velocity averages 25 points per sprint, a rough forecast of 8 sprints (16 weeks for two-week sprints) is reasonable. This forecast carries significant uncertainty, especially for items deep in the backlog that have not been refined. But it gives product leaders and stakeholders a directional answer to the “when?” question that is grounded in empirical data rather than wishful thinking.

Product leaders who use outcome-based roadmapping can use velocity data to inform the “Later” horizon of a Now-Next-Later roadmap without making premature commitments. The forecast answers “roughly how far out is this work likely to land?” without pretending to know the exact sprint.

Story points used in Now-Next-Later roadmap planning in ProdPad Product Management software

How Are Story Points Commonly Misused?

The list of anti-patterns around story points is long, and most of them stem from the same root cause: treating an abstract, team-internal planning tool as a management reporting metric.

Equating Story Points to Hours

This is the most common mistake and the one that does the most damage. Once an organization creates a conversion table (“1 point = 4 hours”), story points lose every advantage they were designed to provide. The abstraction collapses. Different-speed developers can no longer agree on estimates. And the numbers become pseudo-commitments dressed up in unfamiliar units. As Cohn has argued extensively [link: https://www.mountaingoatsoftware.com/blog/dont-equate-story-points-to-hours], if you are going to convert points to hours, you should stop calling them points and just estimate in hours. At least that is honest.

Comparing Velocity Across Teams

A team that averages 40 points per sprint is not “more productive” than a team that averages 25. Their scales are different. Their reference stories are different. Their definitions of done are different. Comparing velocity across teams is like comparing temperatures measured in Celsius and Fahrenheit without conversion. The numbers are not interchangeable.

Using Story Points as a Performance Metric

When individual developers are evaluated on how many story points they complete, the incentive structure distorts immediately. Developers will inflate estimates. They will avoid pairing on hard problems (because it “dilutes” their personal point count). They will resist helping teammates. Story points were designed to be a collective team planning tool. Turning them into an individual performance metric destroys collaboration and produces inflated, meaningless data.

Spending Too Much Time Estimating

If a team regularly spends 90 minutes in a Planning Poker session arguing about whether something is a 5 or an 8, the estimation process has become a cost center. The value of story points comes from the conversation they spark, not from the precision of the final number. A 5 and an 8 are close enough that the difference will wash out across a sprint’s worth of work.

Estimating Items That Are Too Large

Any backlog item estimated at 13 points or higher likely needs to be broken down into smaller pieces. Large estimates carry enormous uncertainty. A 20-point story is not a useful planning input. It is a signal that the team does not yet understand the work well enough to plan around it.

Story points measure effort. Prioritization should measure value. ProdPad’s guide to 17 prioritization frameworks covers RICE, MoSCoW, Cost of Delay, and more, so your team can decide what to build based on outcomes, not just how big something is.

Product Manager's Guide to Prioritization Models

Should You Use Story Points? The #NoEstimates Debate

The #NoEstimates movement, which gained traction through the work of Woody Zuill, Vasco Duarte, and Allen Holub, argues that estimation in general (and story points in particular) is waste. The core argument is straightforward: if teams keep their stories small and roughly uniform in size, you can forecast delivery just as accurately by counting stories as you can by summing story points. Duarte’s empirical analysis found that story-count-based forecasts matched the accuracy of point-based forecasts when teams kept their stories small.

Holub goes further, arguing that estimates create dysfunction because they are inevitably treated as commitments, which pressures teams into cutting corners or gaming numbers. The alternative proposed by #NoEstimates advocates is measurement-based projection: track how many stories the team completes per week, use that throughput rate to project where the backlog will stand at any future date, and adjust scope accordingly.

The #NoEstimates position has real merit, particularly for mature teams with strong story-slicing discipline. If every story is genuinely 1 to 3 days of effort, counting them produces useful forecasts without the overhead of estimation sessions. The argument weakens for teams with highly variable story sizes, significant technical uncertainty, or organizational contexts that require some form of sizing for budgeting or portfolio prioritization.

Many teams land on a practical middle ground: use story points during team formation and early sprints to build shared understanding and calibrate expectations. Once the team matures and develops the discipline to keep stories small and well-defined, consider whether the estimation ceremony is still adding value or just consuming time.

How Story Points Connect to Strategic Product Work

Story points live in the delivery layer of Product Management. They help Engineering teams plan sprints and forecast timelines. They do not (and should not) drive strategic product decisions. The decision about what to build and why belongs upstream, in the domain of product strategy, OKRs, customer feedback analysis, and prioritization frameworks like RICE scoring or Cost of Delay.

The common anti-pattern is treating story points as a prioritization input at the strategic level. “This feature is only 8 points, so let’s do it first” is effort-driven thinking, not outcome-driven thinking. A 3-point story that validates a core product hypothesis is worth more than a 40-point feature that nobody asked for.

Product leaders who separate discovery from delivery can use story points where they add value (sprint planning and short-term forecasting) without letting them contaminate strategic prioritization. Tools shape behavior. When the only lens you have is effort, everything looks like a sizing exercise. When you anchor your roadmap to outcomes and connect your backlog to customer feedback, story points become one input among many, not the primary driver.

ProdPad’s Ultimate Guide to Product Roadmaps covers how to structure roadmaps around outcomes, connect initiatives to OKRs, and keep estimation in its proper place.

Why the Story Points Debate Reveals Something Deeper

The intensity of the story points debate, points to a larger tension in how organizations manage product development. Ron Jeffries invented story points to protect development teams from unrealistic commitments. They were deliberately abstract, deliberately team-owned, and deliberately disconnected from calendar time. The fact that most organizations have stripped away every one of those protections, converting points to hours, comparing across teams, and using them as performance metrics, says more about organizational incentives than about the estimation technique itself.

Teams that use story points well share common traits: they keep the estimates team-internal, they use the estimation conversation to build shared understanding, they track velocity for planning (not evaluation), and they know that estimates are probabilistic forecasts that will sometimes be wrong.

Teams that struggle with story points are usually struggling with something else entirely: unclear requirements, unstable team composition, unrealistic stakeholder expectations, or a culture that treats forecasts as promises.

The tool is not the problem. The system the tool operates within determines whether it helps or harms.

Enjoy a single source of truth for every product idea

Start a free trial and see how easy your Product Management life could be with ProdPad