Co-Founder of Momentum. Formerly Product @ Klaviyo, Zaius (acquired by Optimizely), and Upscribe.
Table of Contents
- Your Team Is Arguing About Estimates. Again.
- Moving Beyond Hours
- The Real Purpose
- Moving Beyond Hours with Relative Sizing
- Finding Your Baseline
- Why the Weird Numbers?
- Common Story Point Scales Compared
- Getting Your Team to Agree (Without a Fistfight)
- Planning Poker: The Great Equalizer
- This Is Where the Magic Happens
- Converging on an Estimate
- Velocity: The Predictability Metric You’re Almost Certainly Misusing
- Where It All Goes Wrong
- Using Velocity as a Diagnostic Tool
- The Anti-Patterns That Will Derail Your Team
- Translating Points to Hours
- Using Points for Individual Performance
- Weaponizing Velocity
- Estimating Bugs and Chores in Points
- Never Recalibrating Your Baseline
- Your Action Plan for the Next Sprint
- Establish Your Baseline
- Run a Mini Estimation Session
- Commit to the Sprint
- Track and Learn
- Common Questions About Story Points
- Should We Re-Estimate Rollover Work?
- What About Stories That Are Way Too Big?
- Do Story Points Only Work in Scrum?

Do not index
Do not index
Story points. It's a simple concept that somehow turns every sprint planning meeting into a cage match. Think of them less as a number and more as a gut check—a way for your team to agree on how "big" a task feels compared to everything else. It’s a relative measure of effort, complexity, and uncertainty.
Your Team Is Arguing About Estimates. Again.
Let's set the scene. You’re in a sprint planning meeting, and the debate over a new feature is getting heated. The senior engineer casually throws out a '3'. The junior, eyes wide, insists it's an '8'. Meanwhile, you're just trying to figure out if you can promise leadership a delivery date without lying.
Then someone says the forbidden word: "hours." The whole room groans.

This is precisely where most teams drive off a cliff. The problem isn't the points themselves; it's that nobody ever bothered to explain why they exist. They were invented specifically to stop us from talking about the clock.
Moving Beyond Hours
The whole point is to shift the conversation from, "How many hours will this take?" to a much simpler question: "Is this bigger or smaller than that other thing we did?" It's a subtle change, but it's a powerful one. It decouples the work from all the baggage that comes with deadlines and time tracking.
When your team is stuck arguing, it’s usually a sign of a deeper misalignment. It's not just about the number; it's about different assumptions and hidden fears. This is a perfect time to break out some essential conflict resolution techniques and turn that friction into a productive conversation.
Story points first showed up in the early 2000s, born from a need for psychological safety. The idea was to give teams a way to talk about effort without accidentally committing to a hard deadline that a manager might weaponize later.
Story points are about creating a shared language for effort. They help a team align on complexity and risk without the false precision of hourly estimates, which vary wildly from person to person.
The Real Purpose
The goal isn't to land on a perfect number. It's to build a shared understanding.
When you're debating whether something is a 5 or an 8, you're not just arguing numbers. You're surfacing risks. The senior dev thinks it's a 5 because they've done it before. The junior sees it as an 8 because they're factoring in unfamiliar territory. That conversation is gold.
By the end of this article, you'll see how to use story points to make your team more predictable, less stressed, and finally, on the same page.
Moving Beyond Hours with Relative Sizing
Let’s be honest, the biggest hurdle with story points is letting go of time. Your entire career, you’ve been trained to think in hours and days. Switching gears feels unnatural, maybe even a little irresponsible.
But here's the secret every seasoned team eventually learns: humans are terrible at estimating absolute time, but we're fantastic at comparing things.
Think about it. Can you guess the exact weight of a watermelon just by looking at it? Probably not. But could you instantly tell me it’s heavier than an apple? Of course. That’s the magic of relative estimation.

This simple concept is why story points actually work. Instead of getting into a pointless debate over whether a task will take 20 or 30 hours, the team just has to agree on how it stacks up against other work they’ve already done.
Finding Your Baseline
To make this system click, you need an anchor—a shared point of reference everyone understands.
Find a task that is mind-numbingly simple and well-understood by the whole team. Something like, "update the copyright year in the footer." Everyone knows what that involves. No debate.
That task is now your baseline. Let’s call it a ‘2’. From this point forward, every other task is sized in relation to that one.
A baseline story acts as your team’s “yardstick.” It’s not about how many hours it took; it’s about the collective memory of the effort involved. Every future estimate becomes a simple comparison: “Is this new task bigger or smaller than our yardstick, and by how much?”
Suddenly, the conversation changes. Is this new feature request twice as complex as the baseline? It’s probably a ‘3’ or a ‘5’. Is it a monster that feels five times bigger? Maybe that's an ‘8’. This approach sidesteps all the anxiety and guesswork that comes with estimating in hours.
If you really want to go deep on this, our complete guide on task estimation covers even more strategies.
Why the Weird Numbers?
This is where scales like the Fibonacci sequence (1, 2, 3, 5, 8, 13…) come into play. Those numbers aren't random; they’re a clever trick to reflect growing uncertainty. The gap between a 2 and a 3 is small, but the jump from an 8 to a 13 is a chasm.
This forces your team to make a real choice. Is this task really closer to an 8, or is it a 13? This intentional lack of precision stops teams from getting bogged down in meaningless debates over whether something is a 6 or a 7.
Of course, Fibonacci isn't the only game in town. Some teams prefer to abstract the idea even further.
Common Story Point Scales Compared
To find the right fit, it helps to see the most popular scales side-by-side. Each has its own flavor and is suited for different team dynamics.
Scale | Values | Core Principle | Best For |
Fibonacci | 1, 2, 3, 5, 8, 13, 21... | Numbers get further apart to reflect increasing uncertainty. It forces a clear choice for larger tasks. | Established Agile teams who are comfortable with the concept of story points. |
T-Shirt Sizing | XS, S, M, L, XL | Removes numbers entirely to focus the conversation purely on relative size. It feels more conversational and less intimidating. | New teams, non-technical stakeholders, or for high-level roadmap planning. |
Choosing a scale isn't a life-or-death decision. The specific system you use matters far less than your team's commitment to using it consistently.
The real goal is to create a shared language for effort. It allows a senior and a junior engineer to agree a task is a "5" even if their actual time to completion would be wildly different. They aren't estimating hours; they're agreeing on size, complexity, and risk. And that changes everything.
Getting Your Team to Agree (Without a Fistfight)
Alright, let's talk about the elephant in the planning room: how do you actually get a room full of opinionated engineers to agree on a number?
You’ve seen it happen. The discussion spirals, the same points get rehashed for the fifth time, and an hour later, you’re no closer to an estimate than when you started. It turns into a conversational cage match.
You don't need another argument. You need a process—one that channels all that passionate debate into productive alignment.

Planning Poker: The Great Equalizer
The most popular (and, frankly, effective) method is Planning Poker. Don’t let the name fool you; it’s not about gambling. It’s a dead-simple, structured way to get everyone's honest, unbiased opinion on the table at the exact same time.
Here’s the play-by-play:
- The Story Is Told: The product manager lays out a user story, explaining the user problem and what "done" looks like.
- Questions Fly: The team pokes holes in it. “What happens if the API call fails?” “Did we think about the mobile view for this?” This is where you start closing the initial knowledge gaps.
- The Silent Vote: Everyone privately picks a card with the story point value they think fits. This part is crucial—no one gets influenced by the senior dev’s confident smirk or the tech lead’s loud opinion.
- The Big Reveal: On the count of three, everyone flips their card.
Now, one of two things happens. Either the numbers are pretty close (a mix of 3s and 5s, for example), and you can just agree on the higher number and move on. Or, the numbers are all over the place.
This Is Where the Magic Happens
Let's say you see a ‘2’ and a ‘13’ for the same exact task. Don’t groan. This isn't a failure; it’s an invitation to a conversation that needs to happen.
You turn to the two people with the highest and lowest estimates and ask them to explain their thinking.
The senior engineer who voted ‘2’ might say, “Oh, this is simple. We have a reusable library that does half the work already.”
Meanwhile, the junior engineer who voted ‘13’ might fire back, “I was factoring in the need to write new database migrations and update three downstream services, which I’m pretty sure that library doesn't handle.”
The point of Planning Poker isn't to find the 'right' number. It's to uncover hidden risks, false assumptions, and knowledge gaps. The number is just a byproduct of creating a shared understanding.
Boom. In less than a minute, the entire team just learned about a reusable component and a bunch of hidden dependencies. The discussion that follows is the whole point. You're not negotiating a number; you're aligning the team's mental model of the work.
For a deeper dive into this and other techniques, our guide on agile estimation methods is a great next step.
Converging on an Estimate
After that quick but crucial chat, you vote again. This time, the estimates will be much, much closer because everyone is working from the same playbook. You repeat this until the numbers settle into a reasonable range.
This isn't just theory; it works. At one startup I advised, implementing Planning Poker cut their estimation meetings from a dreadful two hours to a crisp 45 minutes. More importantly, it stopped the bickering and forced everyone to ask the right questions upfront, which killed a lot of that deadline anxiety.
The goal isn't perfect, crystal-ball accuracy. It's team alignment. And maybe, just maybe, a little less fighting in your planning meetings.
Velocity: The Predictability Metric You’re Almost Certainly Misusing
So, you’ve diligently pointed your backlog. Now what? You're sitting on a pile of abstract numbers and faced with the inevitable question: how do we use these to actually forecast anything?
Enter velocity. It’s the metric that promises predictability but is so often twisted into a weapon of micromanagement. In its purest form, velocity is just the average number of story points your team fully completes in a sprint. Simple as that.

If your team knocked out 25, 30, and 28 points in the last three sprints, your average velocity is hovering around 28. This isn't just a number; it's your team's superpower.
This little data point gives you a realistic glimpse into how much work your team can actually handle. Now you can look at that daunting 200-point backlog and say, "Okay, this will likely take us about seven sprints." It's not a blood oath; it's a forecast.
Where It All Goes Wrong
But here’s where the wheels come off. Someone, somewhere, will inevitably decide that velocity is a productivity metric. And they will be catastrophically wrong.
You absolutely cannot compare the velocity of Team A (who averages 30 points) to Team B (who averages 50) and declare Team B the winner. Their point scales are entirely relative to their own internal baselines. It's like comparing 50 apples to 30 oranges and wondering why the math feels weird.
The moment you start using velocity to pressure a team to "increase their velocity," you've poisoned the well. The team will do exactly what you've incentivized them to do: they'll start inflating their estimates. That simple 3-point task? It’s a 5 now. That 8-pointer? Let’s call it a 13 to be safe.
Velocity is a tool for forecasting, not a weapon for micromanagement. Its value is in its honesty. The second it becomes a performance target, it becomes a vanity metric, and you lose all ability to predict anything.
This phenomenon, often called "story point inflation," renders your numbers meaningless. Worse, it completely erodes the trust between the team and leadership. The team feels like they're being measured by a flawed yardstick, and leadership wonders why their forecasts are suddenly so unreliable.
Using Velocity as a Diagnostic Tool
Instead of treating velocity as a score to be beaten, think of it as a health check.
A stable velocity is a sign of a predictable, rhythmic team. A wildly fluctuating velocity, on the other hand, is a signal flare that something is wrong.
Maybe the team is constantly getting pulled into unplanned fire drills. Perhaps user stories are poorly defined, leading to scope creep mid-sprint. Or maybe there are hidden dependencies and blockers that aren't being dealt with.
- Sudden Drop in Velocity? Dig in. What external pressures or internal roadblocks is the team facing? Was half the team out sick? Did a critical production issue derail the sprint?
- Sudden Spike in Velocity? This isn't always good news. Did the team just get lucky with a few underestimated tasks? Or are they cutting corners on quality just to hit a number?
Velocity isn't about the number itself; it's about the conversation it starts. It’s a lagging indicator that tells you how your process is performing. By understanding your velocity, you can have more honest conversations about capacity, timelines, and the real trade-offs required to hit your goals.
Tracking this correctly is key, which is why a deep understanding of product success metrics is so important for any team leader. Your team's ability to ship consistently is a direct reflection of a healthy process, and velocity is one of the best tools you have to monitor that health—as long as you don't misuse it.
The Anti-Patterns That Will Derail Your Team
You can follow all the best practices, run a perfect Planning Poker session, and still watch the whole system crumble. Why? Because a few seemingly innocent bad habits can silently poison the well, turning a tool for alignment into a source of frustration.
Spotting these anti-patterns is half the battle. They’re tricky because they often start from a good place but lead to a total mess.
Translating Points to Hours
This is the original sin of story point adoption. The second a manager asks, “So, what does a 5-point story mean in days?” you’re already in trouble. It feels like a harmless question, just an attempt to translate this new, fuzzy concept back into the comfortable language of time.
But it’s a trap.
It forces the team into a shadow system where they’re constantly doing mental gymnastics—estimating in points, then immediately converting them back to hours to make stakeholders happy. This double-work completely defeats the purpose of relative estimation and brings back all the anxiety you were trying to escape.
Using Points for Individual Performance
Story points are a team metric. Full stop. They measure the collective effort needed to get a story to "done." Tying points to individual engineers is one of the fastest ways to kill collaboration.
As soon as you do this, you start rewarding the wrong things. Engineers will begin cherry-picking easy, high-point tasks to pump up their personal "score." Worse, they’ll stop helping each other. Why would anyone pause their own work to help a teammate if it means their personal point count for the sprint goes down? It turns a collaborative exercise into a zero-sum game.
Weaponizing Velocity
Like we talked about, velocity is a diagnostic tool for forecasting, not a performance target. If leadership declares, “We need to boost velocity by 10% next quarter,” the team will hit that goal. But not by working faster. They'll just start gaming the system.
They’ll inflate their estimates—a practice known as story point inflation—and your predictable metric will become useless overnight. Velocity should be a measured outcome of the team’s process, not the goal itself. A stable velocity is healthy; a manipulated one is a lie.
Estimating Bugs and Chores in Points
This one can be controversial, but pointing non-feature work often muddies the waters. While some teams do it, it can distort your velocity and make it a nightmare to forecast how much new value you're actually shipping.
Think of it this way:
- Bugs are a failure of quality, not new value for the customer. They represent a debt that needs to be paid.
- Chores are necessary maintenance, like upgrading a library or refactoring code. They enable future value but aren't features themselves.
A better approach is often to just time-box these tasks. Set aside a certain percentage of your sprint’s capacity—say, 20%—to handle bugs and chores. This keeps your velocity focused purely on the new, customer-facing value your team is delivering.
The transition to story points is notoriously tricky, with failure rates hovering between 40-60% in early attempts. A huge part of this is managers misusing points as productivity measures, which completely undermines the system. You can read more about these common hurdles in reports from the Agile Alliance.
Never Recalibrating Your Baseline
Your team evolves. They get better, learn new tech, and their shared understanding of the product deepens. The "2-point" story that was your baseline six months ago might feel like a "1-pointer" today.
If you never revisit your baseline, your estimates will slowly drift out of sync with reality. This is "point deflation," and it can make your velocity look like it's tanking even when the team is actually getting more efficient. Once or twice a year, it's smart to have a quick recalibration session to make sure your scale still makes sense for the team you are today, not the team you were a year ago.
Your Action Plan for the Next Sprint
Alright, let's get out of the theory and into the real world. Enough talking about story points—it's time to actually use them. Here's your game plan for your very next sprint planning meeting.
The goal here isn't perfection on day one. It's just to get started.
Establish Your Baseline
First things first, you need a yardstick. Get the team to agree on a baseline story, but don't just pull something random from the top of the backlog. That's a recipe for a pointless debate.
Instead, pick a tiny, simple task everyone remembers shipping recently. Think "update the copyright year in the footer" or "change the color of the primary button." It needs to be so mind-numbingly straightforward that nobody can argue about what it took to get it done.
Got one? Good. Now, ask the team, “If that task was a 2, what would this new task be?” Boom. You’ve just created your reference point.
Run a Mini Estimation Session
Whatever you do, don't try to estimate your entire backlog. That’s a classic rookie mistake that just burns everyone out before you’ve even started. Just grab the top 10-15 highest-priority items. That's it.
Use a structured method like Planning Poker to get this done. It gives everyone a voice and, more importantly, turns major disagreements into really productive conversations. You can even use tools that make this a breeze, like Momentum's built-in Pointing Poker feature.
When the estimates are all over the place—say, someone throws out a 2 and someone else a 13 for the same story—that’s your cue to step in and facilitate. This is where you uncover all the hidden assumptions and start building real alignment.
A clean, visual board like this is your best friend. It keeps the whole team focused on what actually matters during the estimation huddle.
Commit to the Sprint
Okay, you've got a handful of pointed stories. Now it's time to build the actual sprint backlog. Since this is your first go, you have zero velocity history. That's totally fine. Don't invent a number.
Just rely on the team's collective gut. Lay out the pointed stories and ask a simple, direct question: “Looking at these points, what feels achievable over the next two weeks?”
Of course, planning is just the start. You'll also need a plan for handling last-minute changes at the end of a sprint—because they will happen.
Track and Learn
When the sprint ends, add up the story points for everything that was 100% completed. Don't judge the number, don't over-analyze it. Just write it down. That's your first-ever velocity data point.
It will probably feel weirdly low or just plain wrong. Again, that's fine. After 2-3 sprints, a baseline average will start to emerge. From that point on, your planning will become worlds more predictable, and you'll wonder how you ever got by without it.
Common Questions About Story Points
Even with the best intentions, a few nagging questions always seem to pop up when a team starts using story points. Let's tackle the most common ones head-on so you can get past the theory and back to shipping great software.
Should We Re-Estimate Rollover Work?
Short answer: Nope.
You only get credit for story points on work that is 100% complete. Done-done.
If a massive 13-point story is half-finished when the sprint ends, your team gets a big fat zero for it in that sprint. When it’s finally completed in the next sprint, the team gets the full 13 points. This keeps your velocity an honest reflection of what was actually delivered, making it a reliable forecasting tool. Giving partial credit just muddies the water and makes your metrics useless.
What About Stories That Are Way Too Big?
If the team throws out a number like 40, it’s not an estimate; it's a warning sign. A giant red flag. Any story that’s too large (usually anything over a 13 or 20) absolutely must be broken down into smaller, more manageable user stories.
A massive story is just a bundle of uncertainty and risk waiting to blow up your sprint. Splitting it forces the team to actually think through the requirements more deeply, uncovering hidden dependencies and nasty complexities before a single line of code is written. This "story splitting" isn’t extra work—it's a critical Agile skill.
Do Story Points Only Work in Scrum?
Absolutely not. While they’re most famous in Scrum circles, story points are an Agile estimation tool, not a rule exclusive to one framework.
Kanban teams, for example, can use story points to get a better handle on the size of items in their backlog. This helps them manage their Work In Progress (WIP) limits and see how different-sized tasks flow through their system, even without the fixed-length sprints that Scrum uses. It’s a flexible tool for understanding effort, no matter how you structure your work.
Ready to ditch the estimation arguments and streamline your sprint planning? Momentum unifies your entire Agile workflow, from async standups to built-in story pointing and automated sprint views. Stop juggling tools and start shipping faster. Get started with Momentum for free.
Written by
Avi Siegel
Co-Founder of Momentum. Formerly Product @ Klaviyo, Zaius (acquired by Optimizely), and Upscribe.