December 12, 2023

GenAI helps low-performing Software teams >2x more than high-performing ones

Generative AI tools boosted productivity by ~15% in low-performing teams but only ~6% in high-performing ones, as shown in a 9-month study of five comparable software teams. This disparity suggests GenAI may provide greater value to teams with more room for improvement. Using a Stanford-developed algorithm, we objectively measured productivity based on the functional impact of code changes, offering a data-driven alternative to traditional metrics like Lines of Code and DORA.

Yegor Denisov-Blanch

4 mins

Low-performing teams saw a ~15% increase in productivity from the use of Generative AI tools, while high-performing teams saw just a ~6% increase.

What could be the reasons behind this disparity?

In this case study, we examined 5 software teams using GenAI for >9 months. These teams were comparable in size, composition, location, programming language use, experience, turnover, GenAI tool use, and project type, allowing us to isolate the impact of GenAI.

We categorized them into high-performing (top ~30%) and low-performing (bottom ~30%) groups, analyzing their productivity before and after using GenAI for 9+ months each.
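The before/after comparison and the ">2x" headline can be reproduced with a few lines of arithmetic. The sketch below is only an illustration: the productivity scores are hypothetical values in arbitrary units, chosen so that the gains match the ~15% and ~6% figures, and `percent_change` is a helper name introduced here, not part of the study's tooling.

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, as a fraction (0.15 == 15%)."""
    if before <= 0:
        raise ValueError("baseline productivity must be positive")
    return (after - before) / before


# Hypothetical productivity scores in arbitrary units, chosen only to
# reproduce the ~15% and ~6% figures; they are not data from the study.
low_before, low_after = 100.0, 115.0
high_before, high_after = 200.0, 212.0

low_gain = percent_change(low_before, low_after)
high_gain = percent_change(high_before, high_after)
gain_ratio = low_gain / high_gain

print(f"low-performing gain:  {low_gain:.0%}")
print(f"high-performing gain: {high_gain:.0%}")
print(f"ratio of gains:       {gain_ratio:.1f}x")
```

With these made-up inputs the ratio of relative gains comes out at about 2.5, which is what ">2x" refers to. The comparison is relative to each group's own baseline, so it says nothing by itself about absolute output.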

Despite its small sample size, this case study is among the first to objectively evaluate GenAI's impact on software development productivity.

Traditional productivity metrics (Lines of Code, Commits/PRs, Story Points, DORA metrics) can misrepresent developer productivity: they don’t analyze source code, so they correlate only loosely with productivity; they can encourage counterproductive behavior; and they are easy to manipulate.
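As a small illustration of why a size-based metric can mislead, the sketch below compares two hypothetical implementations of the same function. The snippets, the `count_loc` helper, and the `load_total` helper are all invented for this example; the point is only that a naive Lines of Code count differs while the behavior does not.

```python
# Two hypothetical implementations with identical behavior but different sizes.
COMPACT = """
def total(values):
    return sum(values)
"""

PADDED = """
def total(values):
    result = 0
    for value in values:
        result = result + value
    final = result
    return final
"""


def count_loc(source: str) -> int:
    """Count non-blank lines, a naive Lines of Code metric."""
    return sum(1 for line in source.splitlines() if line.strip())


def load_total(source: str):
    """Execute the source and return the `total` function it defines."""
    namespace = {}
    exec(source, namespace)
    return namespace["total"]


sample = [3, 4, 5]
compact_total = load_total(COMPACT)
padded_total = load_total(PADDED)

# The padded version scores three times higher on LoC for the same result.
print(count_loc(COMPACT), count_loc(PADDED))
print(compact_total(sample) == padded_total(sample))
```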

How do we measure developer productivity in this Stanford case study?

Our algorithm measures developer productivity by analyzing the functionality (i.e., what the code does) of code changes in Git commits. It weighs 30+ codebase dimensions and has been calibrated across millions of files in 10+ languages. This quantifies each commit's impact, which we then combine with Git metadata to measure individual and team productivity.
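The last step, combining per-commit impact with Git metadata, might look roughly like the sketch below. It does not reproduce the algorithm itself: the `impact` values are hypothetical placeholder scores, and the `Commit` record, the author-to-team mapping, and the `total_impact` helper are assumptions made for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Commit:
    author: str    # taken from Git metadata
    team: str      # hypothetical author-to-team mapping
    impact: float  # hypothetical per-commit impact score


def total_impact(commits, key):
    """Sum impact scores grouped by `key(commit)`."""
    totals = defaultdict(float)
    for commit in commits:
        totals[key(commit)] += commit.impact
    return dict(totals)


# Made-up commits; the scores are placeholders, not output of the real model.
commits = [
    Commit("alice", "team-a", 4.0),
    Commit("bob", "team-a", 1.5),
    Commit("alice", "team-a", 2.5),
    Commit("carol", "team-b", 3.0),
]

by_author = total_impact(commits, key=lambda c: c.author)
by_team = total_impact(commits, key=lambda c: c.team)
print(by_author)
print(by_team)
```

Grouping by a key function keeps the same aggregation usable for individuals, teams, or time windows, assuming each commit already carries a score.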

About Our Mission
- We are conducting research at Stanford focused on quantifying software engineering productivity.
- Our goal is to help engineering teams make decisions grounded in hard data, moving away from intuition and office politics.
- Participants in our research use our algorithm to make data-driven decisions about team performance, headcount, outsourcing, work settings (home vs office), etc.

Similar Insights

How useful do you find benchmarks of a software engineering org’s productivity?

We conduct productivity benchmarks to diagnose issues, spotlight high-performing teams, and promote best practices. By leveraging data-driven insights, our model provides objective, scalable assessments of software engineering output, fostering transparent, constructive discussions to address challenges and optimize performance.

4 mins

Companies Without Productivity Measurement Tools Rely on Their CTO's Experience and Intuition

As engineering teams grow, relying on intuition and experience to assess productivity becomes unsustainable. Traditional metrics like story points or commit counts are flawed and easily manipulated. Our research at Stanford suggests a more effective approach: analyzing code downstream to uncover team dynamics and bottlenecks. While manual code reviews can yield accurate insights, they’re slow and costly. That’s why we developed a groundbreaking tool that evaluates code with precision, speed, and cost-efficiency—providing data-driven support to enhance developer productivity at scale.

3 mins

Software outsourcing teams with similar price tags can deliver vastly different results.

This case study compares five software teams from three agencies working on similar projects, uncovering significant performance gaps. While Net Promoter Score (NPS) highlights client expectations, it fails to measure the true value teams deliver for their cost. By analyzing over 2 billion lines of code, we aim to provide transparent, data-driven insights to optimize team performance and outsourcing decisions.

3 mins

Objective Productivity Data for Smarter Engineering Team Decisions

Home

Platform

Let's talk

Book a Demo

© 2024 P10Y. All rights reserved.

Subscription Fulfillment Policy
