A Forrester Total Economic Impact™ Study Commissioned By Dataiku, February 2024
In 2024, Forrester predicts that enterprises will be proactive and motivated to develop a meaningful AI strategy, while still considering governance and risk concerns.1 Dataiku offers organizations an AI platform that empowers them in their data analytics efforts and caters to both data and business users. This analysis found that by using Dataiku, organizations experience significant efficiency savings for both data and business users, as well as improved decision-making and considerable cost reductions.
Dataiku is a software company that offers an AI platform to help organizations get the ultimate value from data. Their platform enables organizations to build, deploy, and monitor machine learning models (including both traditional analytics projects and generative AI); perform data preparation and exploration; and design machine learning workflows. Dataiku aims to empower all individuals in an organization to extract valuable insights from their data and make data-driven decisions.
Dataiku commissioned Forrester Consulting to conduct a Total Economic Impact™ (TEI) study and examine the potential return on investment (ROI) enterprises may realize by deploying Dataiku.2 The purpose of this study is to provide readers with a framework to evaluate the potential financial impact of Dataiku on their organizations.
To better understand the benefits, costs, and risks associated with this investment, Forrester interviewed four representatives with experience using Dataiku. For the purposes of this study, Forrester aggregated the interviewees’ experiences and combined the results into a single composite organization that has $10 billion in revenue and is looking to advance its data analytics capabilities.
Interviewed data leaders said that prior to using Dataiku, their organizations relied on in-house tools, which did not meet their analytics needs — especially for AI projects. These in-house tools also did not offer multicloud deployment options. The interviewees’ organizations additionally relied on external third-party vendor models, which lacked transparency and incurred significant consulting expenses. The interviewed decision-makers had concerns about the lack of collaboration and scalability. This was due to data not being appropriately shared across teams and departments, leading to inaccurate data analysis from siloed data. Forrester research suggests that AI/ML platforms can enable AI teams to effectively collaborate, ideate, develop, test, deploy, and monitor AI applications.3
After the investment in Dataiku, the interviewees noted how their organizations democratized access to and the usage of data to multiple roles across their organizations. Interviewees witnessed a substantial positive impact, especially from a standpoint of user productivity, cost efficiency, and decision-making improvements.
Quantified benefits. Three-year, risk-adjusted present value (PV) quantified benefits for the composite organization include:
Unquantified benefits. Benefits that provide value for the interviewees’ organizations but are not quantified for this study include:
Costs. Three-year, risk-adjusted PV costs for the composite organization include:
The representative interviews and financial analysis found that a composite organization experiences benefits of $29.19 million over three years versus costs of $5.69 million, adding up to a net present value (NPV) of $23.50 million and an ROI of 413%.
Return on investment (ROI)
Benefits PV
Net present value (NPV)
Payback
From the information provided in the interviews, Forrester constructed a Total Economic Impact™ framework for those organizations considering an investment Dataiku.
The objective of the framework is to identify the cost, benefit, flexibility, and risk factors that affect the investment decision. Forrester took a multistep approach to evaluate the impact that Dataiku can have on an organization.
Interviewed Dataiku stakeholders and Forrester analysts to gather data relative to Dataiku.
Interviewed four representatives at organizations using Dataiku to obtain data about costs, benefits, and risks.
Designed a composite organization based on characteristics of the interviewees’ organizations.
Constructed a financial model representative of the interviews using the TEI methodology and risk-adjusted the financial model based on issues and concerns of the interviewees.
Employed four fundamental elements of TEI in modeling the investment impact: benefits, costs, flexibility, and risks. Given the increasing sophistication of ROI analyses related to IT investments, Forrester’s TEI methodology provides a complete picture of the total economic impact of purchase decisions. Please see Appendix A for additional information on the TEI methodology.
Readers should be aware of the following:
This study is commissioned by Dataiku and delivered by Forrester Consulting. It is not meant to be used as a competitive analysis.
Forrester makes no assumptions as to the potential ROI that other organizations will receive. Forrester strongly advises that readers use their own estimates within the framework provided in the study to determine the appropriateness of an investment in Dataiku.
Dataiku reviewed and provided feedback to Forrester, but Forrester maintains editorial control over the study and its findings and does not accept changes to the study that contradict Forrester’s findings or obscure the meaning of the study.
Dataiku provided the customer names for the interviews but did not participate in the interviews.
Consulting Team:
Elia Gollini
Bharath Sivan
Sanny Mok
| Role | Industry | Number Of | Revenue |
|---|---|---|---|
| Chief data scientist | Financial services | 500 | $9 billion |
| Head of data and analytics | Fashion | 350 | $6 billion |
| Analytics and data science product owner | Pharmaceuticals | 900 | $50 billion |
| Data science lead | Energy | 1,100 | $8 billion |
Before investing in Dataiku, interviewees noted they had siloed data and individual data analysis across their organizations. They needed a shared data platform to empower business and data users. The interviewees noted how their organizations struggled with common challenges, including:
The interviewees’ organizations searched for a solution that could:
Based on the interviews, Forrester constructed a TEI framework, a composite company, and an ROI analysis that illustrates the areas financially affected. The composite organization is representative of the four interviewees, and it is used to present the aggregate financial analysis in the next section. The composite organization has the following characteristics:
Description of composite. A global organization with $10 billion in revenue and 40,000 employees that is looking to advance its data analytics capabilities. The composite has siloed data and individual data analysis. It needed a shared data platform to empower business and data users. In its previous state, the composite faced key challenges like democratizing data access and usage, improving efficiency, reducing time to value, automating manual processes, and improving compliance. Dataiku helps the composite overcome these challenges.
Deployment characteristics. Though Dataiku does offer a cloud-based solution, the composite has an on-premises Dataiku deployment and starts using Dataiku with 50 licenses in Year 1. In Years 2 and 3, the number of licenses and users of Dataiku increases to 300 and 700, respectively. Half of the users can be categorized as data science/data engineering users, while the other half represents business users and data analysts.
| Ref. | Benefit | Year 1 | Year 2 | Year 3 | Total | Present Value |
|---|---|---|---|---|---|---|
| Atr | Increased user productivity | $541,048 | $3,232,921 | $7,521,205 | $11,295,173 | $8,814,489 |
| Btr | Reduced costs on data analytics tools and consultancies/third-party providers | $860,000 | $1,400,000 | $2,240,000 | $4,500,000 | $3,621,788 |
| Ctr | Business users’ efficiency savings | $67,680 | $487,296 | $1,516,032 | $2,071,008 | $1,603,269 |
| Dtr | Improved decision-making | $2,760,000 | $5,040,000 | $11,280,000 | $19,080,000 | $15,149,211 |
| Total benefits (risk-adjusted) | $4,228,728 | $10,160,217 | $22,557,237 | $36,946,181 | $29,188,757 | |
Evidence and data. Interviewees highlighted how using the Dataiku platform helped their data science and data engineer users benefit from substantial efficiency savings. Users reused data products and models, improving their efficiency.
Modeling and assumptions. Based on the interviews, Forrester assumes the following about the composite organization:
Risks. This benefit may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this benefit downward by 15%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $8.8 million.
| Ref. | Metric | Source | Year 1 | Year 2 | Year 3 |
|---|---|---|---|---|---|
| A1 | Total number of data science/analytics projects | Composite | 4 | 20 | 50 |
| A2 | Total number of users extracting and analyzing data (data scientists/data engineers) | Interviews | 25 | 150 | 350 |
| A3 | Time needed before Dataiku for data extraction and analysis per user (days) | Interviews | 105 | 105 | 105 |
| A4 | Percentage reduction in time needed to extract and analyze data due to Dataiku | Interviews | 71% | 71% | 71% |
| A5 | Average developer salary (daily) | TEI standard | $624 | $624 | $624 |
| A6 | Productivity capture rate | TEI standard | 50% | 50% | 50% |
| A7 | Subtotal: Faster data analysis and extraction | A2*A3*A4*A5*A6 | $581,490 | $3,488,940 | $8,140,860 |
| A8 | Total number of users working on model training, deployment, and monitoring (data scientists/data engineers) | Interviews | 7 | 40 | 90 |
| A9 | Time needed for model lifecycle before Dataiku (days) | Interviews | 60 | 60 | 60 |
| A10 | Percentage reduction in model lifecycle time due to Dataiku | Interviews | 42% | 42% | 42% |
| A11 | Subtotal: Model lifecycle efficiency savings | A8*A9*A10*A5*A6 | $55,037 | $314,496 | $707,616 |
| At | Increased user productivity | A7+A11 | $636,527 | $3,803,436 | $8,848,476 |
| Risk adjustment | ↓15% | ||||
| Atr | Increased user productivity (risk-adjusted) | $541,048 | $3,232,921 | $7,521,205 | |
| Three-year total: $11,295,173 | Three-year present value: $8,814,489 | ||||
Evidence and data. Before using the Dataiku platform, interviewees underlined how their organizations relied on many additional tools to enable data analysis within their organizations and leveraged external third parties, such as consultancies, to carry out data analysis work. By using Dataiku and enabling their own users to work with data, the interviewees’ organizations realized cost savings from data analytics tools and consultancies/third-party providers. The analytics and data science product owner at a pharmaceutical company has highlighted: “We are moving around 250 users from a statistical tool to Dataiku. Our annual contract was around $2.2 million or $2.1 million. Now, we have decommissioned that.”
The same interviewee also said: “Earlier, we used to send data to a consulting company, and they used to run it for us. They were charging us nearly a million dollars a year to run it. We stopped this and brought it in-house.”
Modeling and assumptions. Based on the interviews, Forrester assumes the following about the composite organization:
Risks. This benefit may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this benefit downward by 20%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $3.6 million.
| Ref. | Metric | Source | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|
| B1 | Number of licenses reduced from other analytical tools | Interviews | 50 | 100 | 200 | |
| B2 | Average analytical tool license fee per user | Interviews | $6,500 | $6,500 | $6,500 | |
| B3 | Reduced cost on data analytics tools | B1*B2 | $325,000 | $650,000 | $1,300,000 | |
| B4 | Cost avoidance from leveraging consultancies/third-party providers | Interviews | $750,000 | $1,100,000 | $1,500,000 | |
| Bt | Reduced costs on data analytics tools and consultancies/third-party providers | B3+B4 | $1,075,000 | $1,750,000 | $2,800,000 | |
| Risk adjustment | ↓20% | |||||
| Btr | Reduced costs on data analytics tools and consultancies/third-party providers (risk-adjusted) | $860,000 | $1,400,000 | $2,240,000 | ||
| Three-year total: $4,500,000 | Three-year present value: $3,621,788 | |||||
Evidence and data. The interviewed data leader have highlighted how the outcomes of the work done on Dataiku benefited their entire organizations. Their organizations realized efficiencies thanks to Dataiku, mainly due to the automation of a variety of manual processes. Interviewees provided various examples of processes that previously required substantial manual effort and that were automated:
Modeling and assumptions. Based on the interviews, Forrester assumes the following about the composite organization:
Risks. This benefit may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this benefit downward by 20%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $1.6 million.
| Ref. | Metric | Source | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|
| C1 | Number of business users | Composite | 25 | 150 | 350 | |
| C2 | Days spent on manual processes before Dataiku | Interviews | 60 | 60 | 60 | |
| C3 | Percentage of time saved due to Dataiku | Interviews | 50% | 60% | 80% | |
| C4 | Average business user salary (hourly) | TEI standard | $47 | $47 | $47 | |
| C5 | Productivity capture rate | TEI standard | 30% | 30% | 30% | |
| Ct | Business users’ efficiency savings | C1*C2*C3*(C4*8)*C5 | $84,600 | $609,120 | $1,895,040 | |
| Risk adjustment | ↓20% | |||||
| Ctr | Business users’ efficiency savings (risk-adjusted) | $67,680 | $487,296 | $1,516,032 | ||
| Three-year total: $2,071,008 | Three-year present value: $1,603,269 | |||||
Evidence and data. Interviewees described the impact Dataiku has on their companies’ revenues. The interviewees organizations used Dataiku for analysis and prediction purposes. As Dataiku adoption increased, the percentage of revenue that was analyzed and predicted increased throughout the years and interviewees attributed a part of the impact directly to Dataiku. On top of this, interviewees noted their organizations also used Dataiku for risk estimation use cases and recovered losses they would have previously faced thanks to the democratized usage and access of data across their organizations. The data science lead at an energy company emphasized how now their organization tackled issues precisely because of the data democratization enabled by Dataiku, stating, “The key value is that the platform is easy enough to be used by, for example, a maintenance technician and not only by someone with a data science degree.”
Modeling and assumptions. Based on the interviews, Forrester assumes the following about the composite organization:
Risks. This benefit may vary for organizations based on:
Results. To account for these risks, Forrester adjusted this benefit downward by 20%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $15.1 million.
| Ref. | Metric | Source | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|
| D1 | Composite revenue | Composite | $10,000,000,000 | $10,000,000,000 | $10,000,000,000 | |
| D2 | Percentage of revenue impacted by Dataiku’s usage | Interviews | 10% | 15% | 30% | |
| D3 | Percentage of revenue realized due to Dataiku | Interviews | 1% | 1% | 1% | |
| D4 | Operating profit margin | TEI standard | 12% | 12% | 12% | |
| D5 | Subtotal: Commercial impact on revenue due to Dataiku | D1*D2*D3*D4 | $1,200,000 | $1,800,000 | $3,600,000 | |
| D6 | Average amount of yearly losses faced that could not be tackled before Dataiku | Interviews | $37,500,000 | $37,500,000 | $37,500,000 | |
| D7 | Percentage of issues that can be identified due to Dataiku’s data access and usage democratization | Interviews | 15% | 30% | 70% | |
| D8 | Dataiku attribution | Composite | 40% | 40% | 40% | |
| D9 | Subtotal: Ability to recover losses due to Dataiku’s data democratization | D6*D7*D8 | $2,250,000 | $4,500,000 | $10,500,000 | |
| Dt | Improved decision-making | D5+D9 | $3,450,000 | $6,300,000 | $14,100,000 | |
| Risk adjustment | ↓20% | |||||
| Dtr | Improved decision-making (risk-adjusted) | $2,760,000 | $5,040,000 | $11,280,000 | ||
| Three-year total: $19,080,000 | Three-year present value: $15,149,211 | |||||
Interviewees mentioned the following additional benefits that their organizations experienced but were not able to quantify:
The value of flexibility is unique to each customer. There are multiple scenarios in which a customer might implement Dataiku and later realize additional uses and business opportunities, including:
Flexibility would also be quantified when evaluated as part of a specific project (described in more detail in Appendix A).
| Ref. | Cost | Initial | Year 1 | Year 2 | Year 3 | Total | Present Value |
|---|---|---|---|---|---|---|---|
| Etr | Platform license fees | $0 | $302,500 | $1,650,000 | $3,465,000 | $5,417,500 | $4,241,942 |
| Ftr | Implementation and ongoing maintenance costs | $124,080 | $215,072 | $322,608 | $430,144 | $1,091,904 | $909,392 |
| Gtr | User training costs | $0 | $48,720 | $243,600 | $389,760 | $682,080 | $538,446 |
| Total costs (risk-adjusted) | $124,080 | $566,292 | $2,216,208 | $4,284,904 | $7,191,484 | $5,689,780 | |
Evidence and data. Interviewees noted that Dataiku is charged on a per user basis. The platform license fee increased as the number of users in the interviewees’ organizations grows.
Modeling and assumptions. To quantify this cost, Forrester assumes the following:
Risks. This cost may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this cost upward by 10%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $4.2 million.
| Ref. | Metric | Source | Initial | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|---|
| E1 | Total platform license fees | Interviews | $275,000 | $1,500,000 | $3,150,000 | ||
| Et | Platform license fees | E1 | $275,000 | $1,500,000 | $3,150,000 | ||
| Risk adjustment | ↑10% | ||||||
| Etr | Platform license fees (risk-adjusted) | $0 | $302,500 | $1,650,000 | $3,465,000 | ||
| Three-year total: $5,417,500 | Three-year present value: $4,241,942 | ||||||
Evidence and data. Interviewees said their organizations allocated resources for the implementation of the Dataiku platform. Once the platform was implemented, the interviewees also dedicated resources to the ongoing management of the platform. These resources mainly focused on platform administration and training and user support.
Modeling and assumptions. To quantify this cost, Forrester assumes the following:
Risks. This cost may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this cost upward by 10%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $909,000.
| Ref. | Metric | Source | Initial | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|---|
| F1 | FTEs dedicated to implementation | Interviews | 4 | ||||
| F2 | Length of implementation (days) | Interviews | 75 | ||||
| F3 | Platform admin FTEs | Composite | 1 | 1 | 1 | ||
| F4 | Training and user support FTEs | Composite | 1 | 2 | 3 | ||
| F5 | Average business user salary (yearly) | TEI standard | $97,760 | $97,760 | $97,760 | $97,760 | |
| Ft | Implementation and ongoing maintenance costs | F1*F2*(F6/260)+ ((F3+F4)*F5) | $112,800 | $195,520 | $293,280 | $391,040 | |
| Risk adjustment | ↑10% | ||||||
| Ftr | Implementation and ongoing maintenance costs (risk-adjusted) | $124,080 | $215,072 | $322,608 | $430,144 | ||
| Three-year total: $1,091,904 | Three-year present value: $909,392 | ||||||
Evidence and data. Interviewees noted that user training on Dataiku differed depending on the type of user. Users needed to dedicate some of their own time to understand the platform and get used to working with it. The time needed for users to train on the platform varied depending on whether they were data scientists and engineers, or business users and data analysts.
Modeling and assumptions. To quantify this cost, Forrester assumes the following:
Risks. This cost may vary for different organizations based on:
Results. To account for these risks, Forrester adjusted this cost upward by 20%, yielding a three-year, risk-adjusted total PV (discounted at 10%) of $538,000.
| Ref. | Metric | Source | Initial | Year 1 | Year 2 | Year 3 | |
|---|---|---|---|---|---|---|---|
| G1 | Number of data scientists and data engineers | Composite | 25 | 150 | 350 | ||
| G2 | Net-new data scientists and data engineers | Composite | 25 | 125 | 200 | ||
| G3 | Training hours per data scientist and data engineer | Assumption | 16 | 16 | 16 | ||
| G4 | Average data scientist and data engineer salary (hourly) | TEI standard | $78 | $78 | $78 | ||
| G5 | Number of data analysts and business users | Composite | 25 | 150 | 350 | ||
| G6 | Net new data analysts and business users | Composite | 25 | 125 | 200 | ||
| G7 | Training hours per data analysts and business users | Composite | 8 | 8 | 8 | ||
| G8 | Average data analyst and business users salary (hourly) | TEI standard | $47 | $47 | $47 | ||
| Gt | User training costs | (G2*G3*G4)+ | $0 | $40,600 | $203,000 | $324,800 | |
| Risk adjustment | ↑20% | ||||||
| Gtr | User training costs (risk-adjusted) | $0 | $48,720 | $243,600 | $389,760 | ||
| Three-year total: $682,080 | Three-year present value: $538,446 | ||||||
The financial results calculated in the Benefits and Costs sections can be used to determine the ROI, NPV, and payback period for the composite organization’s investment. Forrester assumes a yearly discount rate of 10% for this analysis.
These risk-adjusted ROI, NPV, and payback period values are determined by applying risk-adjustment factors to the unadjusted results in each Benefit and Cost section.
| Initial | Year 1 | Year 2 | Year 3 | Total | Present Value | |
|---|---|---|---|---|---|---|
| Total costs | ($124,080) | ($566,292) | ($2,216,208) | ($4,284,904) | ($7,191,484) | ($5,689,780) |
| Total benefits | $0 | $4,228,728 | $10,160,217 | $22,557,237 | $36,946,181 | $29,188,757 |
| Net benefits | ($124,080) | $3,662,436 | $7,944,009 | $18,272,333 | $29,754,697 | $23,498,977 |
| ROI | 413% | |||||
| Payback | <6 months | |||||
Total Economic Impact is a methodology developed by Forrester Research that enhances a company’s technology decision-making processes and assists vendors in communicating the value proposition of their products and services to clients. The TEI methodology helps companies demonstrate, justify, and realize the tangible value of IT initiatives to both senior management and other key business stakeholders.
Benefits represent the value delivered to the business by the product. The TEI methodology places equal weight on the measure of benefits and the measure of costs, allowing for a full examination of the effect of the technology on the entire organization.
Costs consider all expenses necessary to deliver the proposed value, or benefits, of the product. The cost category within TEI captures incremental costs over the existing environment for ongoing costs associated with the solution.
Flexibility represents the strategic value that can be obtained for some future additional investment building on top of the initial investment already made. Having the ability to capture that benefit has a PV that can be estimated.
Risks measure the uncertainty of benefit and cost estimates given: 1) the likelihood that estimates will meet original projections and 2) the likelihood that estimates will be tracked over time. TEI risk factors are based on “triangular distribution.”
The initial investment column contains costs incurred at “time 0” or at the beginning of Year 1 that are not discounted. All other cash flows are discounted using the discount rate at the end of the year. PV calculations are calculated for each total cost and benefit estimate. NPV calculations in the summary tables are the sum of the initial investment and the discounted cash flows in each year. Sums and present value calculations of the Total Benefits, Total Costs, and Cash Flow tables may not exactly add up, as some rounding may occur.
1 Source: “Predictions 2024: Artificial Intelligence,” Forrester Research, Inc., October 26, 2023.
2 Total Economic Impact is a methodology developed by Forrester Research that enhances a company’s technology decision-making processes and assists vendors in communicating the value proposition of their products and services to clients. The TEI methodology helps companies demonstrate, justify, and realize the tangible value of IT initiatives to both senior management and other key business stakeholders.
3 Source: “Enterprises Must Invest In AI Platforms To Empower Multirole AI Teams,” Forrester Research, Inc., August 26, 2022.
4 Source: “The Architect’s Guide To Generative AI,” Forrester Research, Inc., January 12, 2024.
5 Source: “The State Of Generative AI, 2024,” Forrester Research, Inc., January 26, 2024.
Cookie Preferences
Accept Cookies
A cookie is a small text file that a website saves on your computer or mobile device when you visit the site. It enables the website to remember your actions (data inputs, website navigation), so you don’t have to re-enter data when you come back to the site or browse from one page to another.
Behavioral information collected by our web analytics vendor is used to analyze data pertaining to visitor trends, plan website enhancements, and measure overall website effectiveness. We may also use cookies or web beacons to help us offer you products, programs, or services that may be of interest to you and to deliver relevant advertising. We may use third-party advertising companies to help tailor website content to users or to serve ads on our behalf. These companies may also employ cookies and web beacons to measure advertising effectiveness.
Please accept cookies and the collection of behavioral information to receive full functionality and enhance your experience. If you decline cookies, some features of the website may not function normally.
Please see our
Privacy Policy for more information.