Data Integration (DI), also known as ‘ETL’, is the analysis, combination, and transformation of data from a variety of sources and formats into a unified data model representation. Data Integration is a key element of data warehousing, application integration, and business analytics solutions. The variety and volume of data is always increasing and performance of data integration systems is critical. However, there has been no industry standard for measuring and comparing the performance of DI systems.
The TPC-DI benchmark subcommittee is continuing refinement of the specification. Definition of the System Under Test and rules for Pricing have been specified, and progress has been made on the definition of the automated audit. Development of the data generator is ongoing.