DW data warehouse DM data mart OLAP on-line analytical processing DS data sources ODS operational data store DSA data staging area DBMS database management system OLTP on-line transaction processing CDC change
Current state-of-the-art architectures of BI systems rely on a centralized data warehouse (DW) or multiple decentralized data marts to store the integrated data set. The process of collecting data from the transactional systems and transporting it into a dedicated storage is called extraction, tran...
language for defining and processing data flows is based on concepts similar to how DW engineers think aboutETL. opendatacenteralliance.org opendatacenteralliance.org 同样地,用于定义和 处理数据流的 Pig 语言基于类似于 DW 工程师对 ETL 的看法的概念。
EDW 实现方法与本书中介绍的DW 总线架构(Data Wa ouse Bus Architecture)方法有本质的不同。EDW 中的很多 主题,需要跟DW 总线的方法对照着来解释。同时,如果将逻辑问题和物理实现 问题分开来看会更有帮助。 从逻辑来看,这两者都提倡对分散在整个企业内部的不同数据源进行统一的 定义。DW 总线架构采用规格化 和...