Explore tens of thousands of sets crafted by our community.
Data Warehousing Concepts
15
Flashcards
0/15
Dimension Table
A dimension table contains dimensions of a fact. They are used in a data warehouse to describe objects in a fact table and are generally normalized.
Star Schema
A type of database schema that is the simplest style of data mart schema and is organized into fact and dimension tables with a central fact table surrounded by dimension tables.
Metadata Repository
A metadata repository is where metadata is stored in a data warehouse. It defines the warehouse objects such as source, transformations, schema, space allocation, and database objects.
Dimensional Modeling
Dimensional modeling is a data modeling technique in data warehouse design, intended to support end-user queries in a data warehouse. It uses fact tables and dimension tables.
Data Cube
A data structure used to represent data along some measure of interest. It is used in OLAP databases to visualize and analyze data across multiple dimensions.
Fact Table
A fact table is a primary table in a dimensional model. A fact table stores quantitative information for analysis and is often denormalized.
Aggregates
In the context of data warehousing, aggregates are pre-calculated summary or total values that are stored in the data warehouse to enhance query performance.
Online Analytical Processing (OLAP)
OLAP is a category of software that allows users to analyze information from multiple database systems at the same time.
Data Mining
Data mining is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems.
Data Warehousing
A data warehousing is a system used for reporting and data analysis, and is considered a core component of business intelligence.
Data Warehouse Architecture
Data warehouse architecture refers to the design of the database system that allows for the consolidation and integration of data from various sources, typically comprising of tiers such as data source layer, data staging layer, and presentation layer.
Snowflake Schema
An extension of the star schema where the dimension tables are normalized, hence more complex, resulting in a structure that resembles a snowflake shape.
Data Mart
A data mart is a subset of a data warehouse that is often oriented to a specific business line or team.
Extraction, Transformation, and Loading (ETL)
ETL refers to a process in database usage and especially in data warehousing that involves extracting data from outside sources, transforming it to fit operational needs, and loading it into the end target.
Slowly Changing Dimensions (SCD)
SCDs are dimensions that change slowly over time, rather than changing on a regular schedule, time-base. They account for and manage changes in dimension data.
© Hypatia.Tech. 2024 All rights reserved.