Wide and narrow are terms used to describe two different presentations for tabular data.

Terminology

The terms used vary by community and software:

  • Wide and long: Common in modern data science and time-series analysis (e.g., pandas, R).
  • Unstacked and stacked: Common in statistical software and spreadsheet operations.
  • Pivoted and unpivoted: Common in SQL and data preparation tools like Power Query, where "unpivoting" refers to transforming wide data into a long format.
  • Wide and narrow: Common in database modeling.

Wide

Wide, or unstacked, data is presented with each data variable in a separate column.

Person  Age  Weight  Height
Bob     32   168     180
Alice   24   150     175
Steve   64   144     165

Narrow

Narrow, stacked, or long data is presented with one column containing all the values and another column listing the context of each value.

Person  Variable  Value
Bob     Age       32
Bob     Weight    168
Bob     Height    180
Alice   Age       24
Alice   Weight    150
Alice   Height    175
Steve   Age       64
Steve   Weight    144
Steve   Height    165

The narrow format is often easier to implement, since adding a new field does not require any change to the structure of the table; however, it can be harder for people to understand.
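The point about table structure can be illustrated with a minimal sketch in plain Python. The helper name to_narrow and the extra "ShoeSize" variable (with its value) are hypothetical and exist only for illustration; the other values come from the example tables.

```python
# Wide layout: one record per person, one key per variable.
wide = [
    {"Person": "Bob", "Age": 32, "Weight": 168, "Height": 180},
    {"Person": "Alice", "Age": 24, "Weight": 150, "Height": 175},
    {"Person": "Steve", "Age": 64, "Weight": 144, "Height": 165},
]


def to_narrow(rows, id_key="Person"):
    """Return one (id, variable, value) record per cell of the wide rows."""
    return [
        {id_key: row[id_key], "Variable": name, "Value": value}
        for row in rows
        for name, value in row.items()
        if name != id_key
    ]


narrow = to_narrow(wide)

# Recording a new variable only appends a row; the three columns are unchanged.
# "ShoeSize" and its value are made-up illustration data.
narrow.append({"Person": "Bob", "Variable": "ShoeSize", "Value": 43})
```

In the wide layout, the same addition would require a new column for every person, including those with no recorded value.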

Implementations

Many statistical and data processing systems have functions to convert between these two presentations. The R programming language, for instance, has several packages for this purpose. The pandas package in Python implements the wide-to-narrow operation as a function that converts a wide table to a narrow one. Converting a narrow table to a wide table is generally referred to as "pivoting" in the context of data transformations, and pandas also provides a method for this narrow-to-wide transformation.
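As a rough sketch of both conversions in pandas, the example below assumes that DataFrame.melt and DataFrame.pivot are the operations meant here; both exist in pandas, but the text above does not name them. The column names "Variable" and "Value" are chosen to match the example tables.

```python
import pandas as pd

wide = pd.DataFrame({
    "Person": ["Bob", "Alice", "Steve"],
    "Age": [32, 24, 64],
    "Weight": [168, 150, 144],
    "Height": [180, 175, 165],
})

# Wide -> narrow: every non-identifier column becomes (Variable, Value) pairs.
narrow = wide.melt(id_vars=["Person"], var_name="Variable", value_name="Value")

# Narrow -> wide: spread the Variable labels back out into columns.
wide_again = (
    narrow.pivot(index="Person", columns="Variable", values="Value")
    .reset_index()
)
wide_again.columns.name = None  # drop the leftover "Variable" axis label
```

Note that melt emits rows grouped by variable rather than by person, and pivot returns rows and columns in sorted order, so the round trip preserves the values but not the original ordering.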
