Data Centricity Lab Logo

The truth is infinitely complex and a model is merely an approximation to the truth. If the approximation is poor or misleading, then the model is useless. ― T. Tarpey

What is data centricity?

Every data model tells a story about the world. Behind every regression, every neural network, every forecast is a set of assumptions – choices we make about what matters, what can be ignored, and how the underlying reality behaves. These assumptions form the invisible architecture of the insights we derive.

Data centricity is about getting that architecture right. Our focus goes beyond collecting more data or cleaning it better. We ask: are the assumptions that connect data to insights sound? When assumptions hold, models reveal useful insights. When they don't, even the most complex models are useless.

At Data Centricity Lab, we study this gap – the space between raw data and reliable insights for decision making. Our work spans the assumptions embedded in statistical methods (parametric, nonparametric, semi-parametric), the assumptions required by different modeling objectives (causal inference, prediction), and the assumptions that carry forward into the decisions these models inform.

Data

Data

Sales, price, reviews – the raw material

Assumptions

Assumptions

Linearity, consistency – the bridge to models

Model

Model

Regression, XGBoost, neural nets – the lens

From the Lab