Low Cardinality: Understanding and Optimising Data with Small Distinct Values
In the world of data science, analytics and databases, the term low cardinality describes columns or features that contain a relatively small number of distinct values. While it sounds simple, the implications are wide-ranging: from storage efficiency to modelling decisions, from query performance to machine learning outcomes. This article unpacks what Low Cardinality means in…
Read more