A data dictionary traditionally refers to a database dictionary, metadata repository or business glossary. It primarily focuses on the meaning or definition of all columns in a data table.
Atlan's Data Dictionary goes beyond the traditional definition and provides a complete profile of the data.
Apart from defining each column, the Data Dictionary also provides other information—the column's data type, glossary terms attached to it, classifications linked, and stats like missing values.
✨ Spotlight: The best part of Atlan's Data Dictionary is that it isn't located somewhere away from the data. It is located right next to the data table with all information easily accessible.
Atlan's Data Dictionary does not just stop at defining the column. It also provides information that helps you understand the column's properties better.
The best part 🌟 is that most of this information is auto-generated by Atlan's system itself.
Let's look at all the details that are shown in Atlan's Data Dictionary.
The description for each column is shown in the Data Dictionary.
If the description is present in the source, it will get automatically picked up. Otherwise, you can edit or add your own definition.
The data type (like number or text) of each column is also shown, through both text and symbols.
Distinct values: Number and percentage of unique values in the column
Missing values: Number and percentage of missing values in the column
Frequency: Occurrence of each value, shown as a histogram and a value (on hover)
All these metrics are auto-generated when the data catalog runs through the data profile. Check out the automated data profile article to learn more.
To provide full context, the classification and glossary terms attached to the column are also shown.
Want to know more about how to create and add glossary terms? Check out the article below 👇