Prepare for the IBM Data Science Exam. Utilize flashcards and multiple-choice questions with hints and explanations to hone your skills. Get exam-ready now!

Practice this question and more.


With ____________ data, you have categorical variables described by groups rather than numbers.

  1. Normalized

  2. Messy

  3. Unstructured

  4. Structured

The correct answer is: Structured

The correct answer is that with structured data, you have categorical variables described by groups rather than numbers. In data science, structured data refers to data that is organized into a defined format, often in rows and columns, like that found in databases or spreadsheets. This organization allows for easy categorization and enables the use of categorical variables, which are variables that represent discrete groups or categories rather than continuous numerical values. Examples of categorical variables include colors, names, or types of products, which can be grouped or categorized for analysis. Each of the other options relates to different characteristics of data. Normalized data refers to the standardization of database tables to reduce redundancy, which does not specifically pertain to categorical variables. Messy data is characterized by inconsistencies, inaccuracies, or incomplete information, which makes it less structured. Unstructured data, on the other hand, lacks a predefined format or organization, often taking forms such as text, images, or videos, making it difficult to analyze with traditional data tools. Therefore, structured data is the most appropriate choice when discussing categorical variables organized by groups.