Course: DATA 201 - Thinking with Data | Data Wrangling
DATA 201 | Data Wrangling Assignment 3: Data Wrangling Screenshot of the raw data (with the empty spaces highlighted) Omitting Omitting Irrelevant Attributes The first thing I did was to omit any attributes that were either empty or had useless information. This included the attributes “name”, “keywords”, “language”, and “currency symbol”. I decided to keep the attributes “occasion” and “currency” because although the data is very small, I thought They could still be meaningful to the research being done. This step addresses the data quality issues of missing data. What I did using Excel was click on the columns I wanted to omit and press delete. This could have also been done using OpenRefine under edit column, and clicking “remove column”. Removing unnecessary data and attributes is important because it increases the quality of the data, especially since they will have no use in the analysis process. Omitting Useless entities After Omitting the attributes, I went ...