Qualtrics Data Scrubber

Qualtrics Data Scrubber

Client

Amazon, Alexa Devices

Synopsis

As a side project for my internship at Amazon, I coded a website that speeds up my team's workflow in scrubbing quantitative survey data.

Platform

Website

Role

UX Research, Frontend Development (HTML & CSS & Javascript)

Timeline

Two days in 8/2023

Background

During the summer of 2023, I returned to the Amazon's Alexa HumanFX Team as a UX Designer Intern. I worked with a Senior Technical Product Manager on am upcoming Echo Dot feature meant to drive customer engagement and purchase decision.

Automatic Qualtrics Data Scrubber

During the internship, I had to conduct a quantitative survey (420 respondents) via Qualtrics' Market Research Panel to uncover customer needs. It was a costly endeavor with thousands of dollars spent recruiting the right people to fill out the survey.

Once respondents finish filling out the survey, one major hassle was to conduct a "scrub" of the survey data to identify low-quality responses. For example, in the image below, respondents might have rushed through the question by selecting the same response every time. (Otherwise known as "straight-lining")

Image source: Qualtrics

Some other participants would provide careless responses by alternating between responses in a predictable way. (I call it a "zigzagging patttern")

Image source: Qualtrics

My team's existing solution is to export all responses into a csv file and manually tagging each low-quality response. It was time-consuming for large quantitative survey with hundreds of data points.

To expedite the data scrubbing workflow for our UX Researchers, I coded a web-based platform that automates the discovery of straight-lining and zigzagging responses.

Demo

The platform is specifically designed to scrub data from Qualtrics surveys. To use the platform, we first need to export a csv file from our survey by following the procedure below:

Once we obtain the csv file, we can go to the platform and upload it.

After the file uploads, the dropdown menus in the image below are populated with each individual question in the survey. We can select the specific questions of interest ("Start point" & "Endpoint") and the extent of careless answers ("Number of consecutive answers" & "Number of consecutive zigzags").

In the example below, I choose to investigate a matrix question (Q1.8_1 to Q1.8_10) and filter out all responses that chose the same answer from beginning to end (Number of consecutive answers = 10).

After finishing the settings, I clicked on "Check for Straight-lining" and the platform generates a table with each straight-lining response as a row.

By programmatically gathering low-quality responses, the platform was able to expedite the time-consuming data-scrubbing process and grant higher productivity to UX Researchers on my team.

Play around on the platform by clicking on the image below or visiting https://www.justinzhang.me/data-srcubber-demo!

Learn more about my summer 2023 Amazon internship—where I created the data scrubber project—here.

© Portfolio by Justin Zhang, 2025