I have six questions, each answered on a five-point Likert scale. We are interested in whether participants, overall, "agree" or "disagree" with the questions (or are neutral toward them).
To do this, we currently calculate the average of the six items, which gives a score from 1 to 5, and then collapse that score into three bins: 4 and 5 are "Agree", 3 is "Neutral", and 1 and 2 are "Disagree".
We then compare the observed bin counts to expected counts using a $\chi^2$ goodness-of-fit test. We assume responses are uniformly random over the five options, so we expect 40% agree, 20% neutral, and 40% disagree, since on a five-point scale two options are agree, one is neutral, and two are disagree.
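For concreteness, here is a minimal sketch of our current pipeline in Python (the data are simulated, and the cut points for non-integer averages are my own choice here, since we never pinned them down):

```python
import numpy as np
from scipy.stats import chisquare

rng = np.random.default_rng(0)
# Hypothetical data: 100 participants x 6 items, each rated 1-5.
responses = rng.integers(1, 6, size=(100, 6))

# Step 1: average the six items per participant (score in [1, 5]).
means = responses.mean(axis=1)

# Step 2: collapse each average into three bins. Non-integer averages
# force a choice of cut points; here >= 3.5 is Agree, <= 2.5 is Disagree.
agree = np.sum(means >= 3.5)
disagree = np.sum(means <= 2.5)
neutral = len(means) - agree - disagree

# Step 3: chi-square goodness of fit against the assumed 40% / 20% / 40%.
observed = np.array([agree, neutral, disagree])
expected = np.array([0.4, 0.2, 0.4]) * len(means)
stat, p = chisquare(f_obs=observed, f_exp=expected)
print(observed, round(stat, 2), round(p, 3))
```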
However, I doubt that we can average on a five-point scale, then collapse to a three-point scale, and still claim the same expected proportions. I'm wondering whether the expected-values claim would hold if we instead collapsed each item into three categories before averaging. My colleagues don't see the problem. Is there a mathematical argument that shows whether these approaches (collapse before averaging, or collapse after averaging) are equivalent?
I have doubts that your approach is optimal. I think a more appropriate approach to your question is a two-tailed one-sample t-test with the null-hypothesis population mean set to 3. That will let you detect a statistically significant deviation from the middle of your Likert scale, and you can then look at the actual mean difference and decide whether it is meaningful to you. Moreover, framing the question in this way assumes that you know the true value of the anchors and that a 3 really is the center of psychological space on the variable you are measuring. That is sort of a big assumption (this is one of many reasons why control groups and experimental manipulations are nice).
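A minimal sketch of that test in Python, using simulated per-participant mean scores (the data and sample size are placeholders, not your data):

```python
import numpy as np
from scipy.stats import ttest_1samp

rng = np.random.default_rng(1)
# Hypothetical per-participant mean scores on the 1-5 scale
# (100 participants x 6 items).
scores = rng.integers(1, 6, size=(100, 6)).mean(axis=1)

# Two-tailed one-sample t-test against the scale midpoint of 3.
stat, p = ttest_1samp(scores, popmean=3)
print(f"mean = {scores.mean():.2f}, t = {stat:.2f}, p = {p:.3f}")
```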
Regarding your question itself: binning then averaging will in general give different results than averaging then binning. I started to construct a mathematical example of this, or at the very least a simulation... then I noticed that a lot depends on precisely how you set up your bin boundaries, both when binning the average of the raw items and when binning the average of already-binned items. Another relevant warning is that your bins (as stated) are not of equal size: under a uniform null, the average of six items concentrates near the midpoint, so a central "Neutral" bin will capture far more than 20% of the probability. The amount of the inequality depends on your bin cut points and on where in the pipeline you do your binning. So do be sure your $\chi^2$ goodness-of-fit null hypothesis matches your binning; given that you set the expected proportions up to appropriately match your binning, you will be able to make some sort of 'expected values' claim.
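Here is a quick simulation sketch demonstrating both points (the cut points are assumptions on my part, since you did not specify how non-integer averages are binned):

```python
import numpy as np

rng = np.random.default_rng(42)
# Simulate the uniform null: many participants, 6 items, options 1-5.
responses = rng.integers(1, 6, size=(100_000, 6))

def bin3(x, lo=2.5, hi=3.5):
    """Collapse a score to -1 (Disagree), 0 (Neutral), or +1 (Agree)."""
    return np.where(x >= hi, 1, np.where(x <= lo, -1, 0))

# Pipeline A: average the raw 1-5 items, then bin the average.
avg_then_bin = bin3(responses.mean(axis=1))

# Pipeline B: bin each item first (1-2 -> -1, 3 -> 0, 4-5 -> +1),
# then average the binned items and bin that average around 0.
item_bins = np.where(responses >= 4, 1, np.where(responses <= 2, -1, 0))
bin_then_avg = bin3(item_bins.mean(axis=1), lo=-0.5, hi=0.5)

# The two pipelines classify some participants differently...
print("disagreement rate:", np.mean(avg_then_bin != bin_then_avg))

# ...and under the uniform null neither is close to 40% / 20% / 40%.
for name, b in [("avg then bin", avg_then_bin), ("bin then avg", bin_then_avg)]:
    props = [np.mean(b == k) for k in (-1, 0, 1)]
    print(name, "D/N/A:", np.round(props, 3))
```

For instance, a participant answering (2, 2, 2, 5, 5, 5) averages 3.5 ("Agree" under pipeline A) but bins to (-1, -1, -1, +1, +1, +1), which averages to 0 ("Neutral" under pipeline B).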
As a matter of pure theory, binning loses information, so you are (informationally) better off doing the averaging and then the binning, rather than vice versa. However, that assumes an answer of 1 really does mean something different to you than an answer of 2. If, per your hypothesis, you genuinely treat them as the same, then binning before averaging makes some kind of sense.
Thank you; your suggestion regarding the t-test was enlightening and appropriate.