(rather than what kind ofβ¦
If your goal is to find out how people on reddit actually talk about it (rather than what kind of discussion from other people on the subject redditors value), a cutoff at the 100 karma threshhold is probably going to produce horribly skewed results: because karma determines presentation order, it grows non-linearly, which means that only a few very popular comments will get more than single-digit karma. If you want to get a smaller data set but keep it representative, I would isolate comments with exactly one karma point.
By John Ohno on April 21, 2017.
[Canonical link](https://medium.com/@enkiv2/if-your-goal-is-to-find-out-how- people-on-reddit-actually-talk-about-it-rather-than-what-kind-of-8b2b4538e193)
Exported from Medium on September 18, 2020.
Rendering context...