Article

The limits of Big Data for analyzing reading

Details

Citation

Rowberry S (2019) The limits of Big Data for analyzing reading. Participations, 16 (1), pp. 237-257. http://www.participations.org/Volume%2016/Issue%201/12.pdf

Abstract
Companies including Jellybooks and Amazon have introduced analytics to collect, analyze and monetize the user’s reading experience. Ebook apps and hardware collect implicit data about reading including progress and speed as well as encouraging readers to share more data through social networks. These practices generate large data sets with millions, if not billions of data points. For example, a copy of the King James Bible on the Kindle features over two million shared highlights. The allure of big data suggests that these metrics can be used at scale to gain a better understanding of how readers interact with books. While data collection practices continue to evolve, it is unclear how the metrics relate to the act of reading. For example, Kindle software tracks which words a reader looks up, but cannot distinguish between accidental look-ups, or otherwise link the act to the reader’s comprehension. In this article, I analyze patent filings and ebook software source code to assess the disconnect between data collection practices and the act of reading. The metrics capture data associated with software use rather than reading and therefore offer a poor approximation of the reading experience and must be corroborated by further data.

Keywords
Reader Analytics; Amazon; Kindle; Ebooks; Big Data; Critical Code Studies; Patents

Journal
Participations: Volume 16, Issue 1

Status	Published
Publication date	31/05/2019
Publication date online	31/05/2019
Date accepted by journal	07/03/2019
URL	http://hdl.handle.net/1893/29576
Publisher URL	http://www.participations.org/Volume%2016/Issue%201/12.pdf
ISSN	1749-8716

Files (1)

Rowberry 16.1