Elsevier

Technovation

Volumes 55–56, September–October 2016, Pages 14-21
Technovation

Crowdsourcing not all sourced by the crowd: An observation on the behavior of Wikipedia participants

https://doi.org/10.1016/j.technovation.2016.05.002Get rights and content

Highlights

  • Wikipedia articles are not always rightly sourced from the crowd.
  • Wikipedia articles are often maintained by a dominant few.
  • The dominance of the few participants becomes more severe as the crowd matures.
  • Intrinsic motivation encourages Wiki participants to engage in revising articles.

Abstract

This study investigates the behavioral patterns of Wikipedia participants to obtain a picture of internal dynamics of the world's largest crowdsourcing platform. It observes the responses of people when “other” people enter a crowd where internal and external controls are mostly absent. From the analysis of 342 Wikipedia articles, this study shows that the overall tone of Wikipedia articles is mostly decided by a dominant few rather than by a trivial many, and such domination worsens as the number of participant increases and the article matures. This result contradicts a common belief on crowdsourcing that Wikipedia would reflect the voices of a vast majority, obtain a balanced solution, and attain democracy on the Internet. Therefore, this study contributes to the literature by analyzing how effectively Wikipedia functions as a crowdsourcing platform within the context. It also implies that developing a proper crowdsourcing strategy such as effective management of a platform is necessary, especially when an organization has a specific goal to achieve throughout a project.

Introduction

To understand the dynamics of a complex crowdsourcing platform, this study investigates the behavioral patterns of Wikipedia participants. We observe the responses of participants when “other” people enter a large crowdsourcing platform, Wikipedia, where internal and external controls are mostly absent. This observation would elucidate how the behaviors of crowdsourcing participants change as the crowd grows larger and subsequently provide insights into the internal mechanism of the complex crowdsourcing platform.
Nowadays, people are connected more than ever because of the Internet, which forces users to be part of an unexpected but fully functioning collaboration (Doan et al., 2011). For example, reCAPTCHA is a user-dialogue system that allows Project Gutenberg to digitize public domain materials, such as ancient books that cannot be read by optical character recognition (OCR) software (von Ahn et al., 2008). The system collects contributions from more than 100 million Internet users every day from websites such as Facebook, Twitter, and Craigslist.
Such strategic collaboration through the Internet has been conceptualized and developed into the term “crowdsourcing” (Albors et al., 2008). Crowdsourcing does not simply refer to the gathering of resources from numerous people; rather, the concept emphasizes the synergic and value-added effects created by resources collected from the “crowd.” Crowdsourcing differs from typical Internet-based collaboration in that the former emphasizes the value created from the “crowd” rather than from a “vital few” (Kittur, 2010).
As crowdsourcing phenomena continue to evolve in forms and functions, various types of crowdsourcing platforms emerge (Kohler, 2015). For example, one popular type of platform is tournament-style intermediaries for solution and idea crowdsourcing (Prpić et al., 2015a), such as Kaggle.com for predictive modeling projects, Threadless.com for product design, InnoCentive.com for research and development, and TopCoder.com for software development projects (Dissanayake et al., 2015). These platforms have established a large pool of self-selected participants, who are problem solvers rather than workers for hire. Organizations often commission crowdsourcing intermediaries as a paid platform service to acquire their own crowds (Prpić et al., 2015b).
As the crowdsourcing approach widely spreads, the dynamics inside a crowd is gaining attention from researchers and practitioners (Afuah and Tucci, 2012). When the number of participants of a crowdsourcing project increases, the intricacy of coordination among the participants also increases. The more openness crowdsourcing embeds, the more likely crowdsourcing be abused by people with malice or incompetence (Yasseri et al., 2014). Furthermore, crowdsourcing is concerned not only with ways to collect resources, but also with methods to create value through fair assessment and management of collected resources (Bonabeau, 2009). Therefore, crowdsourcing can be considered a double-edged sword that can either build knowledge or generate misinformation (Leimeister et al., 2009).
For these reasons, the current study investigates the basic behavioral patterns of crowdsourcing participants and selects Wikipedia as a crowdsourcing platform to explore. Wikipedia is selected because it is one of the largest crowdsourcing platforms with a notably low level of control and a high level of dynamics of people with pure motivation. While organizations actively utilize and attempt to control various crowdsourcing platforms such as Kaggle and Innocentive for their business interests and create their own proprietary crowds (Piezunka and Dahlander, 2015), people contribute to Wikipedia to achieve intrinsic motivation, such as seeking the truth, rather than extrinsic motivation, such as monetary incentives. Hence, Wikipedia, which is designed as an Internet-based control-free discussion agora, is suitable as a platform for testing the participating behaviors of crowdsourcing participants (Martinez and Walton, 2014). Observing the behavior of Wikipedia participants in various aspects will provide researchers and practitioners with an in-depth understanding of crowdsourcing at the principal motivation level.
This study is organized as follows. We first review the literature on crowdsourcing and Wikipedia as a crowdsourcing platform with low control. We then formulate three primary hypotheses on the behavioral pattern changes of Wikipedia participants as the crowdsourcing project matures. Next, we collect data from 342 Wikipedia articles and analyze the article creation and revision patterns to validate the hypotheses. Finally, we summarize our results and discuss implications.

Section snippets

Crowdsourcing

Studies on crowdsourcing can be divided into three folds: exploratory papers such as conceptualizations and taxonomies, practice-based case studies, and theoretically applied studies. The first group focuses on basic features of crowdsourcing such as definition and taxonomy. Albors et al. (2008) overview the new types of network collaboration paradigms including motivation and consequences, from the various types of crowdsourcing. Doan et al. (2011) review how the practice of crowdsourcing

People talk faster as crowd matures

When everyone is allowed to create and revise an article independently, as practiced in Wikipedia, people revise an article when the information in the article is insufficient and/or when the contents contradict their perspectives. The motivations for article revision on Wikipedia are not extrinsic, such as economic incentive or external pressure, but mostly intrinsic self-concepts, such as pursuit of inherent standards (Leonard et al., 1999) and/or a sense of personal achievement (Yang and

Data collection and descriptive study

Data were collected from Wikipedia articles that were selected based on various lists of the world's most influential people. We combined a total of seven lists, including Hart's Ranking of the 100 Most Influential People, The All-Time TIME 100 of All Time, and The Top 100 Influential Figures in American History by the Atlantic (full lists are available upon request).
The list was selected for the following reasons. First, the subjects of the articles are well-known worldwide and thus attract

Summary of Findings

Test results of the hypotheses highlight that “crowd” power in Wikipedia is not as apparent as it was previously thought. On the contrary, the strong leadership of the dominant few (i.e., a small group of active users) is evident in the content creation process. Participants who actively edit articles to defend their beliefs and opinions revise the articles more frequently as the platform becomes crowded and mature. The frequency of their editing increases faster than those who are not as

Conclusion

This study investigates the behavioral patterns of Wikipedia participants to obtain a picture of internal dynamics of the world's largest crowdsourcing platform. Understanding the inner workings of Wikipedia is vital because it is a prototype of evolving crowdsourcing platforms, which are now widely adopted and utilized by organizations and businesses in various forms (Zhao and Zhu, 2014). From the analysis of 342 articles from Wikipedia, this study shows that the overall tone of Wikipedia

Acknowledgement

This study is supported by Hankuk University of Foreign Studies (HUFS) Research Fund of 2016.

References (47)

  • D. Nevo et al.

    Primary vendor capabilities in a mediated outsourcing model: can IT service providers leverage crowdsourcing?

    Decis. Support Syst.

    (2014)
  • J. Prpić et al.

    How to work a crowd: developing crowd capital through crowdsourcing

    Bus. Horiz.

    (2015)
  • R.M. Ryan et al.

    Intrinsic and extrinsic motivations: classic definitions and new directions

    Contemp. Educ. Psychol.

    (2000)
  • C. Standing et al.

    How can organizations use wikis for innovation?

    Technovation

    (2011)
  • H. Yang et al.

    Motivations of Wikipedia content contributors

    Comput. Human. Behav.

    (2010)
  • A. Afuah et al.

    Crowdsourcing as a solution to distant search

    Acad. Manag. Rev.

    (2012)
  • BBC News, 2013. Top. That Spark Wikipedia ‘edit wars’ revealed, 18 July 2013...
  • E. Bonabeau

    Decisions 2.0: the power of collective intelligence

    MIT Sloan Manag. Rev.

    (2009)
  • M. Boons et al.

    Feelings of pride and respect as drivers of ongoing member activity on crowdsourcing platforms

    J. Manag. Stud.

    (2015)
  • D.C. Brabham

    Crowdsourcing the public participation process for planning projects

    Plan. Theory

    (2009)
  • D.C. Brabham

    Moving the crowd at threadless: motivations for participation in a crowdsourcing application

    Inf. Commun. Soc.

    (2010)
  • I. Dissanayake et al.

    Task division for team success in crowdsourcing contests: resource allocation and alignment effects

    J. Manag. Inf. Syst.

    (2015)
  • A. Doan et al.

    Crowdsourcing systems on the world-wide web

    Commun. ACM

    (2011)
  • Cited by (32)

    • Nudging and citizen science: The effectiveness of feedback in energy-demand management

      2020, Journal of Environmental Management
      Citation Excerpt :

      Recent advancements in information technology have facilitated the involvement of crowds from outside organizational boundaries for the generation and dissemination of new knowledge (Cappa et al., 2019; Franzoni and Sauermann, 2014; Majchrzak and Malhotra, 2013; Sauermann and Franzoni, 2015). When dispersed individuals are recruited in exchange of a bounty to work on new ideas for solving problems and for innovating, we speak of “crowdsourcing” (Afuah and Tucci, 2013; Lee and Seo, 2016; Natalicchio et al., 2017); but, we refer to citizen science when they are voluntarily involved in the collection or analysis of data that leads to knowledge creation (Auerbach et al., 2019; Cappa et al., 2016b; Young et al., 2019). Citizen science constitutes an increasingly popular framework for involving volunteers from the general public, who are known as “citizen scientists,” in authentic scientific projects (Dickinson et al., 2012; Groulx et al., 2017; Riesch et al., 2013; Science Europe, 2018).

    • The past, present and future of open innovation

      2020, European Journal of Innovation Management
    View all citing articles on Scopus
    View full text