Preprint Article, Version 1. Preserved in Portico. This version is not peer-reviewed.

Modeling Popularity and Reliability of Sources in Multilingual Wikipedia

Version 1 : Received: 31 March 2020 / Approved: 31 March 2020 / Online: 31 March 2020 (22:18:51 CEST)



A peer-reviewed article of this Preprint also exists. Lewoniewski, W.; Węcel, K.; Abramowicz, W. Modeling Popularity and Reliability of Sources in Multilingual Wikipedia. Information 2020, 11, 263.

Journal reference: Information 2020, 11

DOI: 10.3390/info11050263

Cite as: Lewoniewski, W.; Węcel, K.; Abramowicz, W. Modeling Popularity and Reliability of Sources in Multilingual Wikipedia. Information 2020, 11, 263.

Abstract

One of the most important factors affecting the quality of content in Wikipedia is the presence of credible sources. By following references, readers can verify facts or find more details about the described topic. A Wikipedia article can be edited independently in any of over 300 languages, even by anonymous users; therefore, information about the same topic may be inconsistent. This also applies to the use of references in different language versions of a particular article, so the same statement can have different sources. In this paper we analyzed over 40 million articles from the 55 most developed language versions of Wikipedia to extract information about nearly 200 million references and to find the most popular and reliable sources. We presented 10 models for assessing the popularity and reliability of the sources, based on analysis of meta information about the references in Wikipedia articles, page views, and authors of the articles. Using DBpedia and Wikidata, we automatically identified the alignment of the sources to a specific domain. Additionally, we analyzed the changes in popularity and reliability over time and identified growth leaders in each considered month. The results can be used to improve the quality of content in different language versions of Wikipedia.
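To make the idea of scoring sources from reference metadata and page views more concrete, the following is a minimal sketch only. It does not reproduce any of the 10 models from the paper, whose definitions are not given in this abstract. It assumes a hypothetical input format (a list of articles, each with a `page_views` count and a list of reference URLs) and two illustrative measures: how many articles cite a source domain, and the total page views of the articles citing it. The function names `source_domain` and `score_sources` are invented for this illustration.

```python
from collections import Counter, defaultdict
from urllib.parse import urlparse


def source_domain(url):
    """Return the lower-cased host of a reference URL without a leading 'www.'."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def score_sources(articles):
    """Aggregate two illustrative (assumed) measures per source domain.

    articles: iterable of dicts with keys
        'page_views' (int) and 'references' (list of URL strings).
    Returns (frequency, view_weighted):
        frequency[d]     = number of articles citing domain d
        view_weighted[d] = summed page views of the articles citing d
    """
    frequency = Counter()
    view_weighted = defaultdict(int)
    for article in articles:
        # Count each domain once per article, however often it is cited there.
        domains = {source_domain(u) for u in article["references"]}
        domains.discard("")  # skip URLs without a recognizable host
        for domain in domains:
            frequency[domain] += 1
            view_weighted[domain] += article["page_views"]
    return frequency, dict(view_weighted)


# Toy data, invented for illustration only.
articles = [
    {"page_views": 1000,
     "references": ["https://www.example.org/a", "https://example.org/b"]},
    {"page_views": 50,
     "references": ["http://example.com/x", "https://www.example.org/c"]},
]
frequency, view_weighted = score_sources(articles)
for domain, count in frequency.most_common():
    print(domain, count, view_weighted[domain])
```

Counting a domain once per article is a design choice of this sketch; counting every individual reference instead would favor sources that are cited repeatedly within a single article.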

Supplementary and Associated Material

http://data.lewoniewski.info/sources/: Supplementary materials

Subject Areas

Wikipedia; reference; source; reliability; popularity; Wikidata; DBpedia

Copyright: This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
