Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations. / Frandsen, Tove Faber; Nicolaisen, Jeppe.

I: Libellarium: journal for the research of writing, books, and cultural heritage institutions, Bind 9, Nr. 2, 2017, s. 81-94.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Frandsen, TF & Nicolaisen, J 2017, 'Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations', Libellarium: journal for the research of writing, books, and cultural heritage institutions, bind 9, nr. 2, s. 81-94. https://doi.org/10.15291/libellarium.v9i2.253

APA

Frandsen, T. F., & Nicolaisen, J. (2017). Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations. Libellarium: journal for the research of writing, books, and cultural heritage institutions, 9(2), 81-94. https://doi.org/10.15291/libellarium.v9i2.253

Vancouver

Frandsen TF, Nicolaisen J. Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations. Libellarium: journal for the research of writing, books, and cultural heritage institutions. 2017;9(2):81-94. https://doi.org/10.15291/libellarium.v9i2.253

Author

Frandsen, Tove Faber ; Nicolaisen, Jeppe. / Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations. I: Libellarium: journal for the research of writing, books, and cultural heritage institutions. 2017 ; Bind 9, Nr. 2. s. 81-94.

Bibtex

@article{1e982a982d284af7b1af2c7238d4bf5a,
title = "Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations",
abstract = "Using statistical methods to analyse digital material for patterns makes it possible to detect patterns in big data that we would otherwise not be able to detect. This paper seeks to exemplify this fact by statistically analysing a large corpus of references in systematic reviews. The aim of the analysis is to study the phenomenon of non-citation: Situations where just one (or some) document(s) are cited from a pool of otherwise equally citable documents. The study is based on more than 120,000 cited studies, and a total number of non-cited studies of more than 1.6 million. The number of cited studies is found to be much smaller than the number of non-cited. Also, the cited and non-cited studies are found to differ in age. Very recent studies tend to be non-cited whereas the cited studies are rarely of recent age (e.g. within the same year). The greatest differences are found within the first 10 years. After 10 years the cited and non-cited studies tend to be more similar in terms of age. Separating the data set into different sub-disciplines reveals that the sub-disciplines vary in terms of age of cited vs. non-cited references. Some fields may be expanding and the number of published studies is thus growing. Consequently, cited and non-cited studies tend to be younger. Other fields may be more slowly progressing fields that use a greater proportion of the older literature within the field. These field differences manifest themselves in the average age of references.",
author = "Frandsen, {Tove Faber} and Jeppe Nicolaisen",
year = "2017",
doi = "10.15291/libellarium.v9i2.253",
language = "English",
volume = "9",
pages = "81--94",
journal = "Libellarium: journal for the research of writing, books, and cultural heritage institutions",
issn = "1846-8527",
publisher = "University of Zadar",
number = "2",

}

RIS

TY - JOUR

T1 - Statistical analyses of digital collections: Using a large corpus of systematic reviews to study non-citations

AU - Frandsen, Tove Faber

AU - Nicolaisen, Jeppe

PY - 2017

Y1 - 2017

N2 - Using statistical methods to analyse digital material for patterns makes it possible to detect patterns in big data that we would otherwise not be able to detect. This paper seeks to exemplify this fact by statistically analysing a large corpus of references in systematic reviews. The aim of the analysis is to study the phenomenon of non-citation: Situations where just one (or some) document(s) are cited from a pool of otherwise equally citable documents. The study is based on more than 120,000 cited studies, and a total number of non-cited studies of more than 1.6 million. The number of cited studies is found to be much smaller than the number of non-cited. Also, the cited and non-cited studies are found to differ in age. Very recent studies tend to be non-cited whereas the cited studies are rarely of recent age (e.g. within the same year). The greatest differences are found within the first 10 years. After 10 years the cited and non-cited studies tend to be more similar in terms of age. Separating the data set into different sub-disciplines reveals that the sub-disciplines vary in terms of age of cited vs. non-cited references. Some fields may be expanding and the number of published studies is thus growing. Consequently, cited and non-cited studies tend to be younger. Other fields may be more slowly progressing fields that use a greater proportion of the older literature within the field. These field differences manifest themselves in the average age of references.

AB - Using statistical methods to analyse digital material for patterns makes it possible to detect patterns in big data that we would otherwise not be able to detect. This paper seeks to exemplify this fact by statistically analysing a large corpus of references in systematic reviews. The aim of the analysis is to study the phenomenon of non-citation: Situations where just one (or some) document(s) are cited from a pool of otherwise equally citable documents. The study is based on more than 120,000 cited studies, and a total number of non-cited studies of more than 1.6 million. The number of cited studies is found to be much smaller than the number of non-cited. Also, the cited and non-cited studies are found to differ in age. Very recent studies tend to be non-cited whereas the cited studies are rarely of recent age (e.g. within the same year). The greatest differences are found within the first 10 years. After 10 years the cited and non-cited studies tend to be more similar in terms of age. Separating the data set into different sub-disciplines reveals that the sub-disciplines vary in terms of age of cited vs. non-cited references. Some fields may be expanding and the number of published studies is thus growing. Consequently, cited and non-cited studies tend to be younger. Other fields may be more slowly progressing fields that use a greater proportion of the older literature within the field. These field differences manifest themselves in the average age of references.

UR - http://ozk.unizd.hr/lida/proceedings/

U2 - 10.15291/libellarium.v9i2.253

DO - 10.15291/libellarium.v9i2.253

M3 - Journal article

VL - 9

SP - 81

EP - 94

JO - Libellarium: journal for the research of writing, books, and cultural heritage institutions

JF - Libellarium: journal for the research of writing, books, and cultural heritage institutions

SN - 1846-8527

IS - 2

ER -

ID: 182194470