Open citations: A letter from the scientometric community to scholarly publishers

December 5th, 2017

Openness is central to the research endeavor. It is essential to promote reproducibility and appraisal of research, reduce misconduct, and ensure equitable access to and participation in science. Yet, calls for increased openness in science are often met with initial resistance. Pre-print servers, open access repositories, and open data sets were, for example, initially resisted, but eventually adopted without adverse effects on the scholarly ecosystem. The launch of the Initiative for Open Citations (I4OC) is facing similar obstacles. This initiative has campaigned for scholarly publishers to make openly available the references found in articles from their journals. Many publishers, including most of the large ones, support the initiative and have opened their references. However, the initiative still lacks support from a minority of the large publishers.

Calls for enhanced reproducibility have been heard across all fields of science. However, scientometrics is often unable to meet these standards, largely because bibliometric research depends on proprietary data sources. The ability to undertake large-scale and generalizable bibliometric research, both basic and applied, is limited to a few well-funded centers that can afford to pay for full access to the raw data of Web of Science or Scopus. The remaining bulk of bibliometric research is restricted to the analysis of small data sets or the use of freely available data sources such as PubMed, Google Scholar, and Microsoft Academic. Although these freely available data sources are valuable, they suffer from shortcomings, such as incomplete coverage, data quality problems, lack of transparency, or limited large-scale accessibility. To conduct rigorous analyses, scientometricians need a data source that is freely available and comprehensive. This is a matter of scientific integrity, scientific progress, and equity: we must ensure that all members of the scientometric community are able to participate in and validate the research in the field. I4OC is striving to create such an opportunity.

I4OC requests that all scholarly publishers make references openly available by providing access to the reference lists they submit to Crossref. At present, most of the large publishers—including the American Physical Society, Cambridge University Press, PLOS, SAGE, Springer Nature, and Wiley—have opened their reference lists. As a result, half of the references deposited in Crossref are now freely available. We urge all publishers who have not yet opened their reference lists to do so now. This includes the American Chemical Society, Elsevier, IEEE, and Wolters Kluwer Health. By far the largest number of closed references can be found in journals published by Elsevier: of the approximately half a billion closed references stored in Crossref, 65% are from Elsevier journals. Opening these references would place the proportion of open references at nearly 83%.
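
The projected figure follows from the shares quoted above, and the arithmetic can be checked with a short sketch. It assumes that "half" means an open share of exactly 0.50 and that the 65% applies to the closed remainder; the function name and parameters are illustrative and do not come from I4OC or Crossref.

```python
def open_share_after(open_share: float, publisher_share_of_closed: float) -> float:
    """Projected open share if one publisher opens all of its closed references.

    open_share: fraction of deposited references that are currently open
    publisher_share_of_closed: that publisher's fraction of the closed references
    """
    closed_share = 1.0 - open_share
    return open_share + closed_share * publisher_share_of_closed


# Assumed inputs taken from the letter: half open, 65% of closed references from one publisher.
projected = open_share_after(0.50, 0.65)
print(f"{projected:.1%}")
```

Under these assumptions the open share rises from 50% to 82.5%, which matches the "nearly 83%" stated in the letter.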

Open availability of citation data is important not only for the scientometric community, but also for science at large. Scientometrics is widely used to support science policy and research evaluation, with consequences for the entire scientific community. There is a need for specialized organizations, both commercial and non-commercial, that offer scientometric services. However, in order to guarantee full transparency and reproducibility of scientometric analyses, these analyses need to be based on open data sources. Analyses based on proprietary data sources have limited transparency and tend to be difficult to reproduce. Yet, as long as half of all references are missing in open data sources such as Crossref, analyses based on these data sources do not offer a viable alternative. With such a large proportion of missing references, these data sources provide an incomplete portrait of the scholarly landscape, which can lead to negative effects, such as policies that ignore certain areas of research or certain countries.

Scientometricians have a professional obligation to promote sound practices in our field. In the current environment, advocating for open references is critical to ensure replicable and equitable research practices. We should use our relationships with journals—as authors, reviewers, and editorial board members—to advocate for openness and should expect scientometric journals to be leaders in this respect. References are a product of scholarly work and represent the backbone of science—demonstrating the origin and advancement of knowledge—and provide essential information for studying science and making decisions about the future of research. References are generated by the academic community and should be freely available to this community. We therefore issue a strong call to all publishers to make available to the academic community that which it created in the first place. To those publishers that have not already responded to the Initiative for Open Citations, our plea is: open citations now!

Original signatories

Cassidy R. Sugimoto
Indiana University Bloomington; President of the International Society for Scientometrics and Informetrics

Ludo Waltman
CWTS, Leiden University; Editor-in-Chief of Journal of Informetrics

Vincent Larivière
Université de Montréal; Observatoire des Sciences et des Technologies; Associate Editor of Journal of Informetrics

Nees Jan van Eck
CWTS, Leiden University

Kevin W. Boyack
SciTech Strategies

Paul Wouters
CWTS, Leiden University

Sarah de Rijcke
CWTS, Leiden University

