This document provides a guide to downloading information about authors and academic articles from three major databases: Scopus, OpenAlex and Web of Science (WoS), using the programming software R. We start with guidelines (section 2) on how to install R and Rstudio on your computer. In section 3, we provide details of how to obtain API keys for each database. In section 4, we provide a list of codes in R that extract information from Scopus about published articles, authors and their affiliations. In section 5 the same for OpenAlex and in section 6 for WoS. In the last section we provide a comparison between the three databases with examples.
We examine the evolution of order of authorship based on seniority during 1975–2021. Results show that for small teams (≤ 5 authors), the likelihood of placing the most junior author first has been increasing since the nineties. Additionally, the likelihood of placing the most senior author in last place has also been increasing. The results are at least partially driven by digitization of bibliographic records that drastically facilitated assignment of citations to all authors. We interpret our findings as a growing trend of small author teams becoming fairer. We do not find any significant effects for larger teams suggesting different practices when team size increases. Given that team size is, slowly but steadily, increasing over the last decades, the debate over the ethical considerations around authorship practices should place significance on the number of co-authors.
https://link.springer.com/article/10.1007/s11192-024-05124-x
Reference:
Drivas, K. (2024). The evolution of order of authorship based on researchers’ age. Scientometrics, 1-19. https://doi.org/10.1007/s11192-024-05124-x
We adopt the principle of relatedness, as that modified to frame science-technology linkages, to examine whether universities file for patent applications in fields that exhibit related scientific specializations. We perform the analysis to approximately a hundred US universities by documenting their publication and patent application profiles during 2001-2015. We find a strong relationship between the university’s science base and the propensity to file for patent applications in related technological fields. More importantly, we find that patent applications related to the university’s scientific capabilities to experience a higher probability of securing the grant. A mechanism of early technology transfer is discussed and corroborated. Our findings point to the need for managers to document and monitor the university’s science base as a complementary tool to evaluate invention disclosures.
https://link.springer.com/article/10.1007/s13132-024-02402-0#citeas
Reference:
Anagnosti, A., Drivas, K. The Relation Between University Science Base and Follow-Up Patent Activity. J Knowl Econ (2024). https://doi.org/10.1007/s13132-024-02402-0
After the launch of Google Scholar older papers experienced an increase in their citations, a finding consistent with a reduction in search costs and introduction of ranking algorithms. I employ this observation to examine how recombination of science takes place in the era of online search platforms. The findings show that as papers become more discoverable, their knowledge is diffused beyond their own broad field. Results are mixed when examining knowledge diffusion within the same field. The results contribute to the ongoing debate of narrowing of science. While there might be a general reduction in recombination of knowledge across distant fields over the last decades, online search platforms are not the culprits.
https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24959
Reference:
Drivas, K. (2024). The role of online search platforms in scientific diffusion. Journal of the Association for Information Science and Technology, 1–24. https://doi.org/10.1002/asi.24959