library(bibliometrix)
biblioshiny()
Reference: Caputo and Kargina, 2021
Part 1: the entry type
@book{ ... }
Part 2: the citekey
@book{MyUniqueCitekey, ... }
Part 3: a list of key-value pairs storing the bibliographic data
@book{MyUniqueCitekey,
title = "Title of the book", ... }
Note: BibTex files are case insensitive
"Over the years, Scopus has earned its equal place as a comprehensive bibliographic data source and it has proven itself to be reliable and, in some respects, even better than WoS " Pranckute, 2021
Export each dataset from each in ".bib" format
WOS: select export >> Other File Formats >> Record Content: "Full Record and Cited References" >> File Format: BibTex , 500 records at a time,
Scopus: select All Results >> select BibTex Export. I had to download as excel to get the works cited list which was not available in BibTex format. Changing to the old Scopus view did allow this but it (see email from Michael Baker, Librarian UniSA, "Issue with exporting documents as BibTeX [240605-012500]") BUT it did not allow me to download the entire search list.
Load Biblioshiny and run with this text:
library(bibliometrix)
biblioshiny()
RStudio Version: 2024.04.1+748 | Released: 2024-05-11 (downloaded 4/6/2024)
Repeat this for both Scopus and WoS data: >> In Biblioshiny: Select Import raw file(s) >> Import the .bib file(s) >> select Save as Excel
Rename exported files with WOS.xlsx and Scopus.xlsx
Note: It appears that BiblioShiny only retains the first author of each cited reference (therefore, co-authors are not included)
Merge the two datasets manually from Excel
Use Excel to open both WOS.xlsx and Scopus.xlsx files: Arrange windows in horizontal above/below arrangement >> select one file as master file (WoS) >> Both files will have the same tag fields in row 1, but the information will be arranged on different columns. (some less relevant information may not be present in one or the other, eg. URL is not present in WoS)
Simplify one of the files to be used as the master: >> remove coloumns not required in the analysis >> rearrange the colomns in the second xlsx file to match the master file
Copy one file to the other manually
Remove duplicates using "Remove Duplicates" function under the data menu. The duplicate criteria set to author, year, and title .
See these steps for data cleaning
Use conditional formating
Then use Macros
Scopus (.bib), WoS (.bib), Goggle Scholar (excel via Publish or Perish), IEEE Explore (.bib), ACM (.bib), Science Direct (.bib), Pubmed (.txt).
Keywords need to be separated by ';' (instead of ','). Rename tags according to guidelines provided by Biblioshiny. -- See FAQ -- : "Bibliometrix/biblioshiny need a bibtex file formatted exactly as exported by WoS or Scopus and with the full set of mandatory metadata (authors' name, affiliations, references, etc.). If not, it will not work." "WoS metadata is characterized by a higher quality than Scopus, and, if it is possible, I prefer to use WoS. On the other hand, Scopus is a better choice when your analysis regards Arts & Humanities. In this case, Scopus DB is extremely richer than WoS."
Import data, one by one, into Biblioshiny, then export to Excel
Open a new document
Click on "data" and "Get data" and then "Launch Power Query Editor".
On the top right, click on "New Source" >> file >> Excel Workbooks >> Open the first file.
Import all remaining files similarly.
Select one of the files, >> click Append Queries >> Appnd Queries as new,
In the new window, select "Three or more tables and add all the sheets to the "Tables to Append" window (Hold ctrl to select more than one sheet while adding) >> click OK.
This will append all the sheets together and give them a name such as "Append". Click on "Close and Load", which will load the data from all sheets to a new sheet.
Save the data in .xlsx format.
Remove unwanted articles (using Conditional Formatting and Macro)
Next:
Remove any spaces in the merged file in the Author "AU" column
Replace empty values for "SR" and "SR full" columns as Nil for GS Data.
PY: don't need to sort by PY to remove old articles
Remove duplicates: "Title TI AND author AU AND year PY" : "Remove Duplicates"
Check for duplicate removals: Home tab >> Conditional Formatting >> Highlight Cell rules >> Duplicate values used. (This will highlight duplicates not deleted by the merge above).
Save the cleaned data in .xlsx format.
Download bibliographic data from each database. This may need to be carried out in batches (eg. by each year in Scopus which limits 2000 papers at one time)
For *.bib-files, import into Texmaker to detect duplicates.
BibTeX (*.bib-files) for BiblioShiny
*.ris-files for Endnote
NB: Need to create a new blank EndNote library. Do not sync that library, as it will merge with all the same libraries under the same email account. Make sure to disable the sync feature before creating a new EndNote library. Open EndNote > Edit > Preferences > Sync > untick Sync Automatically.
Scopus
Data format: .bib
WoS
Data format: .bib
Google Scholar: possible exclusion due to incompleteness of extracted data.
Use "Publish or Perish" to extract data to Excel, then formated to be used for Biblioshiny
Publish or Perish allows a maximum of 1000 downloads at one time
Data will not likely contain the necessary information for citation, co-citation, and most popular author analysis
IEEE Explore
Data format: .bib
ACM
Data format: .bib
Only 50 papers can be displayed and downloaded at one time, and will therefore require merging after download
Science Direct
Data format: .bib
Pubmed
Data format: .txt
Final Format: .xls
Year; Author Name
Subject Area
Document Type
Publication Stage
Source Title
Keyword
Affiliation
Funding Sponsor
Country
Source Type (Journal, book, conference paper, etc)
Language
Not all of these fields will be available from each source however here is a range of possible fields
Authors - use full names to facilitate disambiguation; Last Name/Surname, First Name/Forename; Middle Name.
Title of source
Publication
Year
Subject Headings
Aims/purpose
Abstract
Methods
Methodology
Date data collected
Definition(s) of presence
Geographical location
Institution
Academic Discipline (include sub disciplines: eg. Health -- nursing -- mental health)
Humanities (Performing arts; Visual arts; History; Languages; Literature; Law; Philosophy)
Social sciences (Anthropology; Archaeology; Economics; Geography; Linguistics; Political Science; Psychology; Sociology)
Natural sciences (Biology; Chemistry; Earth science; Astronomy; Physics)
Formal science (Computer science; Mathematics)
Applied science (Agriculture; Architecture; Design; Business; Engineering and technology; Environmental studies and forestry; Family and consumer science; Human physical performance and recreation; Journalism; Media studies; Communication; Law; Library studies; Medicine; Health science; Military sciences; Public administration; Public policy; Social work; Transportation)
Type of evidence source
Primary research
Peer-reviewed
Epidemiology for population-level datasets
Evidence syntheses: narrative reviews, systematic reviews, scoping reviews, rapid reviews, bibliometric reviews.
Conference abstracts
Discussion articles
Editorials
Theses
Presence measurement method(s)
Subjective:
Questionnaire; Which questionnaire was used
Interview
Physiological
Heart Rate
Skin Conductance
Eye-gaze
EEG
Behavioural
Posture
Gestures
Locomotion
Task performance
Other measured parameters
Engagement
Learning
Task performance
Quantitative/Qualitative
Virtual Reality technology specifications
Head Mounted Display: tethered or stand alone
PC configuration
2D user interface Virtual Reality
Type of Virtual Reality Experience
Simulation
Social
Interactive
Video: 360º, 180º, 3D
Game
Results/Key findings
Impact of sense of presence on measured outcomes