Social Media Data

Social media data mining collects and analyzes unstructured information (such as posts, comments, tweets, shares, likes, mentions, and clicks) shared on networks like Facebook and Twitter. This kind of information provides a rich source of text data for academic research in a variety of creative ways:



The Springer's article "Social Media Analytics: A Survey of Techniques, Tools, and Platforms" provides an easy-to-read overview and essential foundation knowledge of social media data mining.

According to another article "Collection Social Media Data for Research", there are three major ways to collect social media data:



This guide will briefly discuss the second and third ways.

Use Web Services

Increasing number of web services are available on the market to help you collect and analyze social media data. Some services are developed with researchers as one of their primarily customers, including Volunteer Science and uMaxData. Some are developed for more general purposes but also commonly used by academia, such as ExportData and WebTunix.


Most web services can provide custom-made mining solutions to meet the specific requirements of sophisticated research projects, but they cost money. As an alternative option, the Library has subscribed to a web service plan (with specified data coverage) for the use of the HKBU community.


UMiner    INSTALLED IN DAR OF MLC


The tool has been installed in Data Analysis Room. Please book a workstation in advance. Every user can create a personalized account so that previous search queries, search results and analysis can be saved and downloaded.

Use Coding

If you wish to start with coding, you will need to know APIs, R, Python, etc. to do data collection. Please refer to the following resources to learn relevant techniques:


R    FREE

https://www.r-project.org/


Python    FREE

https://www.python.org/

Miscellaneous Tools

There are other social media research tools that can be used to collect (and/or analyze) social media data. Some tools are free to use and require little or no programming. Some tools are costly or coding intensive. The following lists provide an overview.