Artificial Intelligence

Background

Artificial intelligence (AI) refers to computer systems designed to perform tasks that typically require human intelligence, such as recognizing images, understanding language, making decisions, and generating text or other content. Although the field dates to the 1950s, AI has experienced rapid growth in recent years, driven by advances in machine learning — particularly deep learning — and the availability of large-scale datasets and computing power (LeCun et al., 2015). Today, AI systems are embedded in a wide range of technologies, from medical diagnosis and language translation to recommendation systems and autonomous vehicles.

Research on AI spans many disciplines, including computer science, cognitive science, economics, sociology, law, and ethics. Researchers study not only how AI systems work technically, but also how they are adopted across different industries, how they affect employment, whether they replicate or amplify human biases, and what their long-term societal implications may be. The rapid pace of AI development has made open data particularly important: shared benchmarks, datasets, and research outputs allow independent researchers to evaluate AI systems and hold developers accountable (Jobin et al., 2019; Floridi et al., 2020).

References:

Floridi, L., Cowls, J., King, T. C., & Taddeo, M. (2020). How to design AI for social good: Seven essential factors. Science and Engineering Ethics, 26(3), 1771–1796. https://doi.org/10.1007/s11948-020-00213-5

Jobin, A., Ienca, M., & Vayena, E. (2019). The global landscape of AI ethics guidelines. Nature Machine Intelligence, 1(9), 389–399. https://doi.org/10.1038/s42256-019-0088-2

LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. https://doi.org/10.1038/nature14539


Data Sources

AI Research and Publication Data

1. Papers With Code https://paperswithcode.com/

2. Semantic Scholar https://www.semanticscholar.org/

3. arXiv (cs.AI and cs.LG sections) https://arxiv.org/list/cs.AI/recent https://arxiv.org/list/cs.LG/recent


AI Use, Adoption, and Industry Trends

4. Stanford HAI – AI Index Report https://aiindex.stanford.edu/report/

5. Our World in Data – Artificial Intelligence https://ourworldindata.org/artificial-intelligence

6. OECD.AI Policy Observatory https://oecd.ai/en/data

AI and Society: Employment, Bias, and Ethics

7. AI Incident Database https://incidentdatabase.ai/

8. Gender Shades / Algorithmic Bias Datasets https://www.media.mit.edu/projects/gender-shades/overview/

9. World Economic Forum – Future of Jobs Report Data https://www.weforum.org/publications/the-future-of-jobs-report-2025/

AI in Japan and Asia

10. Ministry of Economy, Trade and Industry (METI) – AI Research and Policy Datahttps://www.meti.go.jp/english/policy/mono_info_service/joho/index.html

11. National Institute of Informatics (NII) – Research Data https://www.nii.ac.jp/en/

Benchmark and Model Evaluation Data

12. Hugging Face Datasets https://huggingface.co/datasets

13. UCI Machine Learning Repository https://archive.ics.uci.edu/


Example Research Questions

To answer some of these questions, you might need to combine AI datasets with other data sources (e.g., country economic data, population figures, employment statistics, or education data).


Tips for Using AI Data

Getting Started:

Understanding the Data:

Data Quality Considerations:

Making Comparisons:

Combining Datasets: AI research often benefits from combining multiple sources:


Useful Additional Data Sources

When studying AI topics, you may also want to use:


Questions? Need Help?