CORPORA & ANALYSES OF CORPORA

Corpus of Contemporary English Corpus (COCA)

English Lexicon Project Web Site

Linguistic Data Consortium (LDC)

Center for Chinese Linguistics PKU (CCL)

Chinese Lexical Database (CLD)

Hong Kong Cantonese Corpus (HKCanCor)

Mandarin Wordlikeness Project

Traditional Chinese Psycholinguistic Database

CALLHOME Linguistic Data Consortium

语料库在线

Yao, Z., Wu, J., Zhang, Y., & Wang, Z. (2017). Norms of valence, arousal, concreteness, familiarity, imageability, and context availability for 1,100 Chinese words. Behavior research methods, 49(4), 1374-1385. 

Cai, Q., & Brysbaert, M. (2010). SUBTLEX-CH: Chinese word and character frequencies based on film subtitles. PloS one, 5(6), e10729.

Leung, M. T., & Law, S. P. (2001). HKCAC: the Hong Kong Cantonese adult language corpus. International journal of corpus linguistics, 6(2), 305-325.

Tsang, Y. K., Huang, J., Lui, M., Xue, M., Chan, Y. W. F., Wang, S., & Chen, H. C. (2018). MELD-SCH: A megastudy of lexical decision in simplified Chinese. Behavior research methods, 50(5), 1763-1777.

Wang, R., Huang, S., Zhou, Y., & Cai, Z. G. (2019). Chinese character handwriting: A large-scale behavioral study and a database. Behavior research methods, 1-15.

FOR CHILD STUDIES

Child Language Data Exchange System (CHILDES)

Chinese Early Language Acquisition (CELA)

Hong Kong Cantonese Child Language Corpus (CANCORP)

BILEX: A new tool measuring bilingual children's receptive vocabulary

FOR PSEUDO-WORDS

ARC Nonword Database

English Lexicon Project Web Site

Wuggy: A multilingual pseudoword generator

Traditional Chinese Psycholinguistic Database

METHODS FOR CHILD LANGUAGE ACQUISITION

Habit (Software for running Habituation-Switch studies)

FOR MAKING EXPERIMENTAL STIMULI

CNBC Stimulus Repository

DATA ANALYSES & VISUALIZATION

Which graph for what?

Stats & R

DataNovia

RESEARCH METHODS IN GENERAL

Research Methods Knowledge Base

RESOURCES FOR DOING ONLINE STUDIES

A summary from ICIS [PDF]

Ibex Farm (self-paced reading tasks, etc.)

jsPsych (many templates for psycholinguistic tasks)

Cognition.run (for running jsPsych experiments)

Millisecond (for running psycholinguistics experiments)

NATURAL LANGUAGE PROCESSING

NLTK (Python-based NLP toolkit)