Bilingualism and Psycholinguistics Research Laboratory

Leap-questionnaire.

Language Experience and Proficiency Questionnaire (LEAP-Q) [doc] , by Marian, Blumenfeld, & Kaushanskaya (2007), published in the Journal of Speech Language and Hearing Research, 50 (4), 940-967 . [pdf]

The LEAP-Q Language Experience and proficiency Questionnaire: Ten years later. [pdf] , by Marian, Blumenfeld, & Kaushanskaya (2020), published in Bilingualism: Language and Cognition, 23 (5), 945-950 .

LEAP-Q SUPPLEMENT , explaining how to export data from the filled out questionnaire.  [pdf]

LEAP-Q SUPPLEMENT , for versions of Microsoft Word prior to 2013 (PC) or 2011 (Mac). [pdf]

LEAP-Q with  non-identifying codes  instead of participant’s name, for use with up to five languages.  [doc]

LEAP-Q electronic version for use with trilinguals .  [doc]

A Gorilla version of the LEAP-Q is now publicly available:

  • Gorilla version of the LEAP-Q (adapted by Anne Neveu, University of California San Diego)

A Qualtrics version of the LEAP-Q is now available for online use as researchers move to online data collection during the COVID pandemic:

  • LEAP-Q Qualtrics Survey [qsf] (adapted by Sayuri Hayakawa, Northwestern University)
  • How to Import Qualtrics Survey   [pdf]

THE LEAP-QUESTIONNAIRE IN OTHER LANGUAGES AND DIALECTS:

Dialect Experience and Proficiency Questionnaire (DEAP-Q)   [doc] (adapted by Kaniya Hester, Ryan Wagner, Sayuri Hayakawa, and Viorica Marian, Northwestern University)

Arabic (العربية)   [doc] (translated in collaboration with University of Maryland’s Center for Advanced Study of Language, CETRA Language Solutions, and Faiza Sultan, Translation4all, Inc.)

Catalan   [doc] (translated by Eloi Puig Mayenco and Susagna Tubau Muntañá, Universitat Autònoma de Barcelona)

  • Czech [doc] with explanatory notes and the name and date-of-birth fields replaced with a participant ID field (adapted by Barbora Keřková, Monika Večeřová & Karolína Knížková from the Czech National Institute of Mental Health)
  • Dutch for Belgium [doc]  (adapted by Freya De Keyser, Ghent University, and Marilyn Hall, Northwestern University)
  • Dutch for Netherlands three-language version   [doc] (translated by Lisa Vandeberg, Erasmus University, adapted by Sarah von Grebmer zu Wolfsthurn, Niels Schiller, and Leticia Pablos Robles, Leiden University)
  • English pencil-and-paper [pdf] (adapted by Marilyn Hall, Northwestern University)
  • English pencil-and-paper [pdf]  with non-identifying codes instead of name (adapted by Marilyn Hall, Northwestern University)
  • English with first name, birth date, and date of immigration omitted   [doc]  (adapted by Marilyn Hall, Northwestern University).
  • English for Australia   [doc]  (adapted by Agens Au, James Cook University, & Marilyn Hall, Northwestern University)
  • English for Australia pencil-and-paper   [pdf]  (adapted by Agens Au, James Cook University, & Marilyn Hall, Northwestern University)
  • English for Ireland [doc] (adapted by Síle Maguire Conneely and John Walsh, Ollscoil na Gaillimhe, University of Galway)
  • English for Ireland, 2022 version, includes non-binary gender option and social media/internet contexts [doc] (adapted by Síle Maguire Conneely and John Walsh, Ollscoil na Gaillimhe, University of Galway)
  • English for Singapore   [doc]  (five language version, adapted by Marilyn Hall, Northwestern University)
  • English for Switzerland   [doc] (adapted by Mehdi Purmohammad, Universität Bern, & Max R. Freeman, Northwestern University)
  • Canadian English   [doc] (translated by Zvaigne, Salem, Groleau, & Milette, McGill University)
  • Canadian English  [google doc] (adapted for online use via google forms by Marcel Farres Franch, McGill University)
  • Canadian English pencil-and-paper  [pdf]  (adapted by Marilyn Hall, Northwestern University)
  • LEAP-Q electronic version adapted for collecting child data from parents [doc]
  • Child LEAP-Q in English, paper-and-pencil version   [pdf]
  • Abbreviated version of the Child LEAP-Q in English [doc] (modified by Malgorzata Gosia Bielecka, Barry University)

Farsi (فارسی)   [doc]  (translated by Mahrokh Alamzadeh, Ali Ghanaie, & Shahla Sharifi, Mashhad Ferdowsi University)

Filipino/Tagalog [doc] (translated by Maria Khristina Manueli, Theologische Hochschule Reutlingen)

Finnish/Suomalainen  [doc] (translated by Ari Huta and Mika Läheenmäki, University of Jyväskylä)

  • French, non-Canadian   [doc]  (prepared by Anjali Bhatara, Emilie Michaud, and Judith Gervain, Universite Paris Descartes and CNRS)
  • French five-language version [doc] (adapted by Marilyn Hall, Northwestern University)
  • French pencil-and-paper [pdf]  (adapted by Marilyn Hall, Northwestern University)
  • French for Belgium [doc] (adapted by Arnaud Szmalec, Université Catholique de Louvain)
  • French Canadian (Français Canadien) [doc] (translated by Zvaigne, Salem, Groleau, & Milette, McGill University)
  • French Canadian pencil-and-paper [pdf] (adapted by Marilyn Hall, Northwestern University)
  • French Canadian pencil-and-paper [pdf] with different cultural examples (adapted by Marilyn Hall, Northwestern University)
  • French for Belgium for collecting child data from parents [doc] (adapted by Chloé Parmentier, Institut de recherche en sciences psychologiques)

Gaelic (Gàidhlig)   [doc] (translated by Euan Dickson and Maria Garraffa, Heriot-Watt University)

  • German three language version  [doc]  (translated by Larissa Weigel and Monica Gonzalez-Marquez, Cornell University, adapted by Sarah von Grebmer zu Wolfsthurn, Niels Schiller, and Leticia Pablos Robles, Leiden University)
  • German five language version  [doc]  (translated by Larissa Weigel and Monica Gonzalez-Marquez, Cornell University, adapted by Marilyn Hall, Northwestern University)
  • German Adolescent pencil-and-paper  [pdf]  (adapted by Carmen Vidal Noguera, Universidad Antonio de Nebrija)
  • German Children pencil-and-paper  [pdf]  (adapted by Helen Engemann, University of Mannheim)

Greek   [doc] (translated by Marielena Soilemezidi, National and Kapodistrian University of Athens & University of Oslo, and Valantis Fyndanis, Cyprus University of Technology & University of Oslo)

Hebrew (עִבְרִית)   [doc] (translated by Anat Prior, University of Haifa)

Hindi   [doc] (translated and adapted by Apoorva Pauranik and Bhavesh Kant Chaudha, Pauranik Academy of Medical Education and M.G.M. Medical College, Indore, India)

Hungarian ( Magyar) [doc] (translated by Timea Kutasi, University of Edinburgh, UK, & Zsuzsanna Maté)

  • Italian three language version   [doc] (translated by Luca Bevacqua and Roberta Spelorzi, University of Edinburgh, adapted by Sarah von Grebmer zu Wolfsthurn, Niels Schiller, and Leticia Pablos Robles, Leiden University)
  • Italian five language version   [doc] (translated by Luca Bevacqua and Roberta Spelorzi, University of Edinburgh, adapted by Sarah von Grebmer zu Wolfsthurn, Niels Schiller, and Leticia Pablos Robles, Leiden University)
  • Italian Child pencil-and-paper   [pdf] (translated and adapted by Anna Dürbeck and Helen Engemann, University of Mannheim)
  • Irish 2022 version, includes non-binary gender option and social media/internet contexts  [doc] (translated and adapted by Síle Maguire Conneely and John Walsh, Ollscoil na Gaillimhe, University of Galway)

Japanese (日本語)   [doc] (translated by Koji Miwa and Yoshino Okuma, University of Alberta, & Yu Ikemoto, Kwansei Gakuin University)

Kannada [doc] (translated by Srirangam Vijayakumar Narasimhan, Kashyap Sahana, Govinda Yashaswini, and Dhanashekar Divyashree, JSS Institute of Speech and Hearing). For the JSLHR article describing the cross-cultural adaptation and validation of the LEAP-Q into Kannada, click here .

Korean (한국말) [doc] (translated by Ju Young Min and Luca Onnis, University of Hawaii)

Latvian (Latviešu ) [doc] (translated by Toms Voits and Līva Kukle, University of Latvia)  

Malay (Melayu) [doc] (translated by Tze Peng Wong, University of Nottingham Malaysia, and Rosyati M. Yaakub)

Malayalam [doc] (translated by Haripriya T, Hiba Sherin, and Abhishek B P, All India Institute of Speech and Hearing, India)

  • Mandarin pencil-and-paper [pdf] (adapted by Marilyn Hall and Xin Wang, Northwestern University)
  • Taiwanese Mandarin [doc] (adapted by Yuru Shen and Yawei Cheng, National Yang Ming Chiao Tung University, Taiwan)

Nórth Sami [doc] (translated by Lisa Baa, Emma Boniface Juvik, Franziska Köder, and Maria Garraffa, Sámi University of Applied Sciences, Heriot-Watt University Edinburgh, Center for Multilingualism in Society across the Lifespan – University of Oslo)

Norwegian (Bokmål)   [doc] (translated by Emma Boniface Juvik, Maria Garraffa, and Franziska Köder, Heriot-Watt University Edinburgh, Center for Multilingualism in Society across the Lifespan – University of Oslo)

Polish ( Polski) [doc] (translated by Zuzanna Fuchs, University of Iowa and Ewa Brzezińska)

Portuguese (Português )   [doc] (translated by Ana Paula Scholl and Ana Beatriz Areas da Luz Fontes, Federal University of Rio Grande do Sul)

Romanian ( România)   [doc] (translated by Ioana Tufar and Rebeca Ciupe, Babes-Bolyai University)

  • Russian pencil-and-paper [pdf] (translated by Marina Belkina, Katy Borodkin, Olga Iukalo, and Mira Goral, Lehman College, City University of New York, adapted by Marilyn Hall, Northwestern University)
  • SoSci version in English and Serbian  [XML] (translated by Dušica Filipović Đurđević, Sara Anđelić, and Ksenija Mišić, University of Belgrade)

Slovenian   [doc] (translated by Julija Kos, adapted by Christina Manouilidou & Katarina Marjanovič, back-translated by Monica Kavalir, University of Ljubljana)

  • Spanish in United States (Español en Estados Unidos)   [doc] (translated by Rojas & Iglesias, Temple University)
  • Spanish in United States   [doc]  with first name, birth date, and date of immigration omitted (adapted by Cherri Brown, Walden University)
  • Spanish in United States pencil-and-paper [pdf] (adapted by Marilyn Hall, Northwestern University)
  • Spanish for Mexico   [doc]  (adapted by Alma Luz Rodríguez Lázaro, Natalia Arias-Trejo, and Alina Signoret Dorcasberro, Universidad Nacional Autónoma de México)
  • Spanish for Mexico   [doc]   with first name, birth date, and date of immigration omitted (adapted by Alma Luz Rodríguez Lázaro, Natalia Arias-Trejo, and Alina Signoret Dorcasberro, Universidad Nacional Autónoma de México; and by Cherri Brown, Walden University)
  • Peninsular Spanish   [doc]  (adapted by Eloi Puig Mayenco, Universitat Autònoma de Barcelona, and Jason Rothman and Jorge González Alonso, University of Reading & the Arctic University of Norway)
  • Peninsular Spanish Adolescent pencil-and-paper   [pdf]  (adapted by Carmen Vidal Noguera, Universidad Antonio de Nebrija)
  • Spanish Child pencil-and-paper [doc] (translated and adapted by Genesis Arizmendi, University of Arizona)
  • Swedish with non-identifying codes [doc] (translated by Claudia Sjöström and Anna Hjortdal, Lund University)
  • Thai pencil-and-paper [pdf] (translated and adapted by Sirada Rochanavibhata, Northwestern University)
  • Thai Child pencil-and-paper [pdf] (translated and adapted by Sirada Rochanavibhata, Northwestern University)
  • Turkish pencil-and-paper [pdf] (corrected translation – adapted by İlmiye Seçer & Tutku Öztel, Koç University, Fuat Balcı, University of Manitoba, and Aslı Aktan Erciyes, Kadir Has University)

Urdu (اردو)   [doc] (translated by Insiya Bhalloo and Ismat Fatemah Somani, Rehabilitation Sciences Institute, University of Toronto, Child and Adolescent Development Programme [CADP], Hussaini Foundation, Karachi, Pakistan)

  • Vietnamese five language version   [doc]  (translated by Hien Pham, University of Alberta, Canada, & Quyen Nguyen, Vietnam National University, Hanoi)
  • Welsh Child pencil-and-paper [doc] (translated by Rebecca Ward and Sarah Gray, Swansea University)

The LEAP-Questionnaire is now available in 40 languages. Is t he language you need not listed? Email   [email protected] .

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

The Language Experience and Proficiency Questionnaire (LEAP-Q): Ten years later

Affiliations.

  • 1 University of Wisconsin-Madison.
  • 2 San Diego State University.
  • 3 Northwestern University.
  • PMID: 33628083
  • PMCID: PMC7899192
  • DOI: 10.1017/s1366728919000038

The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format. The LEAP-Q is used by researchers across various disciplines (Psychology, Neuroscience, Linguistics, Education, Communication Sciences & Disorders, etc.) to provide a comprehensive description of their bilingual participants, to substantiate a division of bilinguals into groups (e.g., early vs. late bilinguals), and to screen participants for adequate or threshold levels of language proficiency. Best practices for using the LEAP-Q include administration of the full questionnaire, consideration of acquisition and history of language use together with self-ratings of proficiency, and supplementation of self-reported data with objective language measures whenever possible. The LEAP-Q can be downloaded at no cost at https://bilingualism.northwestern.edu/leapq/.

Keywords: LEAP-Q; questionnaire; self-report; validation.

PubMed Disclaimer

The Language Experience and Proficiency…

The Language Experience and Proficiency questionnaire (LEAP-Q) is a self-report measure that can…

Similar articles

  • The Language Experience and Proficiency Questionnaire (LEAP-Q): assessing language profiles in bilinguals and multilinguals. Marian V, Blumenfeld HK, Kaushanskaya M. Marian V, et al. J Speech Lang Hear Res. 2007 Aug;50(4):940-67. doi: 10.1044/1092-4388(2007/067). J Speech Lang Hear Res. 2007. PMID: 17675598
  • Cross-Cultural Adaptation and Validation of the Language Experience and Proficiency Questionnaire Into Kannada. Narasimhan SV, Sahana K, Yashaswini G, Divyashree D. Narasimhan SV, et al. J Speech Lang Hear Res. 2023 Aug 3;66(8):2688-2700. doi: 10.1044/2023_JSLHR-22-00668. Epub 2023 Jul 12. J Speech Lang Hear Res. 2023. PMID: 37437530
  • Perspectives and Experiences of Autistic Multilingual Adults: A Qualitative Analysis. Nolte K, Fletcher-Watson S, Sorace A, Stanfield A, Digard BG. Nolte K, et al. Autism Adulthood. 2021 Dec 1;3(4):310-319. doi: 10.1089/aut.2020.0067. Epub 2021 Dec 7. Autism Adulthood. 2021. PMID: 36601639 Free PMC article.
  • Not All Bilinguals Are the Same: A Call for More Detailed Assessments and Descriptions of Bilingual Experiences. de Bruin A. de Bruin A. Behav Sci (Basel). 2019 Mar 24;9(3):33. doi: 10.3390/bs9030033. Behav Sci (Basel). 2019. PMID: 30909639 Free PMC article. Review.
  • A systematic review on the role of language-related factors in the manifestation of stuttering in bilinguals. Chaudhary C, Maruthy S, Guddattu V, Krishnan G. Chaudhary C, et al. J Fluency Disord. 2021 Jun;68:105829. doi: 10.1016/j.jfludis.2021.105829. Epub 2021 Jan 23. J Fluency Disord. 2021. PMID: 33556665 Review.
  • Modeling lexical abilities of heritage language and L2 speakers of Hebrew and English in Israel and the United States: a network approach. Fridman C, Livni A, Bar On S, Meir N. Fridman C, et al. Front Psychol. 2024 Apr 24;15:1331801. doi: 10.3389/fpsyg.2024.1331801. eCollection 2024. Front Psychol. 2024. PMID: 38778883 Free PMC article.
  • Overdue Assignment: A Case Study on Academic Writing Development for Postgraduate Health Professional Trainees in Kenya. Onguka S, de Meijer F, Basnight-Brown DM. Onguka S, et al. J Med Educ Curric Dev. 2023 Oct 16;10:23821205231206220. doi: 10.1177/23821205231206220. eCollection 2023 Jan-Dec. J Med Educ Curric Dev. 2023. PMID: 38025028 Free PMC article.
  • LexArabic: A receptive vocabulary size test to estimate Arabic proficiency. Alzahrani A. Alzahrani A. Behav Res Methods. 2023 Nov 14. doi: 10.3758/s13428-023-02286-z. Online ahead of print. Behav Res Methods. 2023. PMID: 37964176
  • Associations of multilingualism and language proficiency with cognitive functioning: epidemiological evidence from the SwissDEM study in community dwelling older adults and long-term care residents. Pacifico D, Sabatini S, Fiordelli M, Annoni AM, Frei A, Puhan M, Graf G, Albanese E. Pacifico D, et al. BMC Geriatr. 2023 Oct 6;23(1):629. doi: 10.1186/s12877-023-04311-4. BMC Geriatr. 2023. PMID: 37803260 Free PMC article.
  • What is the next structure? Guessing enhances L2 syntactic learning in a syntactic priming task. Alzahrani A. Alzahrani A. Front Psychol. 2023 Jun 29;14:1188344. doi: 10.3389/fpsyg.2023.1188344. eCollection 2023. Front Psychol. 2023. PMID: 37457077 Free PMC article.
  • Anderson JA, Mak L, Chahi AK, & Bialystok E (2018). The language and social background questionnaire: Assessing degree of bilingualism in a diverse population. Behavior Research Methods, 50, 250–263. - PMC - PubMed
  • Ansaldo AI, Ghazi-Saidi L, & Adrover-Roig D (2015). Interference control in elderly bilinguals: Appearances can be misleading. Journal of Clinical and Experimental Neuropsychology, 37, 455–470. - PubMed
  • Athanasopoulos P, Damjanovic L, Burnand J, & Bylund E (2015). Learning to think in a second language: Effects of proficiency and length of exposure in English learners of German. The Modern Language Journal, 99(S1), 138–153.
  • Bartolotti J, & Marian V (2012). Language learning and control in monolinguals and bilinguals. Cognitive Science, 36, 1129–1147. - PMC - PubMed
  • Bedore LM, Pena ED, Summers CL, Boerger KM, Resendiz MD, Greene K, Bohman TM, & Gillam RB (2012). The measure matters: Language dominance profiles across measures in Spanish–English bilingual children. Bilingualism: Language and Cognition, 15(03), 616–629. - PMC - PubMed

Related information

Grants and funding.

  • R01 DC011750/DC/NIDCD NIH HHS/United States
  • R01 HD059858/HD/NICHD NIH HHS/United States

LinkOut - more resources

Full text sources.

  • Europe PubMed Central
  • PubMed Central
  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

LexCHI: A quick lexical test for estimating language proficiency in Chinese

  • Open access
  • Published: 05 July 2023
  • Volume 56 , pages 2333–2352, ( 2024 )

Cite this article

You have full access to this open access article

thesis questionnaire about language proficiency

  • Yun Wen   ORCID: orcid.org/0000-0002-9917-4988 1 , 2 ,
  • Yicheng Qiu   ORCID: orcid.org/0000-0001-7336-1542 3 ,
  • Christine Xiang Ru Leong   ORCID: orcid.org/0000-0003-1110-545X 4 &
  • Walter J. B. van Heuven   ORCID: orcid.org/0000-0003-3183-4449 3  

2385 Accesses

4 Citations

3 Altmetric

Explore all metrics

A prominent methodological issue in cognitive research on bilingualism is the lack of consistency in measuring second language (L2) proficiency. To reduce the inconsistency in L2 proficiency measurements, brief and valid vocabulary tests have been developed as an objective measure of proficiency in a variety of languages (e.g., English, French, Spanish). Here, we present LexCHI, a valid lexical test to measure Chinese proficiency. This freely available short test consists of 60 two-character items presented in simplified Chinese. Although it only takes a few minutes to complete LexCHI, the LexCHI scores in two studies correlated significantly with L2 participants' performance in a translation task and a cloze test. We believe that LexCHI is a useful tool for researchers who need to objectively measure Chinese proficiency as part of their investigations.

Similar content being viewed by others

thesis questionnaire about language proficiency

LexMAL: A quick and reliable lexical test for Malay speakers

thesis questionnaire about language proficiency

Can Lextale-Esp discriminate between groups of highly proficient Catalan–Spanish bilinguals with different language dominances?

thesis questionnaire about language proficiency

LextPT: A reliable and efficient vocabulary size test for L2 Portuguese proficiency

Avoid common mistakes on your manuscript.

Introduction

It is now common practice to quantify language proficiency of bilingual participants in experimental settings. As reflected in a review of 186 bilingual studies published between 2005 and 2015, 77% of the studies reported bilinguals' language proficiency using a subjective or/and objective assessment (Surrain & Luk, 2019 ). The rationale behind this research practice is simple: bilingual populations show substantial variation in second language (L2) proficiency, and L2 proficiency is known to affect the representations and processes engaged in bilingual language processing (for reviews, see van Hell & Tanner, 2012 ; van Hell & Tokowicz, 2010 ; van Heuven & Dijkstra, 2010 ).

Despite the need for quantifying bilinguals' proficiency levels, there is little consensus on how to adequately measure L2 proficiency in experiments with bilinguals (for further discussion, see de Bruin, 2019 ; Hulstijn, 2012 ). Lacking such consensus, divergent measures of L2 proficiency have been used. For example, the subjective self-rated proficiency, which was provided in more than half of the studies from the above-mentioned review (Surrain & Luk, 2019 ), may differ in terms of the scales to rate (e.g., a scale of 1 to 7 vs a scale of 1 to 10), the endpoint labels (e.g., a 7-point scale can label its endpoint as " perfect ", " native-like " or " high proficiency "), facets of language ability (e.g., speaking vs reading), and how the rating question is framed (e.g., " How much reading experience do you have with the English language " used in Lemhöfer & Broersma, 2012 , vs " Indicate how good you consider yourself in reading English " used in Wen et al., 2018 ). Regarding objective assessments, Hulstijn ( 2012 ) reported that more than 10 different objective proficiency measurements were used in 63 bilingual studies, including the Test of English as a Foreign Language (TOEFL), International English Language Testing System (IELTS), and Test for English Majors (TEM, used in China). The existing diversity in L2 proficiency measures not only hinders across-study comparisons but also renders difficulty in including proficiency in a meta-analytic review (e.g., Lauro & Schwartz, 2017 ; Wen & van Heuven, 2017b ), thus slowing progress in bilingual investigations in the long term.

To reduce the heterogeneity in measures of L2 proficiency, one seemingly easy solution is to rely on the commonly used questionnaires in the field, for example, the language history questionnaire (LHQ, Li et al., 2006 , 2014 , 2020 ), the language experience and proficiency questionnaire (LEAP-Q, Marian et al., 2007 ; Marian & Hayakawa, 2021 ), and the language and social background questionnaire (LSBQ, Anderson et al., 2018 ). The published questionnaires present standardised proficiency questions which require participants to rate their L2 proficiency on a Likert scale for each individual skill (e.g., listening, speaking, reading, writing). Admittedly, self-rated proficiency assessment is easy and quick to administer either online or in a paper-and-pencil version, and it can be used in various languages through translation.

Although self-rated proficiency assessment has certain practical advantages, this subjective measure is limited with respect to reliability (i.e., whether it produces consistent results in repeated measures) and validity (i.e., whether it measures what it intends to measure). As shown by Tomoschuk et al. ( 2019 ), the correlations between the self-rated English oral proficiency and a standardised picture-naming task (MINT, Multilingual Naming Test, Gollan et al., 2012 ) interacted with bilingual groups. A closer look at the significant interaction revealed that Spanish–English bilinguals had higher MINT scores than Chinese–English bilinguals despite having the same self-rated speaking proficiency. A subsequent analysis replicated this pattern with speakers of the same language pair such that recently immigrated Chinese–English bilinguals scored lower in the English MINT than bilinguals who grew up in the United States even when their self-ratings were matched. Tomoschuk et al.’s ( 2019 ) findings imply that any two bilinguals with identical self-ratings may still differ in L2 proficiency levels. Another alarming finding comes from Lemhöfer and Broersma ( 2012 ), who demonstrated that self-ratings of English proficiency obtained with Korean–English bilinguals did not correlate significantly with bilinguals' performance in a standard English test (TOEIC, Test of English for International Communication). Although self-ratings showed significant correlations with objective proficiency measures in other studies (de Bruin et al., 2017 ; Marian et al., 2007 ), correlation coefficients varied considerably (e.g., ranging from 0.286 to 0.741 in Marian et al., 2007 ). Furthermore, self-rated L2 proficiency can be modulated by non-linguistic factors (for further discussion, see Amenta et al., 2020 ; Brysbaert, 2013 ; Ferré & Brysbaert, 2017 ; Izura et al., 2014 ), such as anxiety levels when using L2 (MacIntyre et al., 1997 ).

LexTALE and lextale-type vocabulary tests

Being aware of the potential issues with self-rated proficiency, researchers have striven to develop new tools to objectively assess L2 proficiency. One well-known tool is LexTALE (Lexical Test for Advanced Learners of English) developed by Lemhöfer and Broersma ( 2012 ). LexTALE consists of 60 items (i.e., 40 real English words and 20 nonwords) presented in a fixed order to all participants. LexTALE implements an untimed lexical decision task in which participants have to decide whether an item is an English word, without any time limit. To validate LexTALE, Lemhöfer and Broersma correlated the LexTALE scores against bilinguals' performance of the Quick Placement Test ( 2001 ) and a translation task. Significant positive correlations were found between the LexTALE scores and the two well-recognised measures of English proficiency, and these findings were consistent for Dutch–English bilinguals and Korean–English bilinguals. Therefore, although LexTALE seems to only tap into participants' word knowledge, its result is a valid proxy for English proficiency. As a valid test, LexTALE takes about 3.5 minutes to complete either online or as a paper-and-pencil test.

In addition to being short and easy to use, LexTALE has several advantages compared to other commonly used proficiency tests. First, in contrast to commercial tests such as the Quick Placement Test, LexTALE is freely available to the research community (included in the published paper and also available at http://www.lextale.com ). In comparison to a non-commercial translation task, LexTALE can be used with all English learners, whereas a translation task must be adapted based on participants’ languages (e.g., translating between English and Dutch for Dutch–English bilinguals versus translating between English and Korean for Korean–English bilinguals). There is a non-commercial picture-naming task (i.e., MINT, Gollan et al., 2012 ), but no paper-and-pencil version exists at the moment that can be easily distributed like LexTALE. Because of its free availability and easy administration, LexTALE has been widely used in bilingual research. In addition to describing bilinguals' English proficiency in an experiment (e.g., Van de Putte et al., 2018 ), LexTALE has been used as a screening test to select eligible participants (e.g., Declerck et al., 2020 ), and as an independent variable to investigate participants' performance in a linguistic task (e.g., Diependaele et al., 2013 ) or a non-linguistic task (e.g., Khare et al., 2013 ). Consequently, it is advisable to include LexTALE in any research protocol that involves non-native speakers of English, so that readers can compare the proficiency level of participant groups across articles (Brysbaert et al., 2017 ; Diependaele et al., 2013 ).

The idea of using LexTALE as the standard in the field clearly converges with the need for improving consistency in L2 proficiency measurements. But this unified approach requires that parallel lexical tests exist for various languages, as L2 differs among bilinguals. Along with the German and Dutch versions of LexTALE provided in Lemhöfer and Broersma ( 2012 ), researchers have extended the lextale format to seven other languages, including French (LEXTALE_FR, Brysbaert, 2013 ), Spanish (Lextale-Esp, Izura et al., 2014 ), Basque (Basque LexTALE, de Bruin et al., 2017 ), Italian (LexITA, Amenta et al., 2020 ), Portuguese (LextPT, Zhou & Li, 2022 ), Finnish (Lexize, Salmela et al., 2021 ), and logographical Chinese (LEXTALE_CH, Chan & Chang, 2018 ). When creating these extensions, researchers had to carefully sample word items based on word frequency to ensure that the extensions match the material used in LexTALE. It is important to note that lextale extensions cannot be developed by simply translating the English word items of LexTALE into another language because word frequencies of translation equivalents do not perfectly correlate (Wen & van Heuven, 2017a ). Instead, researchers typically create a lexical test for another language by testing native and non-native speakers with a larger set of items (e.g., 60 words and 60 nonwords in Brysbaert, 2013 ). When the final set of items are selected (e.g., 56 words and 28 nonwords in Brysbaert, 2013 ), the test is often administered to a new group of native and non-native speakers in a validation study. Disregarding the different numbers of items included (e.g., 84 items in the French extension vs 90 items in the Spanish extension), the ratio of words versus nonwords remains constant (i.e., 2:1) for all the equivalent lexical tests as well as the original LexTALE.

A difference between the original LexTALE tests developed by Lemhöfer and Broersma ( 2012 ) and the subsequent extensions is that authors did not try to equate the difficulty level of the tests across languages (as Lemhöfer and Broersma did for English, German, and Dutch), because there are no agreed standards in developing difficulty-matched tests for various languages (for relevant discussion, see Gollan et al., 2012 ). In addition, authors often wanted to develop a test that could be used for first language (L1) speakers as well as L2 speakers. Therefore, items were selected so that the best discrimination was possible between the L1 and L2 samples tested, thereby optimising assessment within a language rather than optimising assessment for comparison across languages. Furthermore, the lextale extensions were normally not compared to other well-established measures of proficiency via correlations because validity was considered a given. Therefore, it is important to make a distinction between the original LexTALE tests developed by Lemhöfer and Broersma ( 2012 ) for English, German, and Dutch advanced L2 speakers, and the subsequent lextale-type tests developed by others for other languages.

Altogether, lextale-type vocabulary tests in different languages provide an objective estimate of word knowledge. They take less than 5 minutes to complete, and can be used freely. This makes them ideal to combine with other tasks in experimental studies. They allow for direct comparisons of participants within a language, but comparisons between languages should be carefully applied (except for English, German, and Dutch).

Developing LexCHI

Building on LexTALE and its extensions, the current study presents the development of LexCHI as a lexical test for Chinese using simplified Chinese characters. Simplified Chinese characters are the standardised written form of Chinese in mainland China even though there are various dialects spoken by Chinese speakers (see Gu, 2006 , for more information about Chinese dialects). As mentioned above, a lextale-type test has already been created for Chinese using simplified characters (LEXTALE_CH, Chan & Chang, 2018 ), but this test only includes single characters as items. Testing only single characters to measure Chinese lexical knowledge is a limitation, because most Chinese words contain more than one character. To illustrate this, we analysed 99,121 unique Chinese words from the Chinese subtitle corpus (SUBTLEX-CH, Cai & Brysbaert, 2010 ; corpus size: 33.5 million words). This revealed that only 5.4% of the words consist of one character, whereas 46.2% of the words have two characters and 24.7% have three characters (12.0% are four-character words and 11.4% are multiple-character words). It is also noteworthy that not all Chinese characters are free morphemes (DeFrancis, 1984 ; Myers, 2006 ), so a Chinese character does not always correspond to a word at the lexical level. To create a more appropriate lextale-type test for Chinese, we focused on two-character items for LexCHI, as our analyses showed the majority of Chinese words contain two characters (cf. Li et al., 2015 ).

In line with lextale-type tests developed for other languages (e.g., Amenta et al., 2020 ; Izura et al., 2014 ; Zhou & Li, 2022 ), the present study develops LexCHI in two studies by testing L1 Chinese speakers as well as L2 Chinese speakers. A preparatory study (Experiment 1 ) was first conducted to select good items from a larger set of candidates, followed by a validation study (Experiment 2 ) to test the selected items with a new group of participants. In addition, both studies validated LexCHI akin to LexTALE, such that LexCHI scores would be correlated against a brief version of a standard Chinese proficiency test (i.e., a 20-item cloze test) and a translation task. If LexCHI is a valid test, we would expect significant positive correlations between LexCHI scores and the other two measures of Chinese proficiency. We also tested whether LexCHI is a better lexical test than LEXTALE_CH (Chan & Chang, 2018 ), as we expected, becauset single-character words are not representative for the full set of Chinese words. To this end, the LEXTALE_CH character test was included in order to compare its correlations with the cloze test and the translation task relative to LexCHI. If LexCHI is indeed a better measure, we would expect higher correlations of LexCHI scores with the other two measurements in comparison with LEXTALE_CH. Including the LEXTALE_CH character test also enables us to explore the relationship between character knowledge and word knowledge. We predicted a significant positive correlation between LEXTALE_CH and LexCHI, because character knowledge is part of word knowledge in Chinese.

Introducing the normalised Ghent score

In addition to introducing LexCHI, we further propose a new method for scoring the test. It is important to note that LexTALE and its extensions use different equations to correct for the unequal number of word and nonword items presented (e.g., 40 words and 20 nonwords). When developing LexTALE, Lemhöfer and Broersma ( 2012 ) recommended calculating the test score as follows:

Although this calculation showed the strongest correlations with the Quick Placement Test and the translation task, Brysbaert ( 2013 ) pointed out that possible scores range between 50% and 100% rather than between 0 and 100% (e.g., if participants respond Yes to all items, this will result in a score of 50%). Brysbaert further suggested computing the Ghent score for the French extension of LexTALE (see Eq. 1 ).

As indicated in Eq. 1 , the Ghent score adjusts Lemhöfer and Broersma's calculation by taking into account the incorrect trials in nonwords instead of the correct ones. Brysbaert's Ghent score has been adopted by follow-up studies, except for the Basque LexTALE. Given that the ratio of words versus nonwords is fixed in all lextale extensions (2:1), the Ghent test score calculation can be simplified to Eq. 2 .

Unfortunately, this approach leads to differences in the score range (e.g., ranging from −56 to 56 in the French extension vs −60 to 60 in the Spanish extension), because the Ghent score range, unlike the original LexTALE score, depends on the number of word and nonword items, which differs across tests (e.g., the number of items is 75, 84 and 90 in the Basque, French and the Spanish tests, respectively). Given the difference in score range, the scoring equation in Lemhöfer and Broersma ( 2012 ) is more appealing for studies which use more than one lexical test to measure proficiency in multiple languages at the same time (e.g., de Bruin et al., 2017 ). Moreover, Lemhöfer and Broersma ( 2012 ) recommend a cut-off score that can separate the participants into advanced and intermediate proficiency levels (i.e., C1/C2 vs B2 in the Common European Framework of Reference for Languages), and such cut-off point is not available when using the Ghent score. Therefore, Lemhöfer and Broersma's calculation is also applied when a LexTALE extension is used as a diagnostic tool to filter out participants (e.g., Wen et al., 2021 ). In view of the pros and cons of the existing equations, we propose the normalised Ghent score, which divides the Ghent score by the number of word items (see Eq. 3 ). Footnote 1

The normalised Ghent score has a fixed range of −100% to 100% independent of item numbers, while retaining the advantages of Brysbaert's approach. Furthermore, the normalised Ghent score expressed as percentages is easy to interpret, just like the original LexTALE scoring. Therefore, the normalised Ghent score will be used in the present study.

Experiment 1: Preparatory study

The primary goal of Experiment 1 is to test a set of 120 items and select a subset of 60 items for LexCHI so that it contains the same number of word and nonword items as the original LexTALE (Lemhöfer & Broersma, 2012 ). To assess the initial validity, LexCHI scores are correlated with the results of a cloze test and a translation task. We expected significant positive correlations between LexCHI and the cloze test as well as the translation task. Following Lemhöfer and Broersma ( 2012 ), we mainly focused on bilinguals who are non-native speakers of Chinese in the correlation analyses, but we also recruited native Chinese speakers for the purpose of item selection (see “ Data analysis ” for details). Therefore, only non-native Chinese speakers completed the cloze test and the translation task. Additionally, the LEXTALE_CH character test (Chan & Chang, 2018 ) was included for two reasons. First, it enables a comparison between LexCHI and LEXTALE_CH. For the non-native Chinese speakers, we expected higher correlations for LexCHI scores against the cloze test and the translation task relative to LEXTALE_CH. Second, it enables exploring the relationship between character knowledge and word knowledge. We expected a significant positive correlation between LEXTALE_CH and LexCHI scores. In line with the original LexTALE study (Lemhöfer & Broersma, 2012 ), a language background questionnaire was included in the experiment to gather detailed information of participants' linguistic profiles.

Participants

Two groups of participants were recruited for Experiment 1 . The first participant group was an L2 group which consisted of 75 non-native speakers of Chinese (female: 54, male: 19, prefer not to say: 2). The L2 participants were recruited at the University of Nottingham Malaysia campus because Malaysia has a considerable number of Chinese learners. They were 24.24 years old on average (range = 18–53, SD = 6.15), and indicated their first language as English ( N  = 60), Malay ( N  = 9), Thai ( N  = 3), Indonesian ( N  = 1), Tamil ( N  = 1), or Vietnamese ( N  = 1). The L2 group received an inconvenience allowance of 10 Malaysian ringgit. Data from seven additional L2 participants were excluded from the analyses due to zero accuracy in the translation task ( N  = 3) or self-reported language impairment ( N  = 4). The second participant group was the L1 group, which consisted of 54 native Chinese speakers (female: 31, male: 23). The L1 participants were recruited online via Prolific ( www.prolific.co ). They received an inconvenience allowance of £2. L1 participants were 32.2 years old (range = 20–57, SD = 7.62) on average, and they all indicated their first language as Mandarin ( N  = 49) or a Chinese dialect ( N  = 5), e.g., Cantonese. All L1 participants were speakers of Mandarin Chinese (henceforth Chinese). Data from six additional participants were excluded from the analyses due to low self-rated Chinese reading proficiency (< 5 on a 7-point scale, N  = 3) or because they took an excessively long time to finish the task (> 25 minutes, N  = 3, possibly due to consulting external tools such as a dictionary). Other language background information of the two participant groups will be presented in the Results section.

Procedure and materials

The experiment consisted of a series of tasks presented in a fixed order to participants in Qualtrics. Participants in the L2 group received written instructions in English and completed five tasks, whereas participants in the L1 group received written instructions in Chinese and completed three tasks (Task 1, 2, 5). All participants provided informed consent at the beginning of the study. The study was approved by the Science and Engineering Research Ethics Committee at the University of Nottingham Malaysia Campus (L2 group) in Malaysia and the Ethics Committee at the School of Psychology, University of Nottingham, UK (L1 group).

Task 1: LEXTALE_CH

Chan and Chang ( 2018 ) developed a character-based Chinese proficiency test and made it freely available at https://osf.io/qdy4n/ . This LEXTALE_CH test consists of 90 items including 30 non-characters and 60 real characters written in simplified Chinese. Among the real characters, 18 have zero occurrences in SUBTLEX-CH (Cai & Brysbaert, 2010 ), and eight have a frequency less than 1 per million characters. Of the remaining characters, 22 have a frequency ranging from 1 to 10 per million, eight have a frequency between 10 and 100 per million, and four have a frequency higher than 100 per million. The average number of strokes in the 60 characters is 11.5 (range = 4–25).

The 90 items were presented one at a time in a fixed order to all participants. Participants were instructed to decide whether a presented item was a real Chinese character or not by pressing the Yes or No button on the screen. They were informed that they did not need to respond rapidly, and they should not consult a dictionary. The mean duration of this task was 3.34 minutes for the L2 group (SD = 2.16 minutes) and 2.31 minutes for the L1 group (SD = 1.42 minutes).

The normalised Ghent scores were calculated for Task 1 using Eq. 3 (60 real characters and 30 noncharacters).

Task 2: LexCHI (120 items)

Similar to LEXTALE_FR (Brysbaert, 2013 ), 120 items (60 words and 60 nonwords) were selected from a megastudy of simplified Chinese using a lexical decision task (Tsang et al., 2018 ). All items consisted of two characters with error rates lower than 10% in the megastudy. Following Brysbaert ( 2013 ), we selected the same number of word items in each word frequency range. Because a megastudy of simplified Chinese demonstrated that character frequency impacts the recognition of two-character Chinese words (Sun et al., 2018 ), it is crucial to match character frequency within a word with its word frequency as closely as possible. Therefore, we selected 17 words with a word frequency less than 1 per million (character frequency of each character: < 1 per million), 11 words with a word frequency between 1 and 5 per million (character frequency of each character: 1–5 per million), 16 words with a word frequency between 5 and 10 per million (character frequency of each character: 3–25 per million), nine words with a word frequency between 10 and 20 per million (character frequency of each character: 7–22 per million), 30 words with word frequency between 30 and 100 per million (character frequency of each character: 30–100 per million), and one word with word frequency over 100 per million (character frequency of each character > 100 per million). In order to make nonwords equally difficult, the 60 nonword items (i.e., non-existing sequences of two existing characters) were also selected from the same megastudy by matching character frequency and stroke number with the selected word items. The selected nonwords include two items with a character frequency (for both characters within an item) less than 2 per million, 19 items with a character frequency less than 10 per million, 23 items with a character frequency less than 20 million per million, nine items with a character frequency between 4 and 19 per million, six items with a character frequency between 30 and 100 per million, and one item with a character frequency above 100 per million. There were no repeated characters in words and nonwords. The average stroke number of words and nonwords is 23.18 (SD = 4.87) and 23.47 (SD = 4.47) respectively. It is important to note that the average word frequency of the selected Chinese words is also comparable to that of the English words used in LexTALE (Lemhöfer & Broersma, 2012 ). Using Zipf values as the standardised word frequency measure (van Heuven et al., 2014 ), the average word frequency is 3.55 (SD = 0.81) for the selected Chinese words and 3.10 (SD = 0.60) for English words in LexTALE based on the subtitles (Cai & Brysbaert, 2010 ; van Heuven et al., 2014 ). The full list of stimuli is provided in Appendix 1 .

The 120 items were presented one at a time in an identical pseudorandom order with words or nonwords occurring no more than five times in a row (Lemhöfer & Broersma, 2012 ). Participants were instructed to decide whether a presented item was a real Chinese word or not by pressing the Yes or No button on the screen. They were informed that they did not need to respond rapidly, and that they should not consult a dictionary. Similar to Lemhöfer and Broersma ( 2012 ), participants were also instructed to press the No button if they were not sure whether an item was a word. The mean duration of this task was 5.25 minutes for the L2 group (5.73 minutes) and 2.34 minutes for the L1 group (SD = 0.74 minutes).

The normalised Ghent scores were calculated for Task 2 (using the equation in Footnote 1 with 60 words and 60 nonwords).

Task 3: Cloze test

The cloze test consisted of 20 items taken from Hanyu Shuiping Kaoshi (Chinese Proficiency Test). Hanyu Shuiping Kaoshi (HSK) is an official Chinese language proficiency test for non-native Chinese speakers administered by the Confucius Institute Headquarters. HSK provides tests at six levels (i.e., Level 1 to Level 6) for beginning, intermediate and advanced learners. Since a full HSK test takes too long to complete within empirical studies, abridged versions of HSK are often used to measure Chinese proficiency (Li et al., 2019 ; Zhang et al., 2020 ). In the reading test of HSK Level 6, there are ten multiple-choice items which require participants to choose one set of correct words from four possible options to complete sentences provided. Such fill-in-the-blank items have been used as cloze tests to measure second language proficiency (Cromheecke & Brysbaert, 2022 ; Oller, 1973 ; Tremblay, 2011 ). The cloze test in our study was not generated by using the sample exam papers provided on the HSK official website because these free resources are easily accessible to participants. Therefore, 20 items were selected from two books which published previous exam papers for HSK Level 6 (Confucius Institute Headquarters, 2016 , 2018 ). The 20 items are composed of six items with three blanks, 12 items with four blanks and two items with five blanks, which mirrors the composition commonly observed in a 10-item set within one HSK test (i.e., including 3 three-blank items, 6 four-blank items and 1 five-blank item). All words presented in the choices consist of two characters. For the words within the correct answers, the average stroke number is 16.18 (SD = 4.79) and the average word frequency is 4.03 (SD = 0.71) in Zipf values (Cai & Brysbaert, 2010 ; van Heuven et al., 2014 ).

Twenty multiple-choice items were presented one by one, and the presentation order was fixed. Participants were instructed to select the correct words that fit in the sentences by clicking on one of the four choices. They could take as long as necessary to make their choice. On average, this task took the L2 group ( N  = 75) 16.00 minutes to complete (SD = 11.44 minutes).

Task 4: Translation task

The translation task consisted of 30 English words and 30 Chinese words selected from an English-Chinese translation database (Wen & van Heuven, 2017a ). We first selected 60 English words using the following criteria similar to Lemhöfer and Broersma ( 2012 ): (1) the dominant part-of-speech of English words should be a noun; (2) the Chinese–English pairs are non-cognate translations; (3) the English words had no more than three correct Chinese translations (mean = 1.82 , SD = 0.81) and their translation error rates were higher than 50% (mean = 64.94%, SD = 7.51%). Because the translation database of Wen and van Heuven only includes Chinese translations of English words, the most frequent Chinese translations for half of the selected English words were used as the Chinese items to be translated into English. These 30 Chinese items consist of two characters. Their mean number of strokes is 15.57 (SD = 5.24), and their mean word frequency is 3.82 (SD = 0.64) in Zipf values (Cai & Brysbaert, 2010 ). None of the 30 Chinese items appeared in Task 2. The remaining 30 English words were used as the English items to be translated into Chinese. These English items had a mean word length of 6.33 letters (SD = 1.83) and a mean word frequency of 3.74 (SD = 0.41) in Zipf values (van Heuven et al., 2014 ). The Chinese and English words used in the translation task are provided in Appendix 2 .

Participants first completed the English-to-Chinese translation task and then the Chinese-to-English translation task. English or Chinese items were presented one at a time, and all items were presented in a fixed order. Participants were required to provide the first Chinese/English translation that came to their mind (Tokowicz & Kroll, 2007 ; Wen & van Heuven, 2017a ; Wu & Thierry, 2010 ). They could take as long as necessary to type in their answers, and they were asked to skip an item by pressing the Next button on the screen if they could not provide a translation. On average, this task took the L2 group ( N  = 75) 13.14 minutes to complete (SD = 7.92 minutes).

The translations provided by the participants were first automatically compared to the correct translation included in the English-Chinese translation database (Wen & van Heuven, 2017a ). All other translations were manually checked using the Oxford Advanced Learner's English-Chinese Dictionary (Hornby, 2018 ). Responses with typos or spelling mistakes were scored as incorrect.

Task 5: Language background questionnaire

The aim of the language background questionnaire is to understand participants' experience with Chinese language (e.g., age of first contact, years of experiences, the language used by parents or carers during childhood, the main instruction language used by teachers from kindergarten to university) and their self-perceived Chinese proficiency. The questions were adapted from the questionnaire used in prior work to obtain participants' linguistic profile (e.g., Wen et al., 2018 ; Wen & van Heuven, 2017a , 2018 ). For example, participants were asked to rate their ability of Chinese speaking, listening, reading and writing ability separately on a 7-point scale (1 = very poor, 7 = native-like) by receiving the following question, i.e., "Indicate how good you consider yourself in listening, speaking, reading, and writing in Chinese (Mandarin)".

Data analysis

A series of correlation analyses were conducted separately for the L1 and L2 groups. Spearman's rank correlation coefficients were calculated when involving self-rating data, and Pearson correlation coefficients were calculated when rating data were not involved.

Following recent studies (e.g., Amenta et al., 2020 ; Brysbaert, 2013 ; Izura et al., 2014 ), a two-step analysis was conducted to evaluate the items. In the first step, the point-biserial correlation was calculated. In the second step, the item response theory analysis was conducted using the ltm package (Rizopoulos, 2006 ) in R version 4.1.0 (R Core Team, 2021 ). In both steps, data of the L1 and L2 group were combined in the analyses, but the analyses were conducted separately for word and nonword items. Thus, a point-biserial correlation was first calculated between the participants' accuracy of one word/nonword item and their overall accuracy across all word/nonword items. Like the Pearson correlation coefficient, the point-biserial correlation coefficient also ranges between −1 and 1. For a word item, a positive correlation indicates that participants who correctly identify the word are likely to obtain higher scores for the word items. Likewise, a positive correlation for a nonword item suggests that participants who correctly reject the nonword are likely to have higher scores. Thus, for both word and nonword items, positive correlations are the first criterion for good items, and items with negative correlations should be deleted before running the item response theory analysis. In the second step, the item response theory analysis was conducted in which a latent variable modelling was run separately for words and nonwords (see Şahin & Anil, 2017 , for discussion about sample sizes in the IRT analysis). Each model produced two values for all items, i.e., difficulty and discriminative power. Following previous studies (Amenta et al., 2020 ; Chan & Chang, 2018 ; Izura et al., 2014 ), the selected items should vary in terms of difficulty and have a good discriminative power, so the selection was based on both difficulty and discriminative power parameters. Therefore, word and nonword items were separately ordered based on the difficulty parameter and were grouped into 20 groups (three items per group). Within each group, two words or one nonword with highest discriminative parameter were then selected given that we aimed to select 40 word items and 20 nonword items for the final set of LexCHI.

The results of each experimental task are shown in Table 1 , and details of participants' language background obtained from the language background questionnaire (Task 5) are summarised in Table 2 .

Correlations

Table 3 shows the correlations of the normalised Ghent scores of LEXTALE_CH (Task 1) and LexCHI (Task 2, 120 items) against the accuracy rates of the cloze test (Task 3), the translation task (Task 4), and self-ratings of Chinese ability for the L2 group. As Table 3 shows, LexCHI and LEXTALE_CH significantly correlated with all the other measures, and both tests had the highest correlation with the cloze test (LexCHI: r  = 0.81, p < .001, the LEXTALE_CH: r  = 0.68, p < .001). When the correlations of LexCHI with the cloze test and self-ratings were compared to those of LEXTALE_CH, LexCHI consistently outperformed LEXTALE_CH. In addition, LEXTALE_CH and LexCHI positively correlated ( r  = 0.72, p < .001).

Table 4 shows the correlations of the normalised Ghent scores of the LEXTALE_CH (Task 1) and LexCHI (Task 2, 120 items) against all the self-ratings of Chinese ability for the L1 group. As can been seen in Table 4 , LexCHI significantly correlated with all the self-ratings of Chinese ability whereas LEXTALE_CH only significantly correlated with the average self-rating across four skills and the self-rated writing ability. Consistent with the patterns observed in the L2 group, higher positive correlations were found between LexCHI and the self-ratings than between the character task and the self-ratings. For the L1 group, LEXTALE_CH and LexCHI also significantly correlated ( r  = 0.62, p < .001).

Item selection for LexCHI

For the point-biserial correlation analyses, all the word and nonword items showed positive correlations (range: 0.29–0.75 for words, 0.30–0.71 for nonwords). Therefore, all 60 words and 60 nonwords were included in the item response theory analysis. Figure 1 illustrates the results of three-word items in the item response theory analysis. In Fig. 1 , the difficulty (ability) parameter is represented on the x -axis, and each item's difficulty value is the x value when its curve reaches 0.5 on the y -axis (the dotted line). Thus, the word 亵渎 ( profanity , difficulty = 0.32) is more difficult than the word 慷慨 ( generous , difficulty = −0.73) and 愤怒 ( anger , difficulty = −1.61). On the other hand, the discriminative parameter is represented by the steepness of the curves. Thus, the word 慷慨 ( generous , discriminative = 5.14) has better discriminative power than the word 亵渎 ( profanity , discriminative = 2.71) and 愤怒 ( anger , discriminative = 2.76). Based on the results of the item response theory analysis (see “ Data analysis ” for the detailed selection procedure), 40 words and 20 nonwords were selected for the final version of LexCHI (see Appendix 3 for the full list of items). The lexical characteristics of selected items are summarised in Table 5 .

figure 1

Item response curves for three Chinese word: 亵渎 (profanity), 愤怒 (anger) and 慷慨 (generous)

In Experiment 1 , we set out to test 120 two-character Chinese items and select 60 items with varying difficult levels and good discriminative power for LexCHI. In addition to the 120-item LexCHI, we also administrated a cloze test and a translation task to a group of non-native Chinese speakers (L2 group). The L2 group's performance in the cloze test and the translation task was correlated against LexCHI as a means of accessing its initial validity. As expected, significant positive correlations of LexCHI against the cloze test and the translation task were found, providing evidence for the LexCHI score as a valid index of Chinese proficiency. Also, the correlation of the cloze test was higher for LexCHI than LEXTALE_CH, indicating that LexCHI is a better lexical test for estimating Chinese proficiency. For both L1 and L2 groups, LexCHI significantly correlated with all the self-ratings of Chinese proficiency, with higher correlations in the L2 group. This pattern is in line with previous studies which reported that correlations between objective and subjective measures of proficiency were higher in the weaker language than in the stronger language of bilinguals (Gollan et al., 2012 ; Marian et al., 2007 ; Sheng et al., 2014 ). As predicted, LexCHI and LEXTALE_CH positively correlated in both groups. Importantly, the LexCHI data of the L1 and L2 groups were combined in a two-step analysis to select 60 good items out of the 120-item set. Based on the point-biserial correlation and the item response theory analysis, 40 word items and 20 nonword items were selected.

Like other lextale extensions (e.g., Amenta et al., 2020 ; Izura et al., 2014 ; Zhou & Li, 2022 ), the final set of items in LexCHI would be evaluated with a new group of native and non-native speakers of Chinese in Experiment 2 . Experiment 2 also includes the LEXTALE_CH character test, a cloze test and a translation task. Note that five participants removed from the L1 group in Experiment 1 indicated English or Spanish as their first language even though demographic filters in Prolific were applied to target L1 Chinese speakers only. These five participants could not be included as L2 participants because they did not take the cloze test and the translation task. Therefore, the same set of tasks are administered to both L1 and L2 groups in Experiment 2 so that participants could be assigned to a different group if needed.

Experiment 2: Validation study

Experiment 2  was used to further evaluate validity and reliability of the final set of 60 items in LexCHI. This evaluation was conducted with new groups of native and non-native Chinese speakers. Similar to Experiment 1 , Experiment 2 included LEXTALE_CH (Chan & Chang, 2018 ), a cloze test, a translation task and a language background questionnaire. Because most non-native Chinese speakers in Experiment 1 indicated English as their first language, we decided to recruit non-native Chinese speakers with English as their first language for the L2 group in Experiment 2 . LexTALE (Lemhöfer & Broersma, 2012 ) was also included to obtain an objective English proficiency measure. L2 participants should have high scores in the English vocabulary test (LexTALE) because their L1 is English even though they might have low scores in the tasks involving Chinese (e.g., LexCHI). In line with Experiment 1 , we predicted significant correlations of LexCHI scores with the cloze test and the translation task in the L2 group. Furthermore, a higher positive correlation for LexCHI scores than the LEXTALE_CH scores is expected when using the cloze test and the translation task as the reference. Finally, we also expected a significant positive correlation between the LEXTALE_CH and LexCHI.

A group of non-native Chinese speakers (L2 group) and a group of native Chinese speakers (L1 group) were recruited online via Prolific (seven participants swapped between groups). Footnote 2 All participants received an inconvenience allowance of £6. The L2 group consisted of 59 non-native Chinese speakers (female: 32, male: 27; age: mean = 26.58 years, range = 18–44, SD = 6.07). The L2 participants indicated either English ( N  = 44) or Chinese ( N =15) as the first language. Although 15 participants of the L2 group indicated Chinese (Mandarin or a Chinese dialect, e.g., Shanghainese, Cantonese, Hakka) as the first language which was used by their parents/carers during childhood, most of these participants received all their education in English ( N  = 12; for one participant, Mandarin was used as the main language for instructions only during secondary education; for two participants, Mandarin or Cantonese was used the main language for instructions only in kindergarten). These participants did not speak other languages in addition to English and Chinese (Mandarin or a Chinese dialect). Their self-ratings of English proficiency for four skills (listening, speaking, reading, writing) were all 7 (i.e., native-like), whereas their average self-rated Chinese proficiency (the mean of four skills) was 3.20 (range = 1–5.75) on a 7-point scale. Footnote 3 Therefore, these 15 participants should be considered as non-native Chinese speakers. Data from 18 additional participants were excluded from the analyses due to bad performance in the English vocabulary test (LexTALE scores < 80, see Footnote 2, N  = 6), zero accuracy in the translation task ( N  = 3), using traditional Chinese characters in the translation task ( N  = 7) or self-reported language impairment ( N  = 2). The L1 group consisted of 46 native Chinese speakers (female: 23, male: 23; age: mean = 30.80 years, range = 19–60, SD = 9.15). All L1 participants indicated Mandarin ( N  = 43) or a Chinese dialect ( N  = 3) as their first language and were speakers of Mandarin Chinese. Data from four additional participants were excluded from the analyses due to low self-rated Chinese reading proficiency (< 5 on a 7-point scale, N  = 1), Footnote 4 zero accuracy in the translation task ( N  = 1) or using traditional Chinese characters in the translation task ( N  = 1). Additional language background information of the two participant groups will be presented in the Results section.

Procedure, materials and scoring

The experiment consisted of six tasks presented in a fixed order to participants in Qualtrics. Written instructions were provided in English for both groups. All participants gave informed consent at the beginning of the study. The study was approved by the Ethics Committee of the School of Psychology at the University of Nottingham. The overall procedure was identical to Experiment 1 (the L2 group) except for two aspects. First, there were 60 items in Task 2 (LexCHI) instead of 120 items. On average, this task now took 2.06 minutes to complete (SD = 2.49 minutes). Second, before the final language background questionnaire, an English vocabulary test (LexTALE, Lemhöfer & Broersma, 2012 ) was included as Task 5. This task consisted of 40 real English words and 20 nonwords which were presented one at a time in a fixed order to all participants. Participants were instructed to decide whether a letter string was an English word or not by pressing the Yes or No button on the screen. They were informed that they did not need to respond rapidly, and they should not consult a dictionary. On average, this task took 1.97 minutes to complete (SD = 1.69 minutes).

The normalised Ghent scores were calculated for Task 1, Task 2 and Task 5 using Eq. 3 .

A series of correlation analyses were conducted separately for the L1 and L2 groups. Pearson correlation coefficients were calculated except that Spearman's rank correlation coefficients were computed for data involving self-ratings. To evaluate the final set of 60 items for LexCHI, the reliability of 60 items was measured with Cronbach's alpha and the split-half correlation with the help of the ltm package (Rizopoulos, 2006 ) and the performance package (Lüdecke et al., 2021 ) in R version 4.1.0 (R Core Team, 2021 ). Data of the L1 and L2 groups were combined in the reliability analysis.

The results of each experimental task are shown in Table 6 , and details of participants' language background obtained from the language background questionnaire (Task 6) are summarised in Table 7 . As can be seen from Table 6 , the L1 group outperformed the L2 group in all tasks involving Chinese (Task 1–4). Critically, for the 60-item LexCHI (Task 2), the normalised Ghent scores of the L2 group were significantly lower than those of the L1 group (Wilcoxon rank-sum test, W = 2644, p < .001, effect size d  = −3.23).

Table 8 shows the correlations of the normalised Ghent scores of LEXTALE_CH (Task 1) and LexCHI (Task 2, 60 items) against the accuracy rates of the cloze test (Task 3), the translation task (Task 4), and self-ratings of Chinese ability.

For the L2 group, Table 8 shows that both LEXTALE_CH and LexCHI correlated significantly with the cloze test and the translation task. Compared with LEXTALE_CH, LexCHI revealed slightly higher correlations with the cloze test, the translation task and all the self-ratings of Chinese proficiency except for the self-rated reading proficiency. Both LEXTALE_CH and LexCHI had the highest correlation with the translation task. Additionally, LEXTALE_CH and LexCHI were positively correlated ( r  = 0.60, p < .001, see Fig. 2 ).

figure 2

LexCHI by LEXTALE_CH in Experiment 2 with a linear regression line which is shaded by the 95% confidence intervals (normalised Ghent scores; Left: L1 group; Right: L2 group)

Results of the L1 group revealed that LEXTALE_CH and LexCHI significantly correlated with the cloze test, but not the translation task. As can be seen in Table 8 , there are also significant correlations of LEXTALE_CH with the average self-rating (across four skills) and the self-rated writing ability. Assuming that the translation task should be correlated with participants' L2 (English) proficiency, we also checked the relationship between the translation task and LexTALE, which indeed revealed a significant correlation, r  = 0.37, p =  .011. LEXTALE_CH and LexCHI also positively correlated ( r  = 0.50, p < .001, see Fig. 2 ).

Reliability (LexCHI)

The Cronbach's alpha for the 60-item LexCHI was 0.96, which indicates a high reliability of the test. The high Cronbach's alpha obtained for LexCHI was very similar to that of other versions of LexTALE, e.g., 0.96 for the Italian version (Amenta et al., 2020 ), 0.96 for the French version (Brysbaert, 2013 ), 0.95 for the character-based Chinese version (Chan & Chang, 2018 ). In line with Cronbach's alpha, the split-half correlation was very high ( r  = .922, the Spearman-Brown corrected correlation: r  = .959). Footnote 5

Experiment 2 evaluated the final version of LexCHI with a new group of participants. The first key finding is that LexCHI with the 60 items has a high reliability (e.g., Cronbach's alpha = 0.96). Another important finding is that LexCHI scores of L1 participants were significantly higher than those of the L2 participants (effect size d  = −3.23). As predicted, LexCHI scores of the L2 group significantly correlated with the cloze test and the translation task, thus mirroring the findings in Experiment 1 . This consistent finding clearly demonstrated the validity of LexCHI. Another finding consistent with Experiment 1 is that LexCHI and the LEXTALE_CH character test positively correlated (see Fig. 2 ). In line with our prediction for the L2 group, LexCHI correlations with the cloze test and the translation task were higher than those for LEXTALE_CH with the cloze test and translation task. This result also replicated the findings of Experiment 1 . Because LexCHI consistently outperformed LEXTALE_CH in the L2 groups, LexCHI is a better lexical test for estimating Chinese proficiency of non-native speakers.

To further explore whether LEXTALE_CH and LexCHI measure different aspects of Chinese knowledge, we conducted a post-hoc regression analysis on the combined data of L1 and L2 groups with LEXTALE_CH and LexCHI as predictors. Accuracy of the cloze test was chosen as the dependent variable in the regression analysis because both LEXTALE_CH and LexCHI had higher correlation with the cloze test than with the translation task when the data of L1 and L2 groups were combined (see Appendix 4 ). To address the issue of the collinearity between the two predictors, the scores of LEXTALE_CH were first orthogonalised by fitting a linear model in which LEXTALE_CH scores were predicted by the LexCHI scores (see Wen & van Heuven, 2017a , for a similar approach). The residuals of this model were to be used in the regression analysis as the predictor of LEXTALE_CH. In the regression analysis, the LexCHI scores were entered in the first step to predict accuracy of the cloze test, and the LEXTALE_CH scores were entered in the second step (see Appendix 4 for the exploration with the LEXTALE_CH scores entered first). In the first step, LexCHI was a significant predictor (β = 0.56203, SE = 0.03782, t  = 14.86, p < .001; R 2 =  0.682, adjusted R 2 =  0.6789). In the second step, LexCHI and LEXTALE_CH were both significant predictors (LexCHI: β = 0.56203, SE = 0.03661, t  = 15.350, p < .001; LEXTALE_CH: β = 0.30580, SE = 0.10888, t  = 2.809, p < .01; R 2 =  0.7048, adjusted R 2 =  0.699). Although the regression analysis showed that participants' scores in both LexCHI and LEXTALE_CH significantly predicted their performance of the cloze test, it is clear that LEXTALE_CH scores only accounted for a small portion of the variance in the cloze test accuracy as indicated by a less-than-3% increase of R 2 in the second step. Taken together, it is likely that LEXTALE_CH is able to provide certain complementary information of participants' Chinese knowledge on top of LexCHI. However, further research is needed to find out what kind of additional knowledge can be measured by LEXTALE_CH. We recommend using both tests when researchers need to have a fine-grained description of participants' Chinese knowledge and using LexCHI alone when a valid proficiency measure of Chinese is sufficient.

General discussion

The present study was designed to develop LexCHI, a Chinese extension of LexTALE (Lemhöfer & Broersma, 2012 ). In two experiments, we evaluated LexCHI and demonstrated that this lexical test is a good indicator of Chinese proficiency. As a valid lexical test, LexCHI will be a useful instrument for researchers interested in Chinese processing by second language Chinese speakers.

When studying Chinese processing in non-native speakers (e.g., Chang et al., 2015 ; Chen et al., 2018 ; Li et al., 2019 ; Pelzl et al., 2021 ), researchers need to measure Chinese proficiency, because it is common practice to report language proficiency in non-native speakers (see Zhang, 2018 , for a review of research on Chinese learning which revealed that most studies did not report learners’ Chinese proficiency). Because participants in a study often have not taken a standard Chinese proficiency test (e.g., HSK), measures of language proficiency need to be gathered during an experiment. Even if it is possible to recruit participants who have taken a standard Chinese proficiency test, their test scores could be several months or years old, and therefore such scores do not represent their current proficiency because language proficiency is likely to change over time. When a freely available Chinese test is absent, researchers opt to either create an abridged version of a commercial proficiency test (e.g., HSK) which unavoidably varies across research groups and cannot be publicly shared as restricted by copyright, or simply rely on self-rated proficiency. Although self-rated proficiency is certainly better than no measures at all, subjective assessment is clearly inferior to objective assessment as discussed in the Introduction. The existing LEXTALE_CH character test (Chan & Chang, 2018 ) is not a widely used measure of Chinese proficiency, which may be attributed to the concern that it only measures character knowledge. Instead, LexCHI, like LexTALE and its extensions, is a short vocabulary test to measure language proficiency. The use of standardised lexical tests such as LexCHI will minimise discrepancies in proficiency measurements in future studies involving bilingual populations.

A lexical test such as LexCHI that assesses vocabulary knowledge provides, unfortunately, an assessment of only one dimension within the multidimensional construct of language proficiency. However, assuming vocabulary knowledge as a snapshot of proximate proficiency not only converges with researchers' intuition but also is supported by empirical evidence. In particular, a meta-analysis of 126 studies by Zhang and Zhang ( 2020 ) revealed that vocabulary tests correlated well with L2 speakers' performance in reading comprehension ( r  = .57, p < .01) and listening comprehension ( r  = .56, p < .01). Nevertheless, one may still argue that lexical knowledge expressed as performance in a vocabulary test is not a perfect index of proficiency. Unfortunately, there is currently no existing gold standard test for measuring proficiency because no single test can capture all aspects of language proficiency. What is crucial is to provide researchers with a valid and sensitive measurement that is feasible in experimental settings to estimate language proficiency. Therefore, it is highly advisable to include LexTALE and its extensions in bilingual studies to objectively measure proficiency. Meanwhile, language proficiency questions presented in questionnaires should not be totally avoided. While self-rated L2 proficiency is not the optimal tool for measuring proficiency levels, self-ratings assess participants' perceived proficiency and can provide useful supplementary information of linguistic profiles (Gollan et al., 2012 ). The issue lies in using the subjective measure as the only index of proficiency (see de Bruin, 2019 ; Luk & Bialystok, 2013 ; Prior & van Hell, 2021 , for the call for objectively measuring proficiency apart from subjective assessment).

In addition to developing LexCHI and providing the test freely for researchers, the present study contributes to the field by proposing the normalised Ghent score. As mentioned earlier, the normalised Ghent score is based on the equations introduced by Lemhöfer and Broersma ( 2012 ) and Brysbaert ( 2013 ). It is noteworthy that the normalised Ghent score is a linear equivalent of its predecessors. Therefore, correlation analyses involving the normalised Ghent score presented here do not change when using previous scoring equations. Compared with its predecessors, the normalised Ghent score takes into account the need for a fixed range of possible scores (ranging from −100% to 100%) and the participants' tendency to adopt a guessing strategy. For example, Table 9 shows that a participant providing a unique response to all items (all Yes or all No) will have a score of 0.

As mentioned in the Introduction, Lemhöfer and Broersma ( 2012 ) recommend a cut-off score that can classify participants into different proficiency levels (intermediate vs advanced). To provide a similar cut-off point for LexCHI using the normalised Ghent scores, a receiver operator characteristic (ROC) curve analysis was conducted (Lalkhen & McCluskey, 2008 ; Read et al., 2015 ). The ROC curve analysis has been used widely in clinical areas to evaluate how accurate a diagnostic test is in classifying two populations (e.g., people with or without dyslexia). The ROC curve analysis calculates the area under the curve (AUC) as a measure of discrimination power (e.g., > .80 means good discrimination, maximal value: 1), as well as the sensitivity (e.g., how accurate the test is in identifying people with dyslexia) and the specificity (e.g., how accurate the test is in distinguishing people without dyslexia) of a cut-off value. The results of our ROC curve analysis using the data of the L1 group and the L2 group in Experiment 2 are plotted in Fig. 3 . As can be seen in the left panel, the AUC value is near perfect (i.e., 0.974), which converged with the significant differences in LexCHI scores between the L1 and L2 participants in Experiment 2 . Furthermore, the cut-off score of 70% has high sensitivity (i.e., 0.957) and specificity (i.e., 0.898) values. Footnote 6 In light of these results, if a participant has a score lower than 70%, it is very likely that this participant is not a native speaker of Chinese. This cut-off score can not only be used as a threshold to identify bilinguals with a native-like proficiency of Chinese (e.g., the C2 level in the Common European Framework of Reference for Languages) but also as a filter to screen native Chinses speakers. It is necessary to have a screening test when recruiting native Chinese speakers for an online study because we found out that applying demographic filters in Prolific not always results in obtaining native speakers of a given language (see the Discussion in Experiment 1 and Footnote 2). A possible explanation is that participants might have different interpretations of the term native/first language. Moreover, participants may indicate Chinese as the first language, but they do not obtain native-like level of proficiency as they have been immersed in a non-Chinese-speaking environment from early childhood (i.e., Chinese heritage speakers). Taken together, LexCHI can used as a screening test to identify participants with native or native-like level of proficiency.

figure 3

Receiver operator characteristic (ROC) curve with the 95% confidence intervals (left) and optimal criterion cut-off plot (right) for LexCHI (normalised Ghent scores, %)

Overall, LexCHI is a valuable tool for the research community because it also opens the door to new avenues for further research. For instance, LexCHI can be used in megastudies of Chinese (e.g., Sze et al., 2015 ; Tsang et al., 2018 ; Tse et al., 2016 ) to explore individual differences in first language processing (Andrews et al., 2018 ; Andrews & Lo, 2012 ; Beyersmann et al., 2015 ; see Kidd et al., 2018 for a review). Another potential use is to employ LexCHI as part of a battery test and generate proficiency norms for native speakers (e.g., Dujardin et al., 2022 ). Such a proficiency norm could be used as an indicator for participants with reading difficulties (e.g., dyslexia). In terms of bilingual investigations, proficiency of bilinguals' both languages can be measured with LexCHI and other similar lexical tests, and the two test scores can be used as a composite measure to objectively determine the stronger (i.e., dominant) language of a bilingual (e.g., Gollan et al., 2012 ; Sheng et al., 2014 ; Tomoschuk et al., 2019 ).

In anticipation of future research making the most of LexCHI, we would like to emphasise that researchers should be cautious in choosing LexCHI over other valid Chinese proficiency tests, e.g., a Chinese C-test (Malone & Xu, 2019 ), the Chinese version of the MINT (Multilingual Naming Test, Gollan et al., 2012 ) and the LEXTALE_CH character test (Chan & Chang, 2018 ). As Hulstijn ( 2012 ) points out, the choice of objective tests should be justified in an experiment by the research purpose. For example, in the case of an experiment with a naming task, a standardised picture-naming test like the MINT (Gollan et al., 2012 ) may be a better predictor of experimental performance relative to LexCHI. Similarly, for a study on Chinese character recognition, LEXTALE_CH (Chan & Chang, 2018 ) may outperform LexCHI in accounting for differences in the experimental task. Moreover, a limitation of LexCHI is that the test involves simplified Chinese characters and not traditional Chinese characters. Given this limitation, LexCHI scores of native or non-native Chinese speakers who use traditional Chinese characters (e.g., people living in Hong Kong and Taiwan) should be interpreted with caution. In addition, like other lextale-type vocabulary tests (e.g., Brysbaert, 2013 ), LexCHI was developed for native/non-native adult speakers, and thus it is unclear whether LexCHI is suitable to measure Chinese proficiency of children. To measure Chinese proficiency among non-adult native speaker of Chinese, a recently published vocabulary test may be more appropriate since this freely available test was developed with middle/high-school students who are native speakers of Chinese (Qi et al., 2022 ). In brief, the practice of opting for a proficiency measurement that is convenient without justifications should be avoided because there is no one-size-fits-all measure of proficiency.

To summarise, the present study introduced LexCHI as a valid lexical test to measure Chinese proficiency. LexCHI consists of 40 words and 20 nonwords presented in simplified Chinese and implements an untimed lexical decision task. On average, it takes less than 3 minutes to complete LexCHI. This short test can be easily distributed as a paper-and-pencil test (items available in supplementary materials and on the Open Science Framework, and instructions in English or Chinese are also available on the Open Science Framework, https://osf.io/dh3ty/ ) or included in an online study (all items in png format can also be downloaded from the Open Science Framework). For the scoring of LexCHI, our normalised Ghent score is recommended. Because LexCHI is a Chinese extension of the widely used LexTALE, LexCHI also contributes to the endeavours towards reliably and effectively measuring proficiency in experimental settings, which aims to eliminate between-study variability in proficiency measures. As a useful tool, LexCHI can be further applied in new avenues for further research.

When the ratio of words versus nonwords is not 2:1, the normalised Ghent test score can be calculated using this equation: \(\frac{{\mathrm N}_\text{yes to words}-\frac{{\mathrm N}_\text{words}}{{\mathrm N}_\text{nonwords}}\times{\mathrm N}_\text{yes to nonwords}}{{\mathrm N}_\text{words}}.\)

We used demographic filters in Prolific to recruit participants with English (the L2 group) or Chinese (the L1 group) as their first language. To qualify for the L2 group (naive English speakers), participants were expected to obtain a minimal LexTALE score of 80% which is the recommended 80% cut-off score for English speakers with advanced proficiency (Lemhöfer and Broersma, 2012 ). As mentioned in the Introduction, the LexTALE score was calculated as follows \(\frac{\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{words}+2\ast\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{nonwords}}{80}\) . Similarly, to qualify for the L1 group, participants should obtain a minimal score of 80% for LEXTALE_CH and for LexCHI and the tests scores were computed as follows: character score = \(\frac{\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{characters}+2\ast\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{noncharacters}}{120}\) , LexCHI score = \(\frac{\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{words}+2\ast\mathrm{number}\;\mathrm{of}\;\mathrm{correct}\;\mathrm{nonwords}}{80}\) . Using such criteria, seven participants were swapped across groups in two steps. First, three participants recruited as native Chinese speakers were reassigned to the L2 group because their LexTALE scores were higher than 80% (mean = 96.25%, range = 93.75–100%), whereas their scores of LEXTALE_CH and LexCHI were much lower than 80% (LEXTALE_CH: mean = 54.44%, range = 54.17–63.33%; LexCHI: mean = 57.50%, range = 45–72.5%). Among these three participants, one participant indicated English as first language, and two participants indicated Chinese as first language. Second, four participants recruited as native English speakers were reassigned to the L1 group because their scores of LEXTALE_CH and LexCHI were higher than 80% (LEXTALE_CH: mean = 85.83%, range = 83.3–90.83%; LexCHI: mean = 99.06%, range = 96.25–100%), while their LexTALE scores were lower than 80% (mean = 70.31%, range = 66.25–72.5%). These four participants all indicated Chinese as first language.

Their scores of English LexTALE were very high (mean = 96.93%, range = 91.25–100%), while their scores of LEXTALE_CH and those of LexCHI were relatively low (LEXTALE_CH: mean = 55.78%, range = 42.5–84.17%; LexCHI: mean = 51.83%, range = 38.75–97.5%, computed as the equations in Footnote 2).

These two participants had low scores for LEXTALE_CH, LexCHI and LexTALE (all < 80%, computed as the equations in Footnote 2).

The Cronbach's alpha for was 0.823 for the L1 group and 0.881 for the L2 group. The split-half correlation was 0.728 (Spearman-Brown corrected: 0.842) for the L1 group and 0.764 (Spearman-Brown corrected: 0.866) for the L2 group.

The cut-off score is 85 (%) if using the original equation by Lemhöfer and Broersma ( 2012 ) instead of the normalised Ghent score. Because the normalised Ghent score is linear equivalent of the score obtained with the original equation, the two cut-off scores have the same values of the AUC, sensitivity and specificity. We also note here that native Chinese speakers typically have a LexCHI score above 90% (with low SD) when using the original equation. Similarly, LexTALE scores of 90% are typical for native English speakers. Therefore, LexTALE and LexCHI seem to have similar difficulty levels.

Amenta, S., Badan, L., & Brysbaert, M. (2020). LexITA: A quick and reliable assessment tool for Italian L2 receptive vocabulary size. Applied Linguistics , 1–24. https://doi.org/10.1093/applin/amaa020

Anderson, J. A. E., Mak, L., KeyvaniChahi, A., & Bialystok, E. (2018). The language and social background questionnaire: Assessing degree of bilingualism in a diverse population. Behavior Research Methods, 50 (1), 250–263. https://doi.org/10.3758/s13428-017-0867-9

Article   PubMed   PubMed Central   Google Scholar  

Andrews, S., & Lo, S. (2012). Not all skilled readers have cracked the code: Individual differences in masked form priming. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38 (1), 152–163.  https://doi.org/10.1037/a0024953

Andrews, S., Lo, S., & Xia, V. (2018). Individual differences in automatic semantic priming. Journal of Experimental Psychology: Human Perception and Performance, 43 (5), 1025–1039. https://doi.org/10.1037/xhp0000372

Article   Google Scholar  

Beyersmann, E., Casalis, S., Ziegler, J., & Grainger, J. (2015). Language proficiency and morpho-orthographic segmentation. Psychonomic Bulletin & Review, 22 (4), 1054–1061. https://doi.org/10.3758/s13423-014-0752-9

Brysbaert, M. (2013). LEXTALE_FR: A fast, free, and efficient test to measure language proficiency in French. Psychologica Belgica, 53 (1), 23–37. https://doi.org/10.5334/pb-53-1-23

Brysbaert, M., Lagrou, E., & Stevens, M. (2017). Visual word recognition in a second language: A test of the lexical entrenchment hypothesis with lexical decision times. Bilingualism: Language and Cognition, 20 (3), 530–548.

Cai, Q., & Brysbaert, M. (2010). SUBTLEX-CH: Chinese word and character frequencies based on film subtitles. PloS One, 5 , e10729. https://doi.org/10.1371/journal.pone.0010729

Chan, I. L., & Chang, C. B. (2018). In LEXTALE_CH: A quick, character-based proficiency test for Mandarin Chinese (Vol. 42, pp. 114–130).

Chang, L.-Y., Stafura, J. Z., Rickles, B., Chen, H.-C., & Perfetti, C. A. (2015). Incremental learning of Chinese orthography: ERP indicators of animated and static stroke displays on character form and meaning acquisition. Journal of Neurolinguistics, 33 , 78–95. https://doi.org/10.1016/j.jneuroling.2014.09.001

Chen, L., Perfetti, C. A., Leng, Y., & Li, Y. (2018). Word superiority effect for native Chinese readers and low-proficiency Chinese learners. Applied Psycholinguistics , 39 (6), 1097–1115. Cambridge Core. https://doi.org/10.1017/S0142716418000255

Confucius Institute Headquarters. (2016). HSK Analysis (Level 6) . Higher Education Press.

Google Scholar  

Confucius Institute Headquarters. (2018). Official Examination Papers of HSK (Level 6) . People’s Education Press.

Cromheecke, O., & Brysbaert, M. (2022). A French C-test for language assessment. L’Année Psychologique, 122 (1), 3–28. https://doi.org/10.3917/anpsy1.221.0003

de Bruin, A. (2019). Not all bilinguals are the same: A call for more detailed assessments and descriptions of bilingual experiences. Behavioral Sciences, 9 (3), 33. https://doi.org/10.3390/bs9030033

de Bruin, A., Carreiras, M., & Duñabeitia, J. A. (2017). The BEST dataset of language proficiency. Frontiers in Psychology , 8 . https://doi.org/10.3389/fpsyg.2017.00522

Declerck, M., Wen, Y., Snell, J., Meade, G., & Grainger, J. (2020). Unified syntax in the bilingual mind. Psychonomic Bulletin & Review, 27 (1), 149–154. https://doi.org/10.3758/s13423-019-01666-x

DeFrancis, J. (1984). The Chinese language: Fact and fantasy . University of Hawaii Press.

Book   Google Scholar  

Diependaele, K., Lemhöfer, K., & Brysbaert, M. (2013). The word frequency effect in first-and second-language word recognition: A lexical entrenchment account. The Quarterly Journal of Experimental Psychology, 66 , 843–863. https://doi.org/10.1080/17470218.2012.720994

Article   PubMed   Google Scholar  

Dujardin, E., Jobard, G., Vahine, T., & Mathey, S. (2022). Norms of vocabulary, reading, and spelling tests in French university students. Behavior Research Methods, 54 , 1611–1625. https://doi.org/10.3758/s13428-021-01684-5

Ferré, P., & Brysbaert, M. (2017). Can Lextale-Esp discriminate between groups of highly proficient Catalan-Spanish bilinguals with different language dominances. Behavior Research Methods, 49 (2), 717–723. https://doi.org/10.3758/s13428-016-0728-y

Gollan, T. H., Weissberger, G. H., Runnqvist, E., Montoya, R. I., & Cera, C. M. (2012). Self-ratings of spoken language dominance: A Multilingual Naming Test (MINT) and preliminary norms for young and aging Spanish-English bilinguals. Bilingualism: Language and Cognition, 15 (3), 594–615. https://doi.org/10.1017/S1366728911000332

Gu, Y. (2006). Chinese. In K. Brown (Ed.), Encyclopedia of language and linguistics (Second Edition). Elsevier.

Hornby, A. S. (2018). Oxford advanced learner’s English-Chinese dictionary . The Commercial Press.

Hulstijn, J. H. (2012). The construct of language proficiency in the study of bilingualism from a cognitive perspective. Bilingualism: Language and Cognition, 15 (2), 422–433. https://doi.org/10.1017/S1366728911000678

Izura, C., Cuetos, F., & Brysbaert, M. (2014). Lextale-Esp: A test to rapidly and efficiently assess the Spanish vocabulary size. Psicológica, 35 (1), 49–66.

Khare, V., Verma, A., Kar, B., Srinivasan, N., & Brysbaert, M. (2013). Bilingualism and the increased attentional blink effect: Evidence that the difference between bilinguals and monolinguals generalizes to different levels of second language proficiency. Psychological Research, 77 , 728–737. https://doi.org/10.1007/s00426-012-0466-4

Kidd, E., Donnelly, S., & Christiansen, M. H. (2018). Individual differences in language acquisition and processing. Trends in Cognitive Sciences, 22 (2), 154–169. https://doi.org/10.1016/j.tics.2017.11.006

Lalkhen, A. G., & McCluskey, A. (2008). Clinical tests: Sensitivity and specificity. Continuing Education in Anaesthesia, Critical Care & Pain, 8 (6), 221–223. https://doi.org/10.1093/bjaceaccp/mkn041

Lauro, J., & Schwartz, A. I. (2017). Bilingual non-selective lexical access in sentence contexts: A meta-analytic review. Journal of Memory and Language, 92 , 217–233. https://doi.org/10.1016/j.jml.2016.06.010

Lemhöfer, K., & Broersma, M. (2012). Introducing LexTALE: A quick and valid lexical test for advanced learners of English. Behavior Research Methods, 44 , 325–343. https://doi.org/10.3758/s13428-011-0146-0

Li, P., Sepanski, S., & Zhao, X. (2006). Language history questionnaire: A web-based interface for bilingual research. Behavior Research Methods, 38 , 202–210. https://doi.org/10.3758/BF03192770

Li, P., Zhang, F., Tsai, E., & Puls, B. (2014). Language history questionnaire (LHQ 2.0): A new dynamic web-based research tool. Bilingualism: Language and Cognition, 17 , 673–680. https://doi.org/10.1017/S1366728913000606

Li, X., Zang, C., Liversedge, S. P., & Pollatsek, A. (2015). The role of words in Chinese reading. In The Oxford handbook of reading. (pp. 232–244). Oxford University Press.

Li, C., Wang, M., Davis, J. A., & Guan, C. Q. (2019). The role of segmental and tonal information in visual word recognition with learners of Chinese. Journal of Research in Reading, 42 (2), 213–238. https://doi.org/10.1111/1467-9817.12137

Li, P., Zhang, F., Yu, A., & Zhao, X. (2020). Language history questionnaire (LHQ3): An enhanced tool for assessing multilingual experience. Bilingualism: Language and Cognition , 23 (5), 938–944. Cambridge Core. https://doi.org/10.1017/S1366728918001153

Lüdecke, D., Ben-Shachar, M., Patil, I., Waggoner, P., & Makowski, D. (2021). Performance: An R package for assessment, comparison and testing of statistical models. Journal of Open Source Software, 6 (60), 3139. https://doi.org/10.21105/joss.03139

Luk, G., & Bialystok, E. (2013). Bilingualism is not a categorical variable: Interaction between language proficiency and usage. Journal of Cognitive Psychology, 25 , 605–621. https://doi.org/10.1080/20445911.2013.795574

MacIntyre, P. D., Noels, K. A., & Clément, R. (1997). Biases in self-ratings of second language proficiency: The role of language anxiety. Language Learning, 47 (2), 265–287. https://doi.org/10.1111/0023-8333.81997008

Malone, M., & Xu, Y. (2019). Developing a Chinese C-test for research purposes . The 2019 conference of the American Association for Applied Linguistics.

Marian, V., & Hayakawa, S. (2021). Measuring bilingualism: The quest for a “bilingualism quotient.” Applied Psycholinguistics , 42 (2), 527–548. Cambridge Core. https://doi.org/10.1017/S0142716420000533

Marian, V., Blumenfeld, H. K., & Kaushanskaya, M. (2007). The Language Experience and Proficiency Questionnaire (LEAP-Q): Assessing language profiles in bilinguals and multilinguals. Journal of Speech, Language, and Hearing Research, 50 , 940–967. https://doi.org/10.1044/1092-4388(2007/067)

Myers, J. (2006). Processing Chinese compounds: A survey of the literature. In G. Libben & G. Jarema (Eds.), The representation and processing of compound words (pp. 169–196). Oxford University Press.

Oller, J. W., Jr. (1973). Cloze tests of second language proficiency and what they measure. Language Learning, 23 (1), 105–118. https://doi.org/10.1111/j.1467-1770.1973.tb00100.x

Pelzl, E., Lau, E. F., Guo, T., & DeKeyser, R. M. (2021). Advanced second language learners of Mandarin show persistent deficits for lexical tone encoding in picture-to-word form matching. Frontiers in Communication , 6 . https://doi.org/10.3389/fcomm.2021.689423

Prior, A., & van Hell, J. G. (2021). Including bilingual participants in cognitive research. In Bilingualism across the lifespan: Opportunities and challenges for cognitive research in a global society (pp. 47–65). Routledge.

Qi, S., Teng, M. F., & Fu, A. (2022). LexCH: A quick and reliable receptive vocabulary size test for Chinese learners. Applied Linguistics Review . https://doi.org/10.1515/applirev-2022-0006

Quick Placement Test. (2001). Oxford University Press.

R Core Team. (2021). R: A language and environment for statistical computing .

Read, J. P., Haas, A. L., Radomski, S., Wickham, R. E., & Borish, S. E. (2015). Identification of hazardous drinking with the young adult alcohol consequences questionnaire: Relative operating characteristics as a function of gender. Psychological Assessment, 28 (10), 1276–1289. https://doi.org/10.1037/pas0000251

Rizopoulos, D. (2006). ltm: An R package for latent variable modeling and item response theory analyses. Journal of Statistical Software, 17 (5), 1–25. https://doi.org/10.18637/jss.v017.i05

Şahin, A., & Anil, D. (2017). The effects of test length and sample size on item parameters in item response theory. Educational Sciences: Theory & Practice, 17 (1), 321–335. https://doi.org/10.12738/estp.2017.1.0270

Salmela, R., Lehtonen, M., Garusi, S., & Bertram, R. (2021). Lexize: A test to quickly assess vocabulary knowledge in Finnish. Scandinavian Journal of Psychology, 62 (6), 806–819. https://doi.org/10.1111/sjop.12768

Sheng, L., Lu, Y., & Gollan, T. H. (2014). Assessing language dominance in Mandarin–English bilinguals: Convergence and divergence between subjective and objective measures. Bilingualism: Language and Cognition , 17 (2), 364–383. Cambridge Core. https://doi.org/10.1017/S1366728913000424

Sun, C. C., Hendrix, P., Ma, J., & Baayen, R. H. (2018). Chinese lexical database (CLD). Behavior Research Methods, 50 (6), 2606–2629. https://doi.org/10.3758/s13428-018-1038-3

Surrain, S., & Luk, G. (2019). Describing bilinguals: A systematic review of labels and descriptions used in the literature between 2005–2015. Bilingualism: Language and Cognition , 22 (2), 401–415. Cambridge Core. https://doi.org/10.1017/S1366728917000682

Sze, W. P., Yap, M. J., & Liow, S. J. R. (2015). The role of lexical variables in the visual recognition of Chinese characters: A megastudy analysis. The Quarterly Journal of Experimental Psychology, 68 , 1541–1570. https://doi.org/10.1080/17470218.2014.985234

Tokowicz, N., & Kroll, J. F. (2007). Number of meanings and concreteness: Consequences of ambiguity within and across languages. Language and Cognitive Processes, 22 , 727–779. https://doi.org/10.1080/01690960601057068

Tomoschuk, B., Ferreira, V. S., & Gollan, T. H. (2019). When a seven is not a seven: Self-ratings of bilingual language proficiency differ between and within language populations. Bilingualism: Language and Cognition, 22 (3), 516–536. https://doi.org/10.1017/S1366728918000421

Tremblay, A. (2011). Proficiency assessment standards in second language acquisition research:“Clozing” the gap. Studies in Second Language Acquisition, 33 (3), 339–372. https://doi.org/10.1017/S0272263111000015

Tsang, Y.-K., Huang, J., Lui, M., Xue, M., Chan, Y.-W.F., Wang, S., & Chen, H.-C. (2018). MELD-SCH: A megastudy of lexical decision in simplified Chinese. Behavior Research Methods, 50 (5), 1763–1777. https://doi.org/10.3758/s13428-017-0944-0

Tse, C.-S., Yap, M. J., Chan, Y.-L., Sze, W.-P., Shaoul, C., & Lin, D. (2016). The Chinese lexicon project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words. Behavior Research Methods . https://doi.org/10.3758/s13428-016-0810-5

Van de Putte, E., De Baene, W., Price, C. J., & Duyck, W. (2018). Neural overlap of L1 and L2 semantic representations across visual and auditory modalities: A decoding approach. Neuropsychologia, 113 , 68–77. https://doi.org/10.1016/j.neuropsychologia.2018.03.037

van Hell, J. G., & Tanner, D. (2012). Second language proficiency and cross-language lexical activation. Language Learning, 62 , 148–171. https://doi.org/10.1111/j.1467-9922.2012.00710.x

van Hell, J. G., & Tokowicz, N. (2010). Event-related brain potentials and second language learning: Syntactic processing in late L2 learners at different L2 proficiency levels. Second Language Research, 26 , 43–74. https://doi.org/10.1177/0267658309337637

van Heuven, W. J. B., & Dijkstra, T. (2010). Language comprehension in the bilingual brain: FMRI and ERP support for psycholinguistic models. Brain Research Reviews, 64 , 104–122. https://doi.org/10.1016/j.brainresrev.2010.03.002

van Heuven, W. J. B., Mandera, P., Keuleers, E., & Brysbaert, M. (2014). SUBTLEX-UK: A new and improved word frequency database for British English. The Quarterly Journal of Experimental Psychology, 67 (6), 1176–1190. https://doi.org/10.1080/17470218.2013.850521

Wen, Y., & van Heuven, W. J. B. (2017a). Chinese translation norms for 1429 English words. Behavior Research Methods, 49 , 1006–1019. https://doi.org/10.3758/s13428-016-0761-x

Wen, Y., & van Heuven, W. J. B. (2017b). Non-cognate translation priming in masked priming lexical decision experiments: A meta-analysis. Psychonomic Bulletin & Review, 24 , 879–886. https://doi.org/10.3758/s13423-016-1151-1

Wen, Y., & van Heuven, W. J. B. (2018). Limitations of translation activation in masked priming: Behavioural evidence from Chinese-English bilinguals and computational modelling. Journal of Memory and Language, 101 , 84–96. https://doi.org/10.1016/j.jml.2018.03.004

Wen, Y., Filik, R., & van Heuven, W. J. B. (2018). Electrophysiological dynamics of Chinese phonology during visual word recognition in Chinese-English bilinguals. Scientific Reports, 8 , 6869. https://doi.org/10.1038/s41598-018-25072-w

Wen, Y., Mirault, J., & Grainger, J. (2021). The transposed-word effect revisited: The role of syntax in word position coding. Language, Cognition and Neuroscience, 36 (5), 668–673. https://doi.org/10.1080/23273798.2021.1880608

Wu, Y. J., & Thierry, G. (2010). Chinese-English bilinguals reading English hear Chinese. The Journal of Neuroscience, 30 , 7646–7651. https://doi.org/10.1523/JNEUROSCI.1602-10.2010

Zhang, H. (2018). 研究用汉语水平分级测试方法对研究结果的影响 [The influence of different L2 Chinese proficiency measurements on the results of CSL research]. 语言教学与研究 [Language Teaching and Linguistic Studies] , 194 (6), 14–23.

Zhang, S., & Zhang, X. (2020). The relationship between vocabulary knowledge and L2 reading/listening comprehension: A meta-analysis. Language Teaching Research , 1362168820913998. https://doi.org/10.1177/1362168820913998

Zhang, H., Jiang, Y., & Yang, J. (2020). Investigating the influence of different L2 proficiency measures on research results. SAGE Open, 10 (2), 1–20. https://doi.org/10.1177/2158244020920604

Zhou, C., & Li, X. (2022). LextPT: A reliable and efficient vocabulary size test for L2 Portuguese proficiency. Behavior Research Methods, 54 , 2625–2639. https://doi.org/10.3758/s13428-021-01731-1

Download references

Acknowledgements

The authors thank Marc Brysbaert for helpful discussions and Bernadette Loi Kwang Yee and Lee Soon Tat for their help with data collection.

Author information

Authors and affiliations.

Department of Applied Linguistics, Xi’an Jiaotong-Liverpool University, Suzhou, China

Laboratoire de Psychologie Cognitive, Aix-Marseille University and Centre National de la Recherche Scientifique, Marseille, France

School of Psychology, University of Nottingham, University Park, Nottingham, NG7 2RD, UK

Yicheng Qiu & Walter J. B. van Heuven

School of Psychology, University of Nottingham Malaysia, Semenyih, Malaysia

Christine Xiang Ru Leong

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Walter J. B. van Heuven .

Ethics declarations

Conflict of interest.

The authors report no conflict of interest.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open practices statement

The materials are provided in the Appendices . LexCHI (items, instructions, answer key) is available on the Open Science Framework ( https://osf.io/dh3ty/ ). We cannot publicly share the material used in the cloze test because it was adapted from commercial tests. None of the data of the experiments reported here is available, and none of the experiments was preregistered.

Appendix 1: Stimuli of LexCHI in Experiment 1

The original set of 120 items were used in Experiment 1 .

Sixty items of two-character Chinese word (with English translations in brackets):

徜徉 (wander), 踉跄 (stagger), 襁褓 (swaddling), 涟漪 (ripple), 踌躇 (hesitate), 惆怅 (melancholy), 璀璨 (luminous), 蹉跎 (wasted), 褴褛 (ragged), 囫囵 (whole), 缥缈 (ethereal), 俸禄 (salary), 狰狞 (ferocious), 匍匐 (creep), 滂沱 (torrential), 蹒跚 (stumble), 鹧鸪 (francolin), 恍惚 (trance), 憧憬 (longing), 朦胧 (hazy), 徘徊 (linger), 傀儡 (puppet), 蹂躏 (ravage), 瑕疵 (defect), 漩涡 (swirl), 鞠躬 (bow), 邋遢 (sloppy), 憔悴 (haggard), 侏儒 (dwarf), 烹饪 (cooking), 瘫痪 (paralysis), 亵渎 (profanity), 咳嗽 (cough), 吩咐 (order), 辉煌 (glorious), 俘虏 (captive), 唠叨 (nag), 脂肪 (fat), 羡慕 (envy), 瘟疫 (plague), 悬崖 (cliff), 颤抖 (tremble), 痊愈 (recover), 栅栏 (fence), 贿赂 (bribe), 贪婪 (greedy), 慷慨 (generous), 寂寞 (lonely), 锻炼 (workout), 谨慎 (cautious), 矛盾 (contradiction), 卑鄙 (despicable), 喉咙 (throat), 祈祷 (prayer), 愤怒 (anger), 骄傲 (proud), 淘汰 (eliminate), 熟悉 (familiar), 掌握 (grasp), 问题 (problem)

Sixty nonword items:

缤摹, 鳟踞, 亦筷, 蚯践, 粤抄, 巅妾, 锉洽, 猥砂, 诠疮, 拮娥, 寨禧, 炭鹑, 磷祥, 晴榻, 裳妊, 抒泵, 葵娠, 荟锐, 倚籍, 暮徙, 窍俭, 赠瞻, 霰驻, 鹌棋, 蜥浆, 谜蜍, 耕韵, 逸滞, 狐镀, 咸裕, 馅俐, 呻椒, 罕寝, 炫萃, 鸣蜓, 恒馁, 荧瓷, 契砣, 拱疹, 涵浅, 衔侈, 颁昧, 抹峨, 歪衍, 磁窿, 敞螺, 漆蝠, 濒愧, 嘲塘, 寰港, 晶嫩, 蝉税, 洪噪, 禁蕾, 糖描, 痴策, 辩菜, 舒震, 糊悲, 身候

Appendix 2: Stimuli in the translation task

Sixty items were presented in the translation task (Task 4) in Experiments 1 & 2 .

Thirty English words were used for in the English-to-Chinese translation:blanket, dignity, quilt, enquiry, nerve, curse, palm, pyjamas, bandage, sequence, misery, infection, pest, shelf, torch, missile, mayor, lawn, friction, lightning, stationery, appendix, fountain, tomb, drawer, vowel, foam, exception, beverage, dawn

Thirty Chinese words were used in the Chinese-to-English translation (with English translations in brackets):

干旱 (drought), 绝望 (despair), 屠夫 (butcher), 代词 (pronoun), 烟囱 (chimney), 下巴 (chin), 橡皮 (rubber), 沙漠 (desert), 来源 (source), 程度 (extent), 冰山 (iceberg), 背心 (vest), 乞丐 (beggar), 补偿 (compensation), 潮汐 (tide), 拳头 (fist), 难民 (refugee), 燃料 (fuel), 诗人 (poet), 无知 (ignorance), 花费 (expenditure), 商人 (merchant), 物质 (substance), 小丑 (clown), 珍珠 (pearl), 崇拜 (worship), 直觉 (instinct), 虫子 (worm), 裁缝 (tailor), 短缺 (shortage)

Appendix 3: Items of LexCHI

The 60 items are included in LexCHI.

Forty items of two-character Chinese word (with English translations in brackets):

烹饪 (cooking), 喉咙 (throat), 襁褓 (swaddling), 瑕疵 (defect), 咳嗽 (cough), 囫囵 (whole), 问题 (problem), 璀璨 (luminous), 祈祷 (prayer), 瘟疫 (plague), 亵渎 (profanity), 朦胧 (hazy), 愤怒 (anger), 匍匐 (creep), 蹂躏 (ravage), 涟漪 (ripple), 羡慕 (envy), 唠叨 (nag), 滂沱 (torrential), 痊愈 (recover), 骄傲 (proud), 瘫痪 (paralysis), 卑鄙 (despicable),恍惚 (trance), 狰狞 (ferocious), 蹉跎 (wasted), 憧憬 (longing), 淘汰 (eliminate), 贿赂 (bribe), 脂肪 (fat), 吩咐 (order), 掌握 (grasp), 邋遢 (sloppy), 俸禄 (salary), 熟悉 (familiar),寂寞 (lonely), 傀儡 (puppet), 惆怅 (melancholy), 慷慨 (generous), 徘徊 (linger)

Twenty nonword items:

鹌棋, 颁昧, 亦筷, 寰港, 裳妊, 呻椒, 糖描, 诠疮, 涵浅, 晶嫩, 抒泵, 缤摹, 锉洽, 抹峨, 咸裕, 晴榻, 契砣, 漆蝠, 巅妾, 磷祥

Appendix 4: Additional results in Experiment 2

Correlations in Experiment 2 (Table 10 )

Detailed results of the regression analysis in Experiment 2

Following the suggestion of one reviewer, an additional regression analysis was conducted in which the LEXTALE_CH scores were entered in the first step to predict accuracy of the cloze test. Like the regression analysis reported at the end of Experiment 2 (see Table 11 ), we first address the issue of the collinearity by orthogonalising the LexCHI scores in a linear model in which the LexCHI scores were predicted by the LEXTALE_CH scores. Then, in the regression analysis, the LEXTALE_CH scores were entered in the first step to predict accuracy of the cloze test, and the LexCHI scores were entered in the second step. As can be seen from Table 12 , in the first step, LEXTALE_CH was a significant predictor. In the second step, LEXTALE_CH and LexCHI were both significant predictors. The LexCHI scores uniquely accounted for almost 10% of the variance in the cloze test accuracy as indicated by the increase of R 2 in the second step.

Distributions of LexCHI scores in Experiment 2

In developing the Italian extension of LexTALE, Amenta et al. ( 2020 ) asked non-native speakers in the validation study to report their L2 proficiency according the CEFR (Common European Framework of Reference) levels and reported the distributions of LexITA scores over CEFR levels. We also asked the L2 group in Experiment 2 to rate the CEFR proficiency of Chinese reading using the self-assessment grid provided by the CEFR. The distributions of LexCHI scores over self-rated CEFR levels are plotted Fig. 4 in together with the L1 group. In addition, we asked these participants to rate their proficiency of Chinese reading based on the HSK levels. Figure 5 plots the distributions of LexCHI scores over the self-rated HSK levels as well as the L1 group's score.

figure 4

Violin plots showing the distribution of LexCHI scores over self-reported proficiency levels in the Common European Framework of Reference (CEFR). The dots within boxplots indicate the means. The CEFR has three tiers of proficiency levels: A1 (Beginner) & A2 (Elementary) as Basic Users, B1 (Intermediate) & B2 (Upper intermediate) as Independent Users, and C1 (Advanced) & C2 (Proficient) as Proficient Users

figure 5

Violin plots showing the distribution of LexCHI scores over proficiency levels in Hanyu Shuiping Kaoshi (HSK). The six levels in HSK correspond to A1, A2, B1, B2, C1 and C2 in the CEFR

The question for rating the CEFR levels in terms of Chinese reading was framed as follows: Select the description that best summarises your ability to read Chinese:

I can understand familiar names, words and very simple sentences, for example on notices and posters or in catalogues.

I can read very short, simple texts. I can find specific, predictable information in simple everyday material such as advertisements, prospectuses, menus and timetables and I can understand short simple personal letters.

I can understand texts that consist mainly of high-frequency, everyday or job-related language. I can understand the description of events, feelings and wishes in personal letters.

I can read articles and reports concerned with contemporary problems in which the writers adopt particular attitudes or viewpoints. I can understand contemporary literary prose.

I can understand long and complex factual and literary texts, appreciating distinctions of style. I can understand specialised articles and longer technical instructions, even when they do not relate to my field.

I can read with ease virtually all forms of the written language, including abstract, structurally or linguistically complex texts such as manuals, specialised articles and literary works.

The question for rating the HSK levels of Chinese proficiency was framed as follows: Select the description that best summarises your understanding and use of Chinese:

I can understand and use very simple Chinese phrases, meet basic needs for communication and possess the ability to further my Chinese language studies.

I have an excellent grasp of basic Chinese and can communicate in simple and routine tasks requiring a simple and direct exchange of information on familiar and routine matters.

I can communicate in Chinese at a basic level in daily, academic and professional life. I can manage most communication in Chinese when travelling in China.

I can converse in Chinese on a wide range of topics and are able to communicate fluently with native Chinese speakers.

I can read Chinese newspapers and magazines, enjoy Chinese films and plays and give a full-length speech in Chinese.

I can easily comprehend written and spoken information in Chinese and can effectively express myself in Chinese, both orally and on paper.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Wen, Y., Qiu, Y., Leong, C.X.R. et al. LexCHI: A quick lexical test for estimating language proficiency in Chinese. Behav Res 56 , 2333–2352 (2024). https://doi.org/10.3758/s13428-023-02151-z

Download citation

Accepted : 24 May 2023

Published : 05 July 2023

Issue Date : March 2024

DOI : https://doi.org/10.3758/s13428-023-02151-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Chinese vocabulary test
  • Language proficiency
  • Bilingualism
  • Find a journal
  • Publish with us
  • Track your research

Northwestern Scholars Logo

  • Help & FAQ

The Language Experience and Proficiency Questionnaire (LEAP-Q): Ten years later

  • Communication Sciences and Disorders

Research output : Contribution to journal › Review article › peer-review

The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format. The LEAP-Q is used by researchers across various disciplines (Psychology, Neuroscience, Linguistics, Education, Communication Sciences & Disorders, etc.) to provide a comprehensive description of their bilingual participants, to substantiate a division of bilinguals into groups (e.g., early vs. late bilinguals), and to screen participants for adequate or threshold levels of language proficiency. Best practices for using the LEAP-Q include administration of the full questionnaire, consideration of acquisition and history of language use together with self-ratings of proficiency, and supplementation of self-reported data with objective language measures whenever possible. The LEAP-Q can be downloaded at no cost at https://bilingualism.northwestern.edu/leapq/.

Original languageEnglish (US)
Pages (from-to)945-950
Number of pages6
Journal
Volume23
Issue number5
DOIs
StatePublished - Nov 2020
  • questionnaire
  • self-report

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Access to Document

  • 10.1017/S1366728919000038

Other files and links

  • Link to publication in Scopus
  • Link to the citations in Scopus

Fingerprint

  • Diseases Medicine and Dentistry 100%
  • Supplementation Medicine and Dentistry 100%
  • Self Concept Medicine and Dentistry 100%
  • Health Care Cost Medicine and Dentistry 100%
  • Division Medicine and Dentistry 100%
  • Proficiency Arts and Humanities 100%
  • Neuroscience Psychology 100%
  • Multi-lingual Arts and Humanities 14%

T1 - The Language Experience and Proficiency Questionnaire (LEAP-Q)

T2 - Ten years later

AU - Kaushanskaya, Margarita

AU - Blumenfeld, Henrike K.

AU - Marian, Viorica

N1 - Funding Information: Acknowledgements. The present project was supported by NIDCD Grant R01 DC011750 to Margarita Kaushanskaya, and Grant NICHD 2R01 HD059858 to Viorica Marian. The authors wish to express gratitude to all of the researchers who have adopted, adapted, and translated the LEAP-Q, and to the participants who provided the norming data. We would also like to thank everyone who has assisted with data coding, data analyses, data management, and email correspondence related to the LEAP-Q. Publisher Copyright: Copyright © 2019 Cambridge University Press.

PY - 2020/11

Y1 - 2020/11

N2 - The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format. The LEAP-Q is used by researchers across various disciplines (Psychology, Neuroscience, Linguistics, Education, Communication Sciences & Disorders, etc.) to provide a comprehensive description of their bilingual participants, to substantiate a division of bilinguals into groups (e.g., early vs. late bilinguals), and to screen participants for adequate or threshold levels of language proficiency. Best practices for using the LEAP-Q include administration of the full questionnaire, consideration of acquisition and history of language use together with self-ratings of proficiency, and supplementation of self-reported data with objective language measures whenever possible. The LEAP-Q can be downloaded at no cost at https://bilingualism.northwestern.edu/leapq/.

AB - The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format. The LEAP-Q is used by researchers across various disciplines (Psychology, Neuroscience, Linguistics, Education, Communication Sciences & Disorders, etc.) to provide a comprehensive description of their bilingual participants, to substantiate a division of bilinguals into groups (e.g., early vs. late bilinguals), and to screen participants for adequate or threshold levels of language proficiency. Best practices for using the LEAP-Q include administration of the full questionnaire, consideration of acquisition and history of language use together with self-ratings of proficiency, and supplementation of self-reported data with objective language measures whenever possible. The LEAP-Q can be downloaded at no cost at https://bilingualism.northwestern.edu/leapq/.

KW - LEAP-Q

KW - questionnaire

KW - self-report

KW - validation

UR - http://www.scopus.com/inward/record.url?scp=85065247435&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85065247435&partnerID=8YFLogxK

U2 - 10.1017/S1366728919000038

DO - 10.1017/S1366728919000038

M3 - Review article

AN - SCOPUS:85065247435

SN - 1366-7289

JO - Bilingualism

JF - Bilingualism

Academia.edu no longer supports Internet Explorer.

To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to  upgrade your browser .

Enter the email address you signed up with and we'll email you a reset link.

  • We're Hiring!
  • Help Center

paper cover thumbnail

Relationship of English language proficiency to the students' academic performance

Profile image of Cres OL

Related Papers

Prism Journal

The concept of second language acquisition (SLA) is a major concern in the field of second language teaching, especially in bilingual or multilingual classrooms. This study determined the relationship between English proficiency and academic performance of the students in the College of Business and Accountancy (CBA), College of Education (CED), and College of Engineering and Architecture (CEA) at a state university in Dumaguete City. The data from 862 tertiary students were retrieved from the database of the university registrar's office. The results support the claim that English proficiency is significantly related to academic performance. However, such consistency is absent on the data derived from the CEA students. This finding implies that English proficiency may be less important in improving academic performance among CEA students than CED and CBA students. This finding suggests a possible mismatch between pre-admission English proficiency and the English proficiency required for learning highly technical academic requirements of an engineering course. The CEA curriculum may be reviewed for possible revision that could enhance the academic English proficiency as well as academic performance among the engineering students.

thesis questionnaire about language proficiency

International Journal for TESOL Studies

Dan Henry Gonzales

In English language teaching (ELT) circles, the English used in the Philippines is traditionally labelled as ESL. However, that label, including the attitudes and practices that it brings, does not accurately capture the complex language situation in the country. In this article, we argue that English education in the Philippines does not account for the multilingual and linguistically diverse context of use in the country. We argue that monolingual perspectives and approaches in ELT continue to dominate the field. Issues of language variation, the existence of varieties of English, and the reality of translanguaging practices are not addressed. From curriculum design, assessment policies and practices, to classroom teaching, ELT in the Philippines persists in taking a one-size-fits-all approach that only privileges so-called 'native speaker' norms. English in the Philippines is best characterized as Englishes. Although an educated Philippine variety of English exists (Bautista, 2008), it cannot be concluded that this is the variety of widespread use across the archipelago. The language situation of the Philippines is complex, thereby eluding labels and categories that only betray a nativespeakerism perspective and a monolingual bias. We argue that Englishes in the Philippines are more appropriately approached as ENL, ESL, and EFL. The implications of these categorizations on ELT in the country are discussed in this paper.

The paradox of Philippine education and education …

Yvette Torres

ENGLISH FOR ASEAN INTEGRATION: POLICIES AND PRACTICES IN THE REGION, edited by Terance W. Bigalke and Salbrina Sharbawi, Brunei: Universiti Brunei Dharussalam, 2015 (pp. 110-119)

Isabel Pefianco Martin

International Journal of English Language Studies

Andrea Lopez

The Philippines is considered one of the largest English-speaking nations in the world. In fact, English is one of the official languages in the Philippines. But throughout the years, a gradual deterioration in English language proficiency can be observed among Filipinos based on the EF English Proficiency Index, Test of English for International Communication (TOEIC), and the average score of Filipino IELTS takers. This paper aims to discuss the different factors behind the descending Filipino English proficiency, such as learners’ motivation, parental involvement, learning environment, teaching strategies, comprehensive input, learners’ socio-economic status, and learners&#39; age. Several articles have been reviewed and examined for the authors to come up with the following conclusions: 1) Continuous practice and usage of the English language could help us further cement our economy; 2) there is still a huge room for improvement in terms of grammar which could also help alleviate...

Rezei Tadako

This study correlated the probable factors affecting English proficiency and the English proficiency of Grades V and VI pupils of Central Philippine University (CPU). It further examined the similarities and differences of the respondents according to the factors listed, and their association to the English proficiency of the pupils. Chi-square, Cramer’s V, and Gamma analyses were used to analyze data collected from 174 Grades V and VI pupils from the CPU Elementary Department who were subjected to a self-administered one shot-survey. The findings showed that female pupils are more proficient than male pupils, as shown in the results of the English proficiency test administered by the researchers. Reading frequency also bears on the English proficiency of the pupils. Other factors such as age, sex, income of parents, educational attainment of parents, study hours, perception, extent of television viewing, text messaging, extent of use of online social networks and computer games, and reading frequency.

Safary Wa-Mbaleka

In the Philippines, English is an official language. It is used as a medium of instruction. Yet, most Filipinos learn English as an additional language because they have various mother tongues. The diverse linguistic backgrounds of Filipinos create a challenge for both English teachers and learners of English as an additional language. As all linguists continue to think of innovative ways to teach English to speakers of Filipino languages, the Philippine context presents some particular factors that must be taken into consideration today. This study explored the strengths, weaknesses, opportunities, and threats (SWOT) to the teaching of English as an additional language in the Philippines. Results show that English teaching and learning in the Philippines has been successful in general but has areas for improvement. Results are based on data collected from eight individual written interviews conducted with English educators, linguistic leaders and experts of Teaching English to speakers of Other Languages in the Philippines. The study also presents some practical recommendations for English language education policies in the multi-lingual societies of the Philippines.

Reconceptualizing English Education in a Multilingual Society: English in the Philippines

Galaxy International Interdisciplinary Research Journal

Genesis Genelza

English proficiency needs to be essential for academic achievement. The quality of English instruction in high schools is closer to that of a primary school, which is insufficient to enable students to advance academically on their own. Thus, English proficiency is considered as part of the weighted process. This is a descriptive correlational study designed to find a significant relationship between junior high school students' English proficiency and academic achievement. Results showed that there is indeed a great correlation between the two variables in the study with a p-value of 0.00776, hence the null hypothesis was hereby rejected. With this, the findings could serve as guidance for English teachers in terms of data on the degree of students' language proficiency and academic success. Students are likewise urged to look into other elements such as learning styles, instructional approaches, education-learning technologies, exposure to the English language, as well as other contextual and environmental factors that may influence students' academic achievement and English proficiency. Following these recommendations would not only benefit schools by satisfying the demands and expectations of students while simultaneously improving the institutions' overall retention and school academic rates.

Abigail Gomez

This study yielded significant results in assessing the proficiency level of the students where reading and listening examinations became difficult for the participants. Therefore, continuous remediation must be done by the University to expose the students to various accents of English and to provide more reading power tasks that could stimulate their critical thinking, which can be useful in their daily lives.

Loading Preview

Sorry, preview is currently unavailable. You can download the paper by clicking the button above.

RELATED PAPERS

Psychology and Education: A Multidisciplinary Journal

Psychology and Education

Bilingual and Multilingual Education

International Journal of Research Studies in Education

International Journal of Bilingual Education and Bilingualism

Catherine Young

14th International Nakem Conference, with the theme “Diversity and Inclusion in Mother Language Education”

Firth McEachern

Aerone Morante

Clark Dominic Alipasa

Brian Vasquez

University of San Carlos Graduate Journal Volume XXIV

Dr. John Paul R Gorgonio

International Journal of Research Studies in Language Learning

Jeffrey Ancheta

Asia Pacific Journal of Multidisciplinary Research

Research and Statistics Center

alejandro bernardo

Journal of Southeast Asian Studies

Val Dunn Panaguiton

Jonathan Adanza

Lea Mae A. Ladonga

Prixie Cruz , Ahmar Mahboob

English Language Teaching

mitra farsi

Martin, Isabel Pefianco. 2006. Language in Philippine Classrooms: Enfeebling or Enabling? Asian Englishes Journal. 9 (2), 48-66.

Ferdinand Cortez

Ryan Glenn Conda

  •   We're Hiring!
  •   Help Center
  • Find new research papers in:
  • Health Sciences
  • Earth Sciences
  • Cognitive Science
  • Mathematics
  • Computer Science
  • Academia ©2024
  • Academic Performance

English Language Proficiency and Academic Performance of Philippine Science High School Students

Robelle Racca at Department of Education

  • Department of Education
  • This person is not on ResearchGate, or hasn't claimed this research yet.

Research paradigm.

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations
  • Merna Jane Ignacio
  • Nelson Julhamid
  • Masnona Asiri

Pfuurai Chimbunde

  • Julius I Malvar

Felino B Sidocon

  • Salman Al-Gburi
  • G Wael Suleiman

Imad Ibrahim Dawood Alsaedy

  • Thabang Francis Khalema
  • Mohaeka Gabriel Raselimo
  • Desiree Dawn P.Justol

American Journals Publishing Center -Publish Article In the United States

  • Noel Pugoso
  • Jeremiah Calisang

David Olugbade Fakeye

  • Afsaneh Baharloo
  • Seyyed Mohammad

Seyyed Mohammad Ali Soozandehfar

  • Judith W. Rosenthal
  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up

IMAGES

  1. (PDF) The Language Experience and Proficiency Questionnaire (LEAP-Q

    thesis questionnaire about language proficiency

  2. (PDF) English Language Competence Questionnaire

    thesis questionnaire about language proficiency

  3. sample-questionnaire-for-english-proficiency-thesis.pdf

    thesis questionnaire about language proficiency

  4. The Language Experience and Proficiency Questionnaire (LEAP-Q; Marian

    thesis questionnaire about language proficiency

  5. [PDF] The Language Experience and Proficiency Questionnaire (LEAP-Q

    thesis questionnaire about language proficiency

  6. English Language Proficiency Assessment Sample

    thesis questionnaire about language proficiency

VIDEO

  1. Thesis Instrument (Questionnaire) Distribution Instruction

  2. What is questionnaire?🤔📔#thesis #questionnaires #youtubeshorts

  3. Language Quality Report

  4. How language can strengthen your resilience

  5. Objective Thesis

  6. Thesis and Semester project Presentation |thesis defense|የምረቃ ወረቀት ስለማቅረብ| #በአማረኛ

COMMENTS

  1. PDF Language Experience and Proficiency Questionnaire

    Fig. 1. The Language Experience and Proficiency questionnaire (LEAP-Q) is a self-report measure that can capture language profiles of multilingual speakers through self-reported proficiency, learning milestones, immersion duration, contributors to language learning/ acquisition, extent of exposure and self-reported foreign accent,

  2. LEAP-Questionnaire « Bilingualism and Psycholinguistics Research Laboratory

    The LEAP-Q Language Experience and proficiency Questionnaire: Ten years later. [pdf], by Marian, Blumenfeld, & Kaushanskaya (2020), published in Bilingualism: Language and Cognition, 23 (5), 945-950. LEAP-Q SUPPLEMENT, explaining how to export data from the filled out questionnaire. [pdf]

  3. PDF The Language Experience and Proficiency Questionnaire (LEAP-Q

    Purpose: To develop a reliable and valid questionnaire of bilingual language status with predictable relationships between self-reported and behavioral measures. Method: In Study 1, the internal validity of the Language Experience and Proficiency Questionnaire (LEAP-Q) was established on the basis of self-reported data from 52 multilingual ...

  4. PDF Language Experience and Proficiency Questionnaire (LEAP-Q)

    Northwestern Bilingualism & Psycholinguistics Research Laboratory. Marian, Blumenfeld, & Kaushanskaya (2007). The Language Experience and Proficiency Questionnaire (LEAP-Q): Assessing language profiles in bilinguals and multilinguals. Journal of Speech Language and Hearing Research, 50 (4), 940-967. Adapted to pencil-and-paper version by ...

  5. PDF Factors Affecting Language Proficiency of English Language ...

    language proficiency. Two questions guided this study: (1) What do ELLs perceive to be the important factors contributing to their language learning? (2) What are the relationships between these factors and ELLs' language proficiency? A total of 606 EL Ls who studied at English language institutes in the United States constituted the final sample.

  6. PDF A Study of Language Use, Language Attitudes and Identities in Two

    This thesis investigates patterns of language use, language attitudes, identity and attitudes towards learning Arabic, within two Arabic speaking communities in the UK. An important motivation is to investigate a rarely researched group to explore language practices, language proficiency, language attitudes and identity within this group.

  7. PDF Impact of English Proficiency on Academic Performance of ...

    language proficiency and multilingualism on the academic performance of international students enrolled in a four-year university located in north central Louisiana in the United States. Data were collected through a self-reported questionnaire from 59 students who were in their sophomore, junior or senior year of college.

  8. The Language Experience and Proficiency Questionnaire (LEAP-Q): Ten

    Abstract. The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format.

  9. LexCHI: A quick lexical test for estimating language proficiency in

    A prominent methodological issue in cognitive research on bilingualism is the lack of consistency in measuring second language (L2) proficiency. To reduce the inconsistency in L2 proficiency measurements, brief and valid vocabulary tests have been developed as an objective measure of proficiency in a variety of languages (e.g., English, French, Spanish). Here, we present LexCHI, a valid ...

  10. PDF Factors influencing the level of Filipino language proficiency among

    Hakuta's (2005) Survey Questionnaire Affecting Language Proficiency Acquisition [7]. This measured the level of students' perception on factors that affect their oral language proficiency in Filipino. Students test performance in Filipino was used as measure of their language proficiency level. 2.5. Data Collection

  11. (PDF) Language proficiency

    the amount of language and cultural preparation and support provided, language. Language pro ficiency 95. learning instruction, the setting, the depth of critical re flection promoted, the ...

  12. The Relationship Between English Language Proficiency and Academic

    English language learner students have attained a level of language proficiency in the target language at which they can successfully perform academic tasks using the target language. Below this threshold, students may be able to perform the academic task, but lack the language proficiency to perform the task successfully in the target language ...

  13. PDF Proficiency and Confidence Levels of English Language Students in

    of the English language who are regarded as fluent but not necessarily proficient. In this study, English proficiency is associated with one's skill in speaking, reading, and writing using English as the communication tool. To be proficient, one must be adept in these three macro skills. Since English is a global language,

  14. The language experience and proficiency questionnaire (LEAP-Q

    Purpose: To develop a reliable and valid questionnaire of bilingual language status with predictable relationships between self-reported and behavioral measures. Method: In Study 1, the internal validity of the Language Experience and Proficiency Questionnaire (LEAP-Q) was established on the basis of self-reported data from 52 multilingual adult participants. In Study 2, criterion-based ...

  15. Evaluating the impact of nonverbal behavior on language ability ratings

    Preliminary study of the role of eye contact, gestures, and smiles produced by Chinese-as-a-first-language test takers on ratings assigned by English-as-a-first- language examiners during IELTS speaking tests [Unpublished MA thesis]. University of Victoria, Canada.

  16. (PDF) The Language Experience and Proficiency Questionnaire (LEAP-Q

    The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. ... The LEAP-Q was also adapted to collect language proficiency and exposure data in a trilingual individual with aphasia before and ...

  17. The Language Experience and Proficiency Questionnaire (LEAP-Q): Ten

    The Language Experience and Proficiency Questionnaire (LEAP-Q) is a validated questionnaire tool for collecting self-reported proficiency and experience data from bilingual and multilingual speakers ages 14 to 80. It is available in over 20 languages, and can be administered in a digital, paper-and-pencil, and oral interview format.

  18. (PDF) English Language Proficiency Level of Junior Students from a

    This study aimed to determine the English language proficiency of junior students from Cavite State University Imus Campus. As suggested by a language expert, the researchers utilized the free ...

  19. PDF Language Learning Strategies and Language Proficiency of Grade 12 Students

    The total mean of test scores. (84.82) of Grade 12 students is interpreted as generally "approaching proficiency," which implies that students can understand and communicate academic content using complex vocabulary. Out of 107 respondents, only 103 achieved a particular level.

  20. (DOC) Relationship of English language proficiency to the students

    A standardized questionnaire on English language proficiency was utilized as the main tool in gathering the data of the study. Environment and Participants Bohol Island State University, Candijay Campus formerly known as Bohol School of Fisheries and then became Central Visayas State College of Agriculture, Forestry and Technology (CVSCAFT) on ...

  21. A Brief Study of English Language Proficiency : Employability

    Content-based ELD links language learning and academic vocabulary directly to core instruction, and includes four steps: (1) preview the text to determine the core concept, (2) match the concept ...

  22. PDF ESLP 182 Questionnaire:

    ESLP 182 Questionnaire: Self-Assessment of English Writing and Grammar, Punctuation, & Mechanics Skills and Use of Writing & Editing Strategies Please rate your abilities for each item below a scale between 1 to 5. Circle your choice. 1=never or almost never true of me 2=usually not true of me 3=somewhat true of me 4=usually true of me

  23. English language Student Questionnaire

    In this questionnaire you will find questions about the following topics: You, your family, and your home. Language learning in school. Your view on reading. How you think about your life. Your School. Your school schedule and learning time. Please read each question carefully and answer as accurately as you can.

  24. (PDF) English Language Proficiency and Academic Performance of

    English is a language that exists across all subjects, and requires the ability to listen, speak, read and write. In most contexts, English ability, in general, was reported to be a significant ...