1887
Volume 13, Issue 1
  • ISSN 2213-8706
  • E-ISSN: 2213-8714
USD
Buy:$35.00 + Taxes

Abstract

Semantic transparency refers to the relationship between the meanings of whole-words and their constituent morphemes. Mandarin Chinese Quadrisyllabic Idiomatic Expressions (QIEs, also known as Cheng-Yu) have a similar property, in which the whole meaning is often more than the summed meanings of the component words. Using data from Wu (2016), we analyze the semantic transparency of QIEs as a function computational semantic similarity, word frequency, and syntactic structure. Our results indicate that the probability of a QIE being labelled as transparent increases with its frequency, whereas computational measures of semantic similarity and structure were not strongly associated with one existing set of semantic transparency labels. We hypothesize that these results may be influenced by the nature of QIEs, where their meaning may sometimes be obscured and based on traditional stories rather than any properties of the constituents — semantic or otherwise. Therefore, we advocate for the inclusion of word frequency as a factor in transparency rating of QIEs. Additionally, we suggest exploring other variables that may improve the transparency rating models.

Loading

Article metrics loading...

/content/journals/10.1075/ijchl.00045.ruo
2026-05-21
2026-06-07
Loading full text...

Full text loading...

References

  1. Bell, M. J., & Schaefer, M.
    (2013) Semantic transparency : challenges for distributional semantics. IWCS 2013 Workshop Towards a Formal Distributional Semantics, 1–10.
    [Google Scholar]
  2. Bell, M. J., & Schäfer, M.
    (2016) Modelling semantic transparency. Morphology, 26(2), 157–199. 10.1007/s11525‑016‑9286‑3
    https://doi.org/10.1007/s11525-016-9286-3 [Google Scholar]
  3. Chen, Y., & Xing, H.
    (2010) Semantic extraction and semantic transparency automatic assessment experiments based on distributed characteristics. Modern Chinese (Language Research Version), 031, 111–113.
    [Google Scholar]
  4. China National Office for Teaching Chinese as Foreign Language
    China National Office for Teaching Chinese as Foreign Language (1992) the Syllabus of Graded Characters for Chinese Proficiency. Beijing Language and Culture University Press.
    [Google Scholar]
  5. Frisson, S., Niswander-Klement, E., & Pollatsek, A.
    (2008) The role of semantic transparency in the processing of English compound words. British Journal of Psychology, 99(1). 10.1348/000712607X181304
    https://doi.org/10.1348/000712607X181304 [Google Scholar]
  6. Fu, P.
    (2012) Semantic Transparency Analysis of Chinese Idioms Based on TCFL [Master’s thesis]. Shandong University.
    [Google Scholar]
  7. Gagné, C. L., & Spalding, T. L.
    (2016) Effects of morphology and semantic transparency on typing latencies in English compound and pseudocompound words. Journal of Experimental Psychology: Learning Memory and Cognition, 42(9). 10.1037/xlm0000258
    https://doi.org/10.1037/xlm0000258 [Google Scholar]
  8. Gagné, C. L., Spalding, T. L., & Schmidtke, D.
    (2019) LADEC: The Large Database of English Compounds. Behavior Research Methods, 51(5), 2152–2179. 10.3758/s13428‑019‑01282‑6
    https://doi.org/10.3758/s13428-019-01282-6 [Google Scholar]
  9. Gibbs Jr, R. W., & Colston, H. L.
    (2012) Interpreting figurative meaning. Cambridge University Press.
    [Google Scholar]
  10. Ji, H., Gagné, C. L., & Spalding, T. L.
    (2011) Benefits and costs of lexical decomposition and semantic integration during the processing of transparent and opaque English compounds. Journal of Memory and Language, 65(4), 406–430.
    [Google Scholar]
  11. Kim, S. Y., Yap, M. J., & Goh, W. D.
    (2019) The role of semantic transparency in visual word recognition of compound words: A megastudy approach. Behavior Research Methods, 51(6). 10.3758/s13428‑018‑1143‑3
    https://doi.org/10.3758/s13428-018-1143-3 [Google Scholar]
  12. Landauer, T. K., Foltz, P. W., & Laham, D.
    (1998) An introduction to latent semantic analysis. Discourse Processes, 25(2–3). 10.1080/01638539809545028
    https://doi.org/10.1080/01638539809545028 [Google Scholar]
  13. Li, D., Zhang, Y., & Wang, X.
    (2016) Descriptive norms for 350 Chinese idioms with seven syntactic structures. Behavior Research Methods, 48(4). 10.3758/s13428‑015‑0692‑y
    https://doi.org/10.3758/s13428-015-0692-y [Google Scholar]
  14. Li, J.
    (2011) A quantification analysis of the transparency of the lexical meaning of the modern Chinese dictionary. Chinese Linguistics, 35(3), 54–62.
    [Google Scholar]
  15. Li, J., & Li, Y.
    (2008) On the transparency of lexical meaning. Studies in Language and Linguistics, 28(3), 60–65.
    [Google Scholar]
  16. Li, L., & Hin Tat, C.
    (2014) Acquisition of Chinese quadra-syllabic idiomatic expressions: Effects of semantic opacity and structural symmetry. First Language, 34(4), 336–353. 10.1177/0142723714544409
    https://doi.org/10.1177/0142723714544409 [Google Scholar]
  17. Li, M., Jiang, N., & Gor, K.
    (2017) L1 and L2 processing of compound words: Evidence from masked priming experiments in English. Bilingualism, 20(2). 10.1017/S1366728915000681
    https://doi.org/10.1017/S1366728915000681 [Google Scholar]
  18. Libben, G., Gibson, M., Yoon, Y. B., & Sandra, D.
    (2003) Compound fracture: The role of semantic transparency and morphological headedness. Brain and Language, 84(1), 50–64. 10.1016/S0093‑934X(02)00520‑5
    https://doi.org/10.1016/S0093-934X(02)00520-5 [Google Scholar]
  19. Liu, T. H., & Su, L. I. W.
    (2021) Chinese idioms as constructions: Frequency, semantic transparency and their processing. Language and Linguistics, 22(4). 10.1075/lali.00091.liu
    https://doi.org/10.1075/lali.00091.liu [Google Scholar]
  20. Marelli, M., & Luzzatti, C.
    (2012) Frequency effects in the processing of Italian nominal compounds: Modulation of headedness and semantic transparency. Journal of Memory and Language, 66(4). 10.1016/j.jml.2012.01.003
    https://doi.org/10.1016/j.jml.2012.01.003 [Google Scholar]
  21. Mikolov, T., Chen, K., Corrado, G., & Dean, J.
    (2013) Efficient estimation of word representations in vector space. 1st International Conference on Learning Representations, ICLR 2013 — Workshop Track Proceedings.
    [Google Scholar]
  22. Mok, L. W.
    (2009) Word-superiority effect as a function of semantic transparency of Chinese bimorphemic compound words. Language and Cognitive Processes, 24(7–8), 1039–1081. 10.1080/01690960902831195
    https://doi.org/10.1080/01690960902831195 [Google Scholar]
  23. Pennington, J., Socher, R., & Manning, C. D.
    (2014) GloVe: Global vectors for word representation. EMNLP 2014 — 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. 10.3115/v1/D14‑1162
    https://doi.org/10.3115/v1/D14-1162 [Google Scholar]
  24. Plag, I.
    (2003) Word-Formation in English. Cambridge University Press.
    [Google Scholar]
  25. Pollatsek, A., & Hyönä, J.
    (2005) The role of semantic transparency in the processing of Finnish compound words. Language and Cognitive Processes, 20(1–2), 261–290. 10.1080/01690960444000098
    https://doi.org/10.1080/01690960444000098 [Google Scholar]
  26. Rayner, K., Li, X., & Pollatsek, A.
    (2007) Extending the E-Z reader model of eye movement control to Chinese readers. Cognitive Science, 31(6). 10.1080/03640210701703824
    https://doi.org/10.1080/03640210701703824 [Google Scholar]
  27. Reddy, S., McCarthy, D., & Manandhar, S.
    (2011) An Empirical Study on Compositionality in Compound Nouns. IJCNLP 2011 — Proceedings of the 5th International Joint Conference on Natural Language Processing, 210–218.
    [Google Scholar]
  28. Rehurek, R., & Sojka, P.
    (2011) Gensim — Statistical Semantics in Python. InLecture Notes in Computer Science (Vol.66111).
    [Google Scholar]
  29. Sag, I. A., Baldwin, T., Bond, F., Copestake, A., & Flickinger, D.
    (2002) Multiword Expressions: A Pain in the Neck for NLP. InLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol.22761, pp. 1–15). 10.1007/3‑540‑45715‑1_1
    https://doi.org/10.1007/3-540-45715-1_1 [Google Scholar]
  30. Schmidtke, D., Van Dyke, J. A., & Kuperman, V.
    (2018) Individual variability in the semantic processing of English compound words. Journal of Experimental Psychology: Learning Memory and Cognition, 44(3). 10.1037/xlm0000442
    https://doi.org/10.1037/xlm0000442 [Google Scholar]
  31. Song, B., Zhou, X., & Jin, T.
    (2017) A Study on the Coverage and Semantic Transparency of Words Exceeding the Standard of Vocabulary. Chinese Language Learning, 31, 95–104.
    [Google Scholar]
  32. Song, Y., Shi, S., Li, J., & Zhang, H.
    (2018) Directional skip-gram: Explicitly distinguishing left and right context forword embeddings. NAACL HLT 2018 — 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies — Proceedings of the Conference, 21. 10.18653/v1/N18‑2028
    https://doi.org/10.18653/v1/N18-2028 [Google Scholar]
  33. Tabossi, P., Fanari, R., & Wolf, K.
    (2008) Processing Idiomatic Expressions: Effects of Semantic Compositionality. Journal of Experimental Psychology: Learning Memory and Cognition, 34(2), 313–327. 10.1037/0278‑7393.34.2.313
    https://doi.org/10.1037/0278-7393.34.2.313 [Google Scholar]
  34. Tang, X., & Liang, S.
    (2020) Study on Semantic Transparency of Chinese Compounds Based on Word Embedding. 2020 International Conference on Asian Language Processing, IALP 2020, 20141, 130–134. 10.1109/IALP51396.2020.9310483
    https://doi.org/10.1109/IALP51396.2020.9310483 [Google Scholar]
  35. the Institute of Linguistics of the Chinese Academy of Social Sciences
    the Institute of Linguistics of the Chinese Academy of Social Sciences (2012) Contemporary Chinese Dictionary(Sixth Edition). The Commercial Press.
    [Google Scholar]
  36. Tse, C. S., Yap, M. J., Chan, Y. L., Sze, W. P., Shaoul, C., & Lin, D.
    (2017) The Chinese Lexicon Project: A megastudy of lexical decision performance for 25,000+ traditional Chinese two-character compound words. Behavior Research Methods, 49(4), 1503–1519. 10.3758/s13428‑016‑0810‑5
    https://doi.org/10.3758/s13428-016-0810-5 [Google Scholar]
  37. Tsou, B. K.
    (2012) Idiomaticity and classical traditions in some East Asian languages. Proceedings of the 26th Pacific Asia Conference on Language, Information, and Computation, 39–55.
    [Google Scholar]
  38. Ullmann, S.
    (1962) Semantics. An Introduction to the Science of Meaning. InOxford: Basil Blackwell. 10.2307/3720401
    https://doi.org/10.2307/3720401 [Google Scholar]
  39. Wang, H. C., Hsu, L. C., Tien, Y. M., & Pomplun, M.
    (2012) Estimating Semantic Transparency of Constituents of English Compounds and Two-Character Chinese Words using Latent Semantic Analysis. Building Bridges Across Cognitive Sciences Around the World — Proceedings of the 34th Annual Meeting of the Cognitive Science Society, CogSci 2012, 2499–2504.
    [Google Scholar]
  40. (2014) Predicting raters’ transparency judgments of English and Chinese morphological constituents using latent semantic analysis. Behavior Research Methods, 46(1), 284–306. 10.3758/s13428‑013‑0360‑z
    https://doi.org/10.3758/s13428-013-0360-z [Google Scholar]
  41. Wang, J., Chen, T., Jiao, X., Liu, K., Tong, S., & Sun, J.
    (2021) Test-retest reliability of duration-related and frequency-related mismatch negativity. Neurophysiologie Clinique, 51(6), 541–548.
    [Google Scholar]
  42. Wang, S., Huang, C. R., Yao, Y., & Chan, A.
    (2019) The effect of morphological structure on semantic transparency ratings. Language and Linguistics, 20(2), 225–255. 10.1075/lali.00035.wan
    https://doi.org/10.1075/lali.00035.wan [Google Scholar]
  43. Wang, X.
    (2010) The Chinese Idiom Dictionary. Sinolingua Press.
    [Google Scholar]
  44. Wu, D.
    (2016) Research on the Semantic Transparency of Chinese Idioms and their Production in Essays by Primary And Secondary School Students [Master’s thesis]. Beijing Language and Culture University.
    [Google Scholar]
  45. Xing, H., Li, S., Li, M., Wu, P., Shi, G., & Shu, H.
    (2016) Research on Components and Development of Students’ Native Language Competence. Chinese Journal of Language Policy and Planning, 1(5), 28–36.
    [Google Scholar]
  46. Xu, C., Wang, F., Han, J., & Li, C.
    (2019) Exploiting multiple embeddings for Chinese named entity recognition. InProceedings of the 28th ACM international conference on information and knowledge management (pp.2269–2272).
    [Google Scholar]
  47. Xu, J.
    (2015) Corpus-based Chinese studies: A historical review from the 1920s to the present. Chinese Language and Discourse, 6(2), 218–244. 10.1075/cld.6.2.06xu
    https://doi.org/10.1075/cld.6.2.06xu [Google Scholar]
  48. Xu, S.
    (2006) Proximity and complementation: studies of the formation mechanism of idioms from a cognitive point of view. Journal of Sichuan International Studies University, 2–107.
    [Google Scholar]
  49. Xu, Y.
    (2014) Semantic Transparency Research on Frequently Used Chinese Compound Words For Second Language Teaching. Beijing Normal University.
    [Google Scholar]
  50. Yao, L., Mao, C., & Luo, Y.
    (2019) Graph convolutional networks for text classification. InProceedings of the AAAI conference on artificial intelligence (Vol.33, No.01, pp.7370–7377).
    [Google Scholar]
  51. Zhan, W., Guo, R., & Chen, Y.
    (2003) The CCL Corpus of Chinese Texts: 700 million Chinese Characters, the 11th Century BC-present, Available online at the website of Center for Chinese Linguistics (abbreviated as CCL) of Peking University.
    [Google Scholar]
  52. Zhan, W., Guo, R., Chang, B., Chen, Y., & Chen, L.
    (2019) The building of the CCL corpus: its design and implementation. Corpus Linguistics, 6(1), 71–86.
    [Google Scholar]
  53. Zhao, D.
    (2016) Semantic Transparency Study of Commonly Used Idioms. Modern Communication, 21(4), 238–241.
    [Google Scholar]
  54. Zhou, J.
    (2004) Theory on Word-structure of Chinese (in Chinese: Hanyu cihui jiegoulun). Shanghai Dictionary Press.
    [Google Scholar]
  55. Zhou, X., & Marslen-Wilson, W.
    (1995) Morphological Structure in the Chinese Mental Lexicon. Language and Cognitive Processes, 10(6). 10.1080/01690969508407114
    https://doi.org/10.1080/01690969508407114 [Google Scholar]
  56. Zwitserlood, P.
    (1994) The Role of Semantic Transparency in the Processing and Representation of Dutch Compounds. Language and Cognitive Processes, 9(3). 10.1080/01690969408402123
    https://doi.org/10.1080/01690969408402123 [Google Scholar]
/content/journals/10.1075/ijchl.00045.ruo
Loading
/content/journals/10.1075/ijchl.00045.ruo
Loading

Data & Media loading...

This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error