A Comparative Analysis of Language Generation Mechanisms in Cartesian Universal Grammar and Transformer-Based AI Models
DOI:
https://doi.org/10.31185/lark.5400Keywords:
UG, Cartesian Universal Grammar, MP, Transformer-Based AI ModelsAbstract
ABSTRACT
This paper provides a strict comparative point-of-view of two fundamentally opposed paradigms towards the perception and creation of human language: the nativist, computational-hierarchical, model of understanding human language as Cartesian Universal Grammar (UG) and operationalized by the Minimalist Program (MP), and the empiricist, statistical-associative, model of human language understanding as transformer-based Large Language Models (LLMs). The fundamental point of comparison is the driving generative processes of each model the structure-building process of the UG, which is Merge, and the mechanism of contextualization of the transformer, Self-Attention. Whereas UG assumes an innate domain- specific language faculty, which is discrete-infinity, recursively structure-generating, and characterized by discrete infinity, LLMs acquire their impressive language ability through statistical optimization on large corpora, using continuous, high-dimensional, vector representations. To substantiate the dissimilarities between these two models, the analysis method applies a four-part framework, namely the Computational Primitive, Representational Structure, Source of Knowledge, and Explanatory Scope. Results have shown that UG has higher explanatory depth in terms of nature of language competence, and it gives a principled explanatory account on its formal properties, but on the other hand, LLMs exhibit higher predictive power and performance on language use, which is reflected in fluency, coherence, and scalability. Finally, the paper explains the implication of the empirical success of LLMs to the nativist hypothesis and suggests that a more detailed theory of language may well need a unification of the formal limitations that are inherent to UG with the statistical power of transformer architecture
References
Alsaray, A. A. D., & Altimimiy, K. K. (2023). Employing artificial intelligence techniques among mobile journalism practitioners when covering daily events: A field study in Wasit Governorate. Lark Journal of Philosophy, Linguistics and Social Sciences, 15(3/Pt2), 537–577.
https://doi.org/10.31185/lark.Vol2.Iss50.3165
Chomsky, N. (1957). Syntactic structures. Mouton.
Chomsky, N. (1966). Cartesian linguistics: A chapter in the history of rationalist thought. Harper & Row.
Chomsky, N. (2002). On nature and language. Cambridge University Press.
Chomsky, N. (2014). The minimalist program. MIT Press.
Chomsky, N., Seely, T. D., Berwick, R. C., & Fong, S. (2023). Merge and the strong minimalist thesis. Cambridge University Press.
Chomsky, N., Watumull, J., & Roberts, I. (2023, March 8). Chomsky: The fallacy of Chat GPT and large language models. The New York Times, A21.
Clark, K., Khandelwal, U., Levy, O., & Manning, C. D. (2019). What does BERT look at? An analysis of BERT's attention. Proceedings of the 2019 ACL Workshop Black boxNLP: Analyzing and Interpreting Neural Networks for NLP, 276-286.
Goldberg, Y. (2019). Assessing the ability of LSTMs to learn syntax-sensitive dependencies. Language and Linguistics Compass, 13(1), e12319.
Hupkes, D., & De Raedt, L. (2024). The role of recursion in large language models. Trends in Cognitive Sciences, 28(2), 105-116.
Kaplan, J., McCandlish, S., Henighan, T., Brown, T. B., Chess, B., Child, R., Gray, S., Radford, A., Wu, J., & Amodei, D. (2020). Scaling laws for neural language models. arXiv preprint arXiv:2001.08361.
Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. MIT Press.
Marcus, G., & Davis, E. (2020). Rebooting AI: Building artificial intelligence we can trust. Pantheon.
Piantadosi, S. T. (2023). Modern language models refute Chomsky's approach to language. Cognitive Science, 47(10), e13361.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30, 5998-6008.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 م.م. رقيب عماد جاسم

This work is licensed under a Creative Commons Attribution 4.0 International License.
