A Comparative Analysis of Language Generation Mechanisms in Cartesian Universal Grammar and Transformer-Based AI Models

Asst. Lecturer Raqib Imad Jassim

doi:10.31185/lark.5400

Authors

Asst. Lecturer Raqib Imad Jassim University of Wasit, College of Arts

DOI:

https://doi.org/10.31185/lark.5400

Keywords:

UG, Cartesian Universal Grammar, MP, Transformer-Based AI Models

Abstract

ABSTRACT

This paper provides a strict comparative point-of-view of two fundamentally opposed paradigms towards the perception and creation of human language: the nativist, computational-hierarchical, model of understanding human language as Cartesian Universal Grammar (UG) and operationalized by the Minimalist Program (MP), and the empiricist, statistical-associative, model of human language understanding as transformer-based Large Language Models (LLMs). The fundamental point of comparison is the driving generative processes of each model the structure-building process of the UG, which is Merge, and the mechanism of contextualization of the transformer, Self-Attention. Whereas UG assumes an innate domain- specific language faculty, which is discrete-infinity, recursively structure-generating, and characterized by discrete infinity, LLMs acquire their impressive language ability through statistical optimization on large corpora, using continuous, high-dimensional, vector representations. To substantiate the dissimilarities between these two models, the analysis method applies a four-part framework, namely the Computational Primitive, Representational Structure, Source of Knowledge, and Explanatory Scope. Results have shown that UG has higher explanatory depth in terms of nature of language competence, and it gives a principled explanatory account on its formal properties, but on the other hand, LLMs exhibit higher predictive power and performance on language use, which is reflected in fluency, coherence, and scalability. Finally, the paper explains the implication of the empirical success of LLMs to the nativist hypothesis and suggests that a more detailed theory of language may well need a unification of the formal limitations that are inherent to UG with the statistical power of transformer architecture

References

Alsaray, A. A. D., & Altimimiy, K. K. (2023). Employing artificial intelligence techniques among mobile journalism practitioners when covering daily events: A field study in Wasit Governorate. Lark Journal of Philosophy, Linguistics and Social Sciences, 15(3/Pt2), 537–577.

https://doi.org/10.31185/lark.Vol2.Iss50.3165

Chomsky, N. (1957). Syntactic structures. Mouton.

Chomsky, N. (1966). Cartesian linguistics: A chapter in the history of rationalist thought. Harper & Row.

Chomsky, N. (2002). On nature and language. Cambridge University Press.

Chomsky, N. (2014). The minimalist program. MIT Press.

Chomsky, N., Seely, T. D., Berwick, R. C., & Fong, S. (2023). Merge and the strong minimalist thesis. Cambridge University Press.

Chomsky, N., Watumull, J., & Roberts, I. (2023, March 8). Chomsky: The fallacy of Chat GPT and large language models. The New York Times, A21.

Clark, K., Khandelwal, U., Levy, O., & Manning, C. D. (2019). What does BERT look at? An analysis of BERT's attention. Proceedings of the 2019 ACL Workshop Black boxNLP: Analyzing and Interpreting Neural Networks for NLP, 276-286.

Goldberg, Y. (2019). Assessing the ability of LSTMs to learn syntax-sensitive dependencies. Language and Linguistics Compass, 13(1), e12319.

Hupkes, D., & De Raedt, L. (2024). The role of recursion in large language models. Trends in Cognitive Sciences, 28(2), 105-116.

Kaplan, J., McCandlish, S., Henighan, T., Brown, T. B., Chess, B., Child, R., Gray, S., Radford, A., Wu, J., & Amodei, D. (2020). Scaling laws for neural language models. arXiv preprint arXiv:2001.08361.

Manning, C. D., & Schütze, H. (1999). Foundations of statistical natural language processing. MIT Press.

Marcus, G., & Davis, E. (2020). Rebooting AI: Building artificial intelligence we can trust. Pantheon.

Piantadosi, S. T. (2023). Modern language models refute Chomsky's approach to language. Cognitive Science, 47(10), e13361.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in neural information processing systems, 30, 5998-6008.

A Comparative Analysis of Language Generation Mechanisms in Cartesian Universal Grammar and Transformer-Based AI Models

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

Issue

Section

License

How to Cite

Language

sidemenu

Information

visitor

Latest publications

template