A Case Study on the Diminishing Popularity of Encoder-Only Architectures in Machine Learning Models
Praveen Kumar Sridhar¹, Nitin Srinivasan², Adithyan Arun Kumar³, Gowthamaraj Rajendran⁴, Kishore Kumar Perumalsamy⁵

¹Praveen Kumar Sridhar, Department of Data Science, Northeastern University, San Jose, United States.

²Nitin Srinivasan, Department of Computer Science, University of Massachusetts Amherst, Sunnyvale, United States.

³Adithyan Arun Kumar, Department of Information Security, Carnegie Mellon University, San Jose, United States.

⁴Gowthamaraj Rajendran, Department of Information Security, Carnegie Mellon University, San Jose, United States.

⁵Kishore Kumar Perumalsamy, Department of Computer Science, Carnegie Mellon University, San Jose, United States.

Open Access | Editorial and Publishing Policies | Cite | Zenodo | OJS | Indexing and Abstracting
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: This paper examines the shift from encoder-only to decoder and encoder-decoder models in machine learning, highlighting the decline in popularity of encoder-only architectures. It explores the reasons behind this trend, including advancements in decoder models that offer superior generative capabilities, flexibility across various domains, and enhancements in unsupervised learning techniques. The study also discusses the role of prompting techniques in simplifying model architectures and enhancing model versatility. By analyzing the evolution, applications, and shifting preferences within the research community and industry, this paper aims to provide insights into the changing landscape of machine learning model architectures.

Keywords: Machine Learning, Deep Learning, Encoder, Transformers, Decoder, Natural Language Processing, Generative Model, Model Evolution.
Scope of the Article: Deep Learning

Download PDF

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US

D982713040324

JOURNAL

REQUIREMENTS

PRODUCT

CONTACT US