In this review, we focus specifically on transformer-based LLMs, which are built on the transformer architecture, an attention-based mechanism capable of capturing complex dependencies in sequential ...