Neural Networks and Language Models

Overview


In this part, we outline neural network -related work leading to contemporary language models, starting with introducing neural networks and forming machine-understandable representations of data and text, followed by problems related to using neural networks with sequential data. Finally, the part introduces self-attention and transformers, which are the basis for many of the current large language models.

The chapters of this part are as follows.