
Machine Learning for Control Systems Engineers

     

4.1 The notion of layer

We call layers the standard compound tensor operations that have been designed and empirically identified as generic and efficient. They often incorporate trainable parameters and correspond to a convenient level of granularity for designing and describing large deep models. The term is inherited from simple multi-layer neural networks, even though modern models may take the form of a complex graph of such modules with multiple parallel pathways.
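To make the idea concrete, here is a minimal sketch in plain Python, not taken from the text. The names `Affine`, `ReLU`, `sequential`, and `two_pathways` are hypothetical and chosen only for illustration. It shows a layer that carries trainable parameters, a layer that has none, a simple chain of layers, and a small graph with two parallel pathways.

```python
import random


class Affine:
    """Hypothetical layer with trainable parameters: y = W x + b."""

    def __init__(self, dim_in, dim_out, seed=0):
        rng = random.Random(seed)
        # Trainable parameters: a dim_out x dim_in weight matrix and a bias vector.
        self.W = [[rng.gauss(0.0, 1.0) for _ in range(dim_in)]
                  for _ in range(dim_out)]
        self.b = [0.0] * dim_out

    def __call__(self, x):
        return [sum(w * v for w, v in zip(row, x)) + b
                for row, b in zip(self.W, self.b)]

    def num_parameters(self):
        return len(self.W) * len(self.W[0]) + len(self.b)


class ReLU:
    """Hypothetical parameter-free layer: component-wise max(0, x)."""

    def __call__(self, x):
        return [max(0.0, v) for v in x]

    def num_parameters(self):
        return 0


def sequential(layers, x):
    """Apply the layers one after another (a classic multi-layer chain)."""
    for layer in layers:
        x = layer(x)
    return x


def two_pathways(branch, x):
    """Two parallel pathways: the identity and `branch`, summed at the end."""
    y = sequential(branch, x)
    return [a + b for a, b in zip(x, y)]


layers = [Affine(3, 4, seed=1), ReLU(), Affine(4, 3, seed=2)]
x = [1.0, -2.0, 0.5]

y_chain = sequential(layers, x)    # plain stack of layers
y_graph = two_pathways(layers, x)  # same layers inside a two-branch graph
n_params = sum(layer.num_parameters() for layer in layers)
```

In practice a deep learning framework would supply such modules together with automatic differentiation. The point here is only the level of granularity: each object is one reusable operation, and the model is the graph that connects them.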

[Figure: example of the model depiction convention]

In the following pages, I try to stick to the convention for model depiction illustrated above.

Additionally, layers that have a complex internal structure are depicted with a greater height.