This code defines three different neural network model

[3-1] Simple neural network

[3-2] Native encoder+decoder

[3-3] Encoder+decoder with attention