Rakuten Interview Question

Transformer architecture, loss function, numpy