GELU
Smooth approximation of ReLU used in transformers. Defined as x·Φ(x), where Φ is the standard Gaussian CDF.
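The formula can be sketched in a few lines of Python. This is a minimal illustration, not a library implementation: the function names `gelu`, `relu`, and `std_normal_cdf` are chosen here for the example, and Φ is computed from the error function as Φ(x) = ½·(1 + erf(x/√2)).

```python
import math


def std_normal_cdf(x: float) -> float:
    """Phi(x): CDF of the standard Gaussian, written via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu(x: float) -> float:
    """GELU(x) = x * Phi(x)."""
    return x * std_normal_cdf(x)


def relu(x: float) -> float:
    """ReLU(x) = max(0, x), shown only for comparison."""
    return max(0.0, x)


if __name__ == "__main__":
    # Print both activations side by side on a few sample inputs.
    for x in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(f"x={x:+.1f}  gelu={gelu(x):+.4f}  relu={relu(x):+.4f}")
```

Far from zero the two functions nearly coincide. Near zero GELU bends smoothly where ReLU has a kink, and it dips slightly below zero for small negative inputs.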