Gaussian Error Linear Units (GELUs)
The GELU is an activation function introduced in the paper "Gaussian Error Linear Units (GELUs)" by Hendrycks and Gimpel. If you're unfamiliar with activation functions or want a refresher, you can check out this blog post, where I explain them in detail.
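
To make the function concrete before going further, here is a minimal sketch of the exact form from the paper, GELU(x) = x * Phi(x), where Phi is the CDF of the standard normal distribution. The helper name `gelu` is my own choice for illustration, and the sketch handles a single float rather than a tensor.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), with Phi the standard normal CDF.

    Phi(x) is written via the error function:
    Phi(x) = 0.5 * (1 + erf(x / sqrt(2))).
    """
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


if __name__ == "__main__":
    # Large positive inputs pass through almost unchanged,
    # large negative inputs are squashed toward zero.
    for value in (-3.0, -1.0, 0.0, 1.0, 3.0):
        print(f"gelu({value:+.1f}) = {gelu(value):+.6f}")
```

Unlike ReLU, which gates an input purely by its sign, this weights each input by how likely a standard normal variable is to fall below it, so small negative inputs still produce small negative outputs.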