We were unable to load Disqus. If you are a moderator please see our troubleshooting guide.

Douglas Houston • 1 year ago

"This implies that we shouldn’t see models larger than the 540B-parameter PaLM trained on 780B tokens for a while"
https://cobusgreyling.mediu...