The Microsoft blog post that this links to is slightly less bad, only because the "natural language understanding" part is buried in text, not the headline:

https://www.microsoft.com/en-us/research/blog/microsoft-deberta-surpasses-human-performance-on-the-superglue-benchmark/
Back to the puff piece, there's a bit where they say: "It is important to note that ..." and I thought maybe there's be something sensible coming like: these results should be treated with caution because, but no.

"... this is not the first model to surpass human baselines."
This is just pure #AIhype and it does harm in the world. Everytime you tell the public that "AI understands human language" (ahem, English), then that lends credibility to all of the AI snake oil be sold for surveillance, exam scoring, exam score interpolation, etc. >>
Also, it looks ridiculous, too. If you think that scoring high on SuperGLUE shows understanding, show me what your machine is doing that actually amounts to understanding.
You can follow @emilymbender.
Tip: mention @twtextapp on a Twitter thread with the keyword “unroll” to get a link to it.

Latest Threads Unrolled:

By continuing to use the site, you are consenting to the use of cookies as explained in our Cookie Policy to improve your experience.