Researchers at Salesforce released a framework called Robustness Gym to benchmark the robustness of NLP models.Read More
Nlp
The TuringAdvice challenge evaluates AI language models based on their ability to generate advice as good as humans that get the most upvotes on Reddit.Read More