datasets fsspec bert-score rouge-score==0.1.2 nltk==3.9.1 deepeval==3.3.2 google-generativeai==0.8.5 xai-sdk==1.0.0 ftfy groq