spacy
streamlit
python-dotenv
openai
scikit-learn
reportlab
langchain
pdfplumber
pandas
numpy
google-generativeai
PyPDF2