Fixes on representation model, visualisations, and embeddings in CPU mode. Package updates and optimisation for compatibility db3eaec seanpedrickcase commited on Nov 13
Corrected reference to sentence transformers dependency. Updated Dockerfile packages 611584f seanpedrickcase commited on Nov 13
Adjusted requirements for max available for Huggingface python==3.10 platform 6bf616b seanpedrickcase commited on Oct 21
Test update main requirements file for huggingface compatibility 9a4b420 seanpedrickcase commited on Oct 21
Llama-cpp-python in GPU mode doesn't seem to work well with Bertopic on Huggingface, so downgrading that to CPU version 88d81fa seanpedrickcase commited on Dec 12, 2024
Rearranged functions for embeddings creation to be compatible with zero GPU space. Updated packages. cc495e1 seanpedrickcase commited on Dec 12, 2024
Added example of how to run function from command line. Updated packages. Embedding model default now smaller and at fp16. 34f1e83 seanpedrickcase commited on Nov 20, 2024
Improved initial clean options. Now has option to return embeddings only. 89c4d20 seanpedrickcase commited on Nov 18, 2024
App now retains original index following cleaning to allow for referring back to original data 90553eb seanpedrickcase commited on Sep 25, 2024
Allowed for app running on AWS to use smaller embedding model and not to load representation LLM (due to size restrictions). 22ca76e seanpedrickcase commited on Aug 12, 2024
Only aggregate topics not 'other', allowed for minimum sentence length, default max_topics now will auto aggregate topics. Added Cognito Auth functionality (boto3 with AWS). 1e2bb3e seanpedrickcase commited on Aug 9, 2024
Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities 55f0ce3 seanpedrickcase commited on Jun 27, 2024
Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling. 04a15c5 seanpedrickcase commited on Jun 20, 2024
Upgraded to Gradio 4.16.0. Guide for converting to exe added. 0a177ca Sonnyjim commited on Feb 5, 2024
Added clean data options, improved re-representation options and visualisation. General format changes 4effac0 Sonnyjim commited on Feb 2, 2024
Lots of general fixes. New visualisations, fixed hierarchical vis for zero shot. Added calc all probabilities. b4510a6 Sonnyjim commited on Jan 29, 2024