Releases: charlesdedampierre/BunkaTopics

v.046

11 Apr 08:26

V0.46

New changes:

  • Ability to add metadata and display it on a plot
  • Fix bugs with Mistral Generation
  • Add FlagEmbedding
  • Remove Chroma
  • Add token count
  • Sample documents to improve the terms extraction process
  • Save and load Bunka objects
  • Load embeddings into Bunka
  • Ability to add your own projection model (t-SNE, PCA, UMAP, etc.)
  • Add outlier detection
  • Automatic language detection
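
To illustrate what a pluggable projection model can look like, here is a minimal sketch in plain Python. It assumes the reducer only needs a scikit-learn-style `fit_transform` method, which scikit-learn's `TSNE` and `PCA` and umap-learn's `UMAP` all expose; whether Bunka relies on exactly that method is an assumption, not something these notes state. The class `TopVarianceProjection` and the sample embeddings are hypothetical and are not part of Bunkatopics.

```python
from statistics import pvariance
from typing import List, Sequence


class TopVarianceProjection:
    """Toy reducer (hypothetical): keeps the highest-variance input dimensions.

    It mimics the fit_transform interface of scikit-learn reducers so that it
    could be swapped with PCA, TSNE or UMAP in code that only calls that method.
    """

    def __init__(self, n_components: int = 2) -> None:
        self.n_components = n_components
        self.selected_: List[int] = []

    def fit_transform(self, embeddings: Sequence[Sequence[float]]) -> List[List[float]]:
        n_dims = len(embeddings[0])
        # Collect each dimension as a column and measure its spread.
        columns = [[row[d] for row in embeddings] for d in range(n_dims)]
        variances = [pvariance(col) for col in columns]
        # Keep the n_components dimensions with the largest variance.
        ranked = sorted(range(n_dims), key=lambda d: variances[d], reverse=True)
        self.selected_ = sorted(ranked[: self.n_components])
        # Center the kept dimensions so the plot is anchored at the origin.
        means = {d: sum(columns[d]) / len(embeddings) for d in self.selected_}
        return [[row[d] - means[d] for d in self.selected_] for row in embeddings]


# Made-up 4-dimensional embeddings for four documents.
embeddings = [
    [0.0, 1.0, 5.0, 0.1],
    [0.0, 3.0, -5.0, 0.1],
    [0.0, 5.0, 5.0, 0.2],
    [0.0, 7.0, -5.0, 0.2],
]
projector = TopVarianceProjection(n_components=2)
points_2d = projector.fit_transform(embeddings)
```

Any object with the same method could stand in for `projector`, which is the design choice that makes the projection step swappable.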

v.045

21 Jan 21:53

In this major update, we have completely revamped our package to align it with the latest framework and to better facilitate data cleaning for the next generation of Large Language Models (LLMs). Bunkatopics version 0.45 remains a versatile package for a wide range of tasks, including Topic Modeling Visualization, Frame Analysis, and data cleaning.

Utilizing Large Language Models: Our package uses Large Language Models (LLMs) to help developers extract valuable insights from unstructured data.

Integration of well-known Libraries: Bunkatopics is now built on well-known libraries such as langchain, chroma, and transformers, which eases integration into diverse development environments.

In-depth insights: Bunkatopics excels at providing in-depth insights into specific topics within categories, such as exploring Technology topics on the Medium website.

Framing Analysis: Bunkatopics introduces a 2-dimensional unsupervised scale, the Bourdieu map, facilitating the visualization of textual data and topics. This feature offers valuable insights into data distribution patterns.

Manually Cleaning Topics: Users can now exercise greater flexibility and customization by manually changing topic names or labels to better suit their needs.

Data Filtering by Topics: This release enables users to construct tailored datasets by excluding topics that do not align with their specific interests, which simplifies preparing data for fine-tuning models.
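
The idea behind filtering by topics can be sketched in a few lines of plain Python. This is not the Bunkatopics API: the record layout (`doc_id`, `text`, `topic_id`), the topic identifiers, and the helper `filter_by_topics` are all hypothetical and only show the principle of dropping documents whose topic is on an exclusion list.

```python
from typing import Dict, List, Set

# Hypothetical documents, each already assigned to one topic.
documents: List[Dict[str, str]] = [
    {"doc_id": "d1", "text": "New GPU architectures for training", "topic_id": "hardware"},
    {"doc_id": "d2", "text": "Ten tips for a better morning routine", "topic_id": "lifestyle"},
    {"doc_id": "d3", "text": "Tokenizers and context windows explained", "topic_id": "nlp"},
    {"doc_id": "d4", "text": "Why I quit my job to travel", "topic_id": "lifestyle"},
]


def filter_by_topics(
    docs: List[Dict[str, str]], excluded_topics: Set[str]
) -> List[Dict[str, str]]:
    """Return only the documents whose topic is not in excluded_topics."""
    return [doc for doc in docs if doc["topic_id"] not in excluded_topics]


# Drop the topic that is irrelevant for the dataset being built.
kept = filter_by_topics(documents, excluded_topics={"lifestyle"})
kept_ids = [doc["doc_id"] for doc in kept]
```

The remaining documents keep their original order, so the filtered list can be written out directly as a fine-tuning dataset.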

v0.45p

21 Jan 21:27
Pre-release

In this major update, we have completely revamped our package to align it with the latest framework and to better facilitate data cleaning for the next generation of Large Language Models (LLMs). Bunkatopics version 0.45 remains a versatile package for a wide range of tasks, including Topic Modeling Visualization, Frame Analysis, and data cleaning.

  • Utilizing Large Language Models: Our package uses Large Language Models (LLMs) to help developers extract valuable insights from unstructured data.

  • Integration of well-known Libraries: Bunkatopics is now built on well-known libraries such as langchain, chroma, and transformers, which eases integration into diverse development environments.

  • In-depth insights: Bunkatopics excels at providing in-depth insights into specific topics within categories, such as exploring Technology topics on the Medium website.

  • Framing Analysis: Bunkatopics introduces a 2-dimensional unsupervised scale, the Bourdieu map, facilitating the visualization of textual data and topics. This feature offers valuable insights into data distribution patterns.

  • Manually Cleaning Topics: Users can now exercise greater flexibility and customization by manually changing topic names or labels to better suit their needs.

  • Data Filtering by Topics: This release enables users to construct tailored datasets by excluding topics that do not align with their specific interests, which simplifies preparing data for fine-tuning models.

release v0.41

07 Oct 15:11
Add final change to Poetry

v0.38

18 Jun 14:29
Add numbers to the Bourdieu graph