Add Guided Key

MaartenGr · MaartenGr · Sep 28, 2021 · Aug 27, 2021 · Sep 28, 2021 · Sep 28, 2021
commit a99fc6309467f54eefb859bbcb0b38962ac18750
diff --git a/docs/guides/quickstart.md b/docs/guides/quickstart.md
@@ -147,4 +147,30 @@ candidates = [candidate[0] for candidate in candidates]
 # KeyBERT init
 kw_model = KeyBERT()
 keywords = kw_model.extract_keywords(doc, candidates)
+```
+
+### Guided KeyBERT
+
+Guided KeyBERT is similar to Guided Topic Modeling in that it tries to steer the training towards a set of seeded terms. When applying KeyBERT it automatically extracts the most related keywords to a specific document. However, there are times when stakeholders and users are looking for specific types of keywords. For example, when publishing an article on your website through contentful, you typically already know the global keywords related to the article. However, there might be a specific topic in the article that you would like to be extracted through the keywords. To achieve this, we simply give KeyBERT a set of related seeded keywords (it can also be a single one!) and search for keywords that are similar to both the document and the seeded keywords. 
+
+Using this feature is as simple as defining a list of seeded keywords and passing them to KeyBERT:
+
+
+```python
+doc = """
+ Supervised learning is the machine learning task of learning a function that
+ maps an input to an output based on example input-output pairs.[1] It infers a
+ function from labeled training data consisting of a set of training examples.[2]
+ In supervised learning, each example is a pair consisting of an input object
+ (typically a vector) and a desired output value (also called the supervisory signal). 
+ A supervised learning algorithm analyzes the training data and produces an inferred function, 
+ which can be used for mapping new examples. An optimal scenario will allow for the 
+ algorithm to correctly determine the class labels for unseen instances. This requires 
+ the learning algorithm to generalize from the training data to unseen situations in a 
+ 'reasonable' way (see inductive bias).
+ """
+
+kw_model = KeyBERT()
+seed_keywords = ["information"]
+keywords = kw_model.extract_keywords(doc, use_mmr=True, diversity=0.1, seed_keywords=seed_keywords)
 ```