ATTENTION: This page has been migrated to the Tazama GitHub repository (https://github.com/frmscoe/docs/tree/dev/Knowledge-Articles/Entity-Resolution) and will no longer be maintained in Confluence. |
---|
Setup Analyzer
1. In Lens, connect to the shell of the ArangoDB pod.
2. Type arangosh in the shell and press Enter to start the ArangoDB shell.
3. Switch to the correct database: db._useDatabase("transactionHistory");
4. Load the analyzers module: var analyzers = require("@arangodb/analyzers");
5. Add the new analyzer: analyzers.save("text_en_no_stem", "text", { locale: "en.utf-8", accent: false, case: "lower", stemming: false, stopwords: [] }, ["position", "frequency", "norm"]);

That's it! The new analyzer has been added.
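To make the analyzer options more concrete, here is a minimal sketch in plain JavaScript of what the chosen settings are meant to do to a piece of text: lowercase it, strip accents, and keep every word unstemmed. It is only an approximation for illustration and runs outside ArangoDB. The helper name tokenizeNoStem is hypothetical, and the real analyzer's word-breaking rules may differ from the simple split used here.

```javascript
// Plain-JavaScript model of the "text_en_no_stem" settings. Not ArangoDB code.
// tokenizeNoStem is a hypothetical helper name used only for illustration.
function tokenizeNoStem(text) {
  return text
    .normalize("NFD")                      // split accented letters into base letter + mark
    .replace(/\p{M}/gu, "")                // accent: false -> drop the combining marks
    .toLowerCase()                         // case: "lower"
    .split(/[^\p{L}\p{N}]+/u)              // assumption: break on anything that is not a letter or digit
    .filter((token) => token.length > 0);  // stopwords: [] -> no words are filtered out
  // stemming: false -> tokens are returned as written, never reduced to a stem
}

console.log(tokenizeNoStem("Johannes Petrus Foley"));
console.log(tokenizeNoStem("José is running"));
```

In this model the first call produces the three lowercase name parts, and the second keeps "is" and "running" intact because no stopwords are removed and no stemming is applied.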
The FullText Index (FTI) can be used to find words, or prefixes of words, inside documents. The function used to query an FTI searches for every word in the query and matches each one exactly. See the samples below.
Given the dataset below:
Johannes Petrus Foley | Johannes | Petrus |
Foley | Johannes Petrus | Johannes Foley |
Petrus Foley | | |
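The matching rule described above (every search word must be present, matched exactly unless a prefix search is requested) can be sketched in plain JavaScript against the same seven names. This is a hypothetical model, not ArangoDB's implementation: the search function and its prefix option are names invented for this illustration, and its output is not a substitute for results returned by the database.

```javascript
// Hypothetical model of "every search word must match". Not ArangoDB's implementation.
const names = [
  "Johannes Petrus Foley", "Johannes", "Petrus", "Foley",
  "Johannes Petrus", "Johannes Foley", "Petrus Foley",
];

// Lowercase a string and split it into words on whitespace.
const words = (text) => text.toLowerCase().split(/\s+/).filter(Boolean);

// Return the names that contain every search word.
// With prefix = true, a search word also matches the start of a longer word.
function search(query, { prefix = false } = {}) {
  const wanted = words(query);
  return names.filter((name) => {
    const present = words(name);
    return wanted.every((w) =>
      present.some((p) => (prefix ? p.startsWith(w) : p === w)));
  });
}

console.log(search("Johannes Foley"));
console.log(search("Joh"));
console.log(search("Joh", { prefix: true }));
```

In this model, searching for "Johannes Foley" keeps only the names that contain both words, "Joh" on its own matches nothing because no name contains that exact word, and the same term as a prefix matches every name containing "Johannes".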
The queries below yield the following results:
...