ATTENTION:

This page has been migrated to the Tazama GitHub repository and is now located at:

https://github.com/frmscoe/docs/tree/dev/Knowledge-Articles/Entity-Resolution

This page will no longer be maintained in Confluence.

Setup Analyzer

1. In Lens, open a shell on the ArangoDB pod.
2. Type "arangosh" in the shell and press Enter to start the ArangoDB shell.
3. Switch to the correct database:
db._useDatabase("transactionHistory");
4. Import the analyzers module:
var analyzers = require("@arangodb/analyzers");
5. Add the new analyzer:
analyzers.save("text_en_no_stem", "text", { locale: "en.utf-8", accent: false, case: "lower", stemming: false, stopwords: [] }, ["position", "frequency", "norm"]);

That's it! We've added the new analyzer!
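To get a feel for what an analyzer configured this way is likely to emit, the settings can be approximated in plain JavaScript. This is only a rough sketch: `approximateTokens` is a hypothetical helper, it does not call ArangoDB, and the real tokenizer may split some inputs differently.

```javascript
// Rough approximation of a "text" analyzer with case: "lower",
// accent: false, stemming: false and an empty stopword list.
// NOT ArangoDB code; approximateTokens is a hypothetical helper name.
function approximateTokens(input) {
  return input
    .normalize("NFD")                 // split accented letters into base letter + mark
    .replace(/[\u0300-\u036f]/g, "")  // drop the combining marks (accent: false)
    .toLowerCase()                    // case: "lower"
    .split(/[^\p{L}\p{N}]+/u)         // break on anything that is not a letter or digit
    .filter((token) => token.length > 0); // discard empty fragments
}

console.log(approximateTokens("Johannes Petrus Foley"));
// prints [ 'johannes', 'petrus', 'foley' ]
```

To see the tokens the real analyzer produces, the AQL TOKENS() function can be run from arangosh against "text_en_no_stem" with a sample string.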

A full-text index (FTI) can be used to find words, or prefixes of words, inside documents. The function that queries an FTI matches each search word exactly, and a document must contain all of the search words to be returned. See the samples below.

With the below dataset:

Johannes Petrus Foley

Johannes

Petrus

Foley

Johannes Petrus

Johannes Foley

Petrus Foley
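The "all words, matched exactly" rule can be modelled in a few lines of plain JavaScript over the dataset above. This is a simplified in-memory sketch, not the ArangoDB implementation: `words` and `matchAllWords` are hypothetical helper names, and the case-insensitive comparison is an assumption of this model.

```javascript
// Simplified in-memory model of "every query word must match a whole word".
// NOT ArangoDB code; it only illustrates the matching rule.
const names = [
  "Johannes Petrus Foley",
  "Johannes",
  "Petrus",
  "Foley",
  "Johannes Petrus",
  "Johannes Foley",
  "Petrus Foley",
];

// Lower-case a string and split it into whole words (assumed case-insensitive).
function words(text) {
  return text.toLowerCase().split(/\s+/).filter((w) => w.length > 0);
}

// Keep only the names that contain every query word as a complete word.
function matchAllWords(dataset, query) {
  const wanted = words(query);
  return dataset.filter((name) => {
    const present = new Set(words(name));
    return wanted.every((w) => present.has(w));
  });
}

console.log(matchAllWords(names, "Johannes Foley"));
// prints [ 'Johannes Petrus Foley', 'Johannes Foley' ]
```

In this model a partial word such as "Joh" matches nothing, because only complete words are compared.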

 

 

The following queries will yield the following results:

...