ATTENTION:

This page has been migrated to the Tazama GitHub repository and is now located at:

https://github.com/frmscoe/docs/tree/dev/Knowledge-Articles/Entity-Resolution

This page will no longer be maintained in Confluence.

Setup Analyzer

1. In Lens, open a shell on the ArangoDB pod.
2. Type arangosh in the shell and press Enter to start the ArangoDB shell.
3. Switch to the correct database:
db._useDatabase("transactionHistory");
4. Import the analyzers module:
var analyzers = require("@arangodb/analyzers");
5. Add the new analyzer:
analyzers.save("text_en_no_stem", "text", { locale: "en.utf-8", accent: false, case: "lower", stemming: false, stopwords: [] }, ["position", "frequency", "norm"]);

That's it! We've added the new analyzer!
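
To give a feel for what these analyzer options mean, the sketch below approximates them in plain JavaScript. It is an illustration only, not ArangoDB's implementation: the helper name approximateTokens is hypothetical, and the real "text" analyzer uses locale-aware word breaking that this simple split does not reproduce.

```javascript
// Rough approximation of the "text_en_no_stem" settings; NOT ArangoDB's own code.
// approximateTokens is a hypothetical helper used only for this illustration.
function approximateTokens(text) {
  return text
    .normalize("NFD")                 // split accented letters into base letter + combining mark
    .replace(/[\u0300-\u036f]/g, "")  // accent: false -> drop the combining marks
    .toLowerCase()                    // case: "lower"
    .split(/[^\p{L}\p{N}]+/u)         // break on anything that is not a letter or digit
    .filter((token) => token.length > 0);
  // stemming: false and stopwords: [] -> no token is shortened or removed
}

console.log(approximateTokens("Johannes Petrus Foley"));
// prints [ 'johannes', 'petrus', 'foley' ]
```

To see the tokens the real analyzer produces, the AQL TOKENS() function can be run from arangosh against "text_en_no_stem".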

A full-text index (FTI) can be used to find words, or prefixes of words, inside documents. The function used to query an FTI searches for every word given and matches each one exactly. See the samples below.

With the below dataset:

Johannes Petrus Foley
Johannes
Petrus
Foley
Johannes Petrus
Johannes Foley
Petrus Foley

The following queries will yield the following results:

...
