Toggle Main Menu Toggle Search

Open Access padlockePrints

An Assessment of Name Matching Algorithms

Lookup NU author(s): Professor Brian RandellORCiD

Downloads

Full text is not currently available for this publication.


Abstract

In many computer applications involving the recording and processing of personal data there is a need to allow for variations in surname spelling, caused for example by transcription errors. A number of algorithms have been developed for name matching, i.e. which attempt to identify name spelling variations, one of the best known of which is the Soundex algorithm. This paper describes a comparative analysis of a number of these algorithms and, based on an analysis of their comparative strengths and weaknesses, proposes a new and improved name matching algorithm, which we call the Phonex algorithm. The analysis takes advantage of the recent creation of a large list of ""equivalent surnames"", published in the book Family History Knowledge UK [Park]. This list is based on data supplied by some thousands of individual genealogists, and can be presumed to be representative of British surnames and their variations over the last two or three centuries. It thus made it possible to perform what we would argue were objective tests of name matching, the results of which provide a solid basis for the analysis that we have performed, and for our claims for the merits of the new algorithm, though these are unlikely to hold fully for surnames emanating largely from other countries.


Publication metadata

Author(s): Lait AJ, Randell B

Publication type: Report

Publication status: Published

Series Title: Department of Computing Science Technical Report Series

Year: 1996

Pages: 38

Report Number: 550

Institution: Department of Computing Science, University of Newcastle upon Tyne

Place Published: Newcastle upon Tyne

URL: http://www.cs.ncl.ac.uk/publications/trs/papers/550.pdf


Share