A Moby-Dick Gazetteer (beta)

Home About Methods Sources Data Summaries

Constructing the Moby-Dick Gazetteer

Key to styles on this page:

Rules for capturing place-names:

Capitalization and mappability

Category and type coding

A valuable feature of the Moby-Dick Gazetteer is that it includes coded attributes for each place-name. These are the category and type attributes.

Rules for assigning category and type

Handling quoted dialogue

If a sentence is part of quoted dialogue, then the following will occur regarding quotation marks:

Purpose of the normalized field

The normalized field contains a single (e.g. normalized) name for a set of place-names having the same root. For example:

Purpose of the modifier field

The modifier field, when used, provides the word (next or previous) to the place-name. This is done to add context and clarify the meaning of the term. There is special punctuation in the modifier field as follows:

Locations

How locations were collected:

Flow-chart of Categorization and Typing

Decision tree for assigning categories and names