HomeData Model

Data Model

Data Model

The data originates from two spreadsheets. The mentions spreadsheet contains nearly 50,000 rows. The manuscript spreadsheet contains 629 rows. Each row in the mentions spreadsheet indicates the manuscripts that cite the mention.

The data model shown below is constructed by a computer program (the analyzer) that reads and analyzes the mentions and manuscripts spreadsheets to create the types of items listed below and to create relationships among them.

Item Types

  • Manuscript - Contains the data for a single row in the manuscripts spreadsheets.
  • Mention - Contains the data for a single row in the mentions spreadsheet.
  • Event - Derived from a set of rows in the mentions spreadsheet that have the same values in the DATE and PLACE columns. Combines the event-related data from all of the rows in the set.
  • Person - Derived from a set of rows in the mentions spreadsheet that have the same value in the IDENTIFIER column e.g. 'Abraham 1'. Combines the person-related data from all of the rows in the set.

Relationships

The analyzer established relationships among the items listed above as shown in the diagram below. The kinsman relationship from a Person to a Person has four flavors: spouse, child of / parent of, sibling, and kinsman (for other kinship such as cousin or uncle). When you view an item, you'll see the relationships that the analyzer created for it.

Each item in the model is stored as a record in a MySQL database. Each item has many elements, one for each field in the item (corresponding to columns in the original spreadsheets). Each element, and each relationship, is stored as a separate record. Altogether, the database contains approximately 1.4 million records to store the entire data model.