|
Date: |
2007-12-13 |
|
Status: |
Release of 13 December 2007 (Public Release 7) |
|
Abstract: |
Documents the Registry of
Peoples (ROP) for the Harvest Information System (HIS). This registry
defines the standardized codes used for identifying the peoples of the
world.
|
|
Steward: |
International Mission Board, Southern Baptist Convention |
|
Editor: |
Orville
Boyd Jenkins, (researchguy@iname.com) |
Table of Contents
1.
Overview
2.
Code table
3.
Other tables
4.
Working with Ethnic
Entities by Country and Language
6.
Distribution |
The function of the Registry of Peoples is to
document the standardized codes used for identifying the primary peoples
(people groups) of the world. The ROP provides a reference list of ethnic
identifiers representing the discrete human aggregations of the world. Data
is encoded using the Latin-1 (ISO-8859-1) character set.
1.1 Identifiers
The Registry provides a unique code and
definitive name for each people.
1.1.1
ROP Code
ROP Codes are 6-digit, numeric fields that provide a
unique identifier for each ethnic entity. Each code is perpetual – it will
not be used more than once, even if the entity it identifies is removed from
the database.
1.1.2
ROP Name
ROP Names are identifiers recommended as standard
reference names. Each reference name is based on the self-name or a
representative construct name of an ethnic entity as determined by the
Registry Editor. Each name represents the largest cohesive group of
individuals considering themselves related and speaking one or more
languages.
These identifiers are stored in the
primary table of the ROP, ROP_Peoples.
1.2 Descriptors
The Registry provides two
descriptors for each people group. A minimum of two descriptors is necessary
to accurately define a people.
1.2.1
Location Descriptor
Each person in every people group lives in an
identifiable location. Thus each people group record will reference at least
one location descriptor, although it will likely reference more, since many
people groups of the world are spread across geo-political boundaries.
1.2.2 Language Descriptor
Each person in every people group speaks a language.
A people group record includes at least one language descriptor, and more if
the people group is multi-lingual.
1.3
Criteria for Ethnic Entity Identification
1.3.1
Ethnic Entities
A distinction between two ethnic groups, which are given separate
entries and assigned separate codes, derives from a large set of cultural
characteristics that vary in importance among human cultures and societies. Further detail on these
characteristics may be found in wide circulation in various academic
disciplines.
1.3.2 Determinations
Specific determinations result from extensive research at various
levels, and are intended to represent the self-identity of each listed
ethnic entity. In addition,
naming conventions and grouping are considered to take into account common
terminology and conventions for descriptions of human culture from relevant
disciplines.
1.3.3
Self-Identity
In general the term “ethnic entity” refers to the largest cohesive group
of individuals considering themselves related for reasons that may include
biological kinship, shared history, shared customs or other shared aspects
of self-identity, and speaking one or more languages.
1.3.4.
Descriptors
A people is defined by the combination of these and other factors
referred to as “ethnicity” or “culture.”
The definition of a discrete “people” requires a minimum of the two
descriptors location and language. As we investigate the real-life situations of human
existence, we do not find a one-to-one correspondence of peoples to
languages or of people names to language names.
1.3.5 Religion
Every people is associated with at least one religion. Religion may be a
significant enough factor to separate otherwise identical people groups,
constituting sufficient reason to create a separate entry for each in the
ROP. The ROP, however, does not attempt to provide a full reference of each
people group to all known religions.
1.3.6
Sub-groupings
It is often possible to identify smaller subgroupings in any of the
ethnic entities defined in the ROP.
The ROP does not provide codes for such subgroups. The ROP code for
the larger entity would be applicable to all these subgroups.
1.3.7
Social Strata
Additionally, it is possible to identify social strata or categories
which include segments of multiple people groups and may be useful
for communication and cultural access strategies. Users of the ROP codes may
want to link a variety of additional data to the provided ethnic codes in
any dynamic combination.
This is the primary table of the ROP. It contains the
ROP Code and the recommended standard reference name for each people. The
primary consideration for the standard name is the name by which that people
call themselves. Balanced with this are internal variations within the
people group itself, differences among sub-groups, common variations or
different names used in academic disciplines or standard reference sources,
variations in other languages or alternative transliterations. Common
variations will be included in the table of alternate names – ROP Alternate
People Names (q.v. below). Note that there is not a one-to-one
correspondence of peoples to languages or of people names to language names.
Note that the 116000 series entries are provisional and have not been fully
reviewed. These entries contain
likely duplicates of earlier entries. 53 of these remain in the release of
05 May 2003. Expect changes
related to these entries in the next update.
Column
Format
Description
ROP_PeopleCode
int
The HIS code for a people.
PeopleName
varchar(75)
The recommended name of the people.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the people record.
The ROP contains seven tables in
addition to the basic code table. The attached diagram shows how they are
keyed. Reference is also made in the diagram to three other tables,
ROG_GeopoliticalRegions, ROL_Language, and ROR_Religions, none of which are
part of the Registry of Peoples, but which represent primary tables in
separate registries. There are four kinds of tables in the ROP: the primary
code table, linking tables, supplemental data tables, and a change history
table. Each table, with the exception of ROP Change History, contains a memo
field for editorial comments, and a text field for entering source
information.
3.1
ROP Geopolitical Index
This is a linking
table that cross-references each ROP_PeopleCode to codes for one or more
geographical locations in which the people is documented to live.
The ROP
Geopolitical Index contains the following six columns:
Column
Format
Description
ROPGeoID
int
A unique code that represents each instance of a people in one or
more geographical locations as described in the HIS Registry of Geography.
ROP_PeopleCode int
See ROP_Peoples above.
ROG_GeoCode
char(2)
See Registry of Geography: ROG_GeopoliticalRegions.
ROPGeoName
varchar(75)
Name of people as represented in this geographic region.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the people record.
3.2
ROP Geopolitical Language Index
This is a linking
table that cross-references each people-geopolitical combination to one or
more languages they are known to speak. More than one people may be
linked to the same language, as necessary to represent documented
reality.
The ROP
Geopolitical Language Index contains the following five fields:
Column
Format
Description
ROPGeoLangID
int
A unique code that represents each instance of a language, as
described in the HIS Registry of Language, spoken by a people in one or more
geographical locations as described in the HIS Registry of Geography.
ROPGeoID
int
A unique code that represents each instance of a people in one or
more geographical locations as described in the HIS Registry of Geography.
ROL_LanguageCode char(3)
See Registry of Language: ROL_Languages.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the people record.
3.3 ROP Geopolitical Religion Index
This is a linking table that cross-references each
people-geopolitical combination to one or more
religions they are known to practice. More than one people group may be
linked to the same religion, as necessary to represent documented reality.
Note that the ROP Geopolitical Religion Index contains no data in this
Original Release, because the Registry of Religions is not yet complete. As
soon as the Registry of Religions is ready for a first release,
cross-reference work between it and the Registry of Peoples will begin,
followed by a data-populated release of the ROP Geopolitical Religion Index.
The ROP
Geopolitical Religion Index contains the following five fields:
Column
Format
Description
ROPGeoReligionID int
A unique code that represents each instance of a religion, as
described in the HIS Registry of Religion, spoken by a people in one or more
geographical locations as described in the HIS Registry of Geography.
ROPGeoID
int
A unique code that represents each instance of a people in one or
more geographical locations as described in the HIS Registry of Geography.
ROR_ReligionCode char(3)
See Registry of Religion: ROR_Religions.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the people record.
3.4
ROP Alternate People Names
This is a
supplementary table of alternate names by which this people may also be
known or listed in various sources. In some cases the name is one segment of
the group, or the name of a language or dialect associated with the primary
name or a segment.
The ROP Alternate
People Names table contains the following three columns:
Column
Format
Description
ROP_PeopleCode int
See ROP_Peoples above.
ROPAltName
varchar(75)
Alternate name of a people.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the people record.
3.5
ROP Dialects
This is a supplementary table of dialect names and
IDs. Dialect IDs are related back to corresponding dialect IDs in the ROP
Geopolitical Language Index, and thus to an ROL_LanguageCode and an
ROP_PeopleCode. A dialect may be either determinative or simply descriptive
of a people.
The ROP Dialects table contains the following fields:
Column
Format
Description
DialectID
char(5)
An identifier that represents a spoken dialect, created and
maintained by SIL.
ROL_LanguageCode
char(3)
Three letter code from the Registry of Languages ROL_Languages.
Dialect
varchar(50)
Dialect name.
Memo
varchar(255)
Space for originator’s and editor’s notes.
Source
varchar(25)
Origin of the dialect record.
3.6
ROP Supplemental Language Data
This is a supplementary table of language names and
IDs for speech forms of identified people groups whose speech has not yet
been assigned a separate code by the HIS Registry of Languages (ROL). A
language is named by designation of the narrowest grouping of related
languages, by standard language classification, plus the term
"Unclassified." For example, a language identified by linguists as a member
of the Tibeto-Burman group, but not further identified, would be assigned
the ID "XXB," with the Language Name "Tibeto-Burman, Unclassified." For
peoples whose language status is unknown, the ID "XXX" is assigned, with the
Language Name "Unknown."
The ROP Supplementary Language Data table contains
the following fields:
Column
Format
Description
3.7
ROP Supplemental Data
This is a
supplementary table that holds data that is related to specific
people-geopolitical combinations, and which in some cases, may help clarify
understanding of the group, but which is not present in any defining factors
for the group.
The ROP
Supplemental Data table contains following fields:
Column
Format
Description
ROPGeoID
int
A unique code that represents each instance of a people in one or
more geographical locations as described in the HIS Registry of Geography.
Ethno14Popl
numeric
The Ethnologue Fourteenth Edition number of speakers.
JPLPopl
numeric
Joshua Project population.
ISPD96Popl
numeric
Population from the 1996 IMB ISPD database.
AMOPopl
numeric
AMO (Asia Harvest) 2000 Database.
OmidPopl
numeric
Confidential Source from South Asia.
CPPIPopl
numeric
IMB Church Planting Progress Indicators database
population.
Memo
varchar(255)
Space for originator’s and editor’s notes.
3.8 ROP Change History
All changes to ROP_Peoples table are reported in ROP Change History.
This table is cumulative, listing all changes to successive versions of the
registry.
The ROP Change History contains the following fields:
Column
Format
Description
Code
int
The code that is affected by the change reported in this record.
Type
char(1)
A one-letter code indicating the type of change: There are four
possible values:
|
Code |
Action |
Meaning |
|
C |
Created |
The code is a new
one that has been added. |
|
E |
Extended |
The range of
meaning of the code has been extended by merger with a now retired
code. The description field tells what it was merged with.
|
|
R |
Retired |
The code has been
retired from use. The description field tells what code or codes
replace it. |
|
U |
Updated |
The name, primary
country, or status of the language has been changed. |
Date
char(10)
The date the change was released in a new version of the registry.
Dates are expressed as 8 digits with hyphens to separate the parts of the
date, e.g. YYYY-MM-DD.
Description
varchar(255)
Describes the change. In the case of R changes, it also describes
what a user should do to fix existing data that uses the now retired code.
Note that there is
not a change type for the cases of narrowing or extending the meaning of a
code, such as when the ethnic entity denoted by one code is split into two
ethnic entities. In such a case, the original code is retired, and two new
codes are added. In this way, the user of the code set is assured that once
a code has been used to tag an item of data, it will continue to be the
right code to use for as long as the code remains an active member of the
code set.
The change history
table holds the cumulative list of all changes that have ever been made to
the ROP_Peoples code table. For a site that has used ROP_Peoples codes in
its own database, an important use of the change history table is to
discover codes used in its data that are now obsolete and thus need to be
changed. These will be only the codes that have been retired.
The table ROP
Geopolitical Language Index provides a cross-reference of each ethnic
entity to each country in which they are known to be present, and with each
language a portion of that entity is known to speak. This will present a combination of people by country by any
language.
4.1
Same People in Many Countries
If research
indicates that groupings of people living in various countries consider
themselves to be one people (or research indicates this for classification
purposes), the same code will be assigned to that name in all those
countries. Thus in the tables linking People Names to Country Names separate
entries will show for each country, but all entries will have the same code.
|
ROP_PeopleCode |
ROG_GeopoliticalCode |
PeopleName |
Source |
|
100925 |
ML |
Bambara |
CPPI |
|
100925 |
GA |
Bambara |
CPPI |
|
100925 |
GM |
Bambara |
CPPI |
|
100925 |
GV |
Bambara |
Guinea
Research Network |
|
100925 |
FR |
Bambara |
BM |
|
100925 |
UV |
Bambara |
PIN / SIL |
|
100925 |
MR |
Bambara |
CPPI |
|
100925 |
NG |
Bambara |
CPPI |
|
100925 |
PU |
Bambara |
BM |
|
100925 |
SG |
Bambara |
CPPI |
|
100925 |
IV |
Bambara |
CPPI |
4.2 Same People or Other People by Same
Name in Many Countries
The multi-table join below illustrates how
codes will distinguish between peoples with the same name. Research may indicate the same name
occurs in several countries due only to a phonetic coincidence. In other cases somewhat related
ethnic entities consider themselves separate peoples for other reasons. Some entities may need to be
considered separate peoples for more technical academic or anthropological
reasons.
The difference in code assigned to this name
in that country will indicate that the entries represent separate ethnic
entities (people groups).
Sometimes language is the distinguishing factor (Tonga NDC and Tonga TOI in
Zimbabwe; geography also in this case). Sometimes it is other factors (a new lake and two generations
of separated history -- Tonga TOI in Zambia and Zimbabwe).
|
ROP_PeopleCode |
ROG_GeopoliticalCode |
ROL_LanguageCode |
PeopleName |
CountryName |
Source |
|
114836 |
TH |
TNZ |
Tonga |
Thailand |
Ethnologue |
|
110153 |
MZ |
TOH |
Tonga |
Mozambique |
CPPI |
|
114836 |
MY |
TNZ |
Tonga |
Malaysia |
Ethnologue |
|
114837 |
MI |