Geographical information illuminates many features of culture that cannot be seen otherwise. Higher-resolution mapping of cultural traits, now possible with computerized techniques, can open a new window on human cultural adaptation. It can look at where people live on a scale that is small enough to reveal the features of the environment to which their cultures respond. This article discusses the techniques of mapping of language distribution in a rural area of Mexico. Much GIS technology has become so elaborate that it is not appropriate to cultural anthropology. The project described was successful because it developed and used techniques that were appropriate to the problem being studied and the situation in the field. It was carried out in 1990 with what is today rather primitive software and hardware. The techniques are still applicable to the mapping of cultural data.
Geographic information system (GIS) technology holds considerable promise as a research tool in cultural anthropology. In the last decade anthropologists began to explore the use of Geographical Information Systems (GISs). White (1985) notes that there were over a hundred responses to an solicitation of professional interest put out by the Anthropology Working Group in Remote Sensing in 1985 ("Working group," 1985).
Although the potential of GIS technology in cultural anthropological research was great, it was not realized in 1990, when this project was carried out. The needs of companies seeking to exploit the earth's mineral riches and of governments seeking to control these resources dominated the development of GISs after 1985. Data on human populations and on their social characteristics had been poorly integrated into GIS data bases used by others (Martin & Bracken, 1993). Although GIS technology is still spreading at a furious pace, [1] it is not being aimed at the needs of the cultural anthropology with the exception of archeology (Brandt, Bert, & Kvamme, 1992; Robbins, 1984).
Now that desktop microcomputers are becoming more powerful, GIS software, ranging in price from $1000 to $6000, is appearing for them. Yet, commercial interests still dominate the scientific. The software is not of great interest to cultural anthropologists because much of it is intended for use with maps of the United States rather than with maps of other countries, and large part of it is aimed at regional marketing analysis (Smith & Eglowstein, 1993).
The slow adoption of GIS techniques in cultural anthropology is also due in part to the way in which much cultural anthropological research is carried out. Ethnography, the study of an individual culture by one researcher, seldom gets a good geographical picture of the cultural traits with which it deals because its methods focus on individuals and communities rather than on regions. There are also problems in dealing with the low level of technological infrastructure in the many developing countries in which cultural anthropologists work. The hazards of doing field work over a wide area in an underdeveloped region often limit geographical understanding.
At the same time the promise of GIS techniques is improving. Data acquisition is becoming easier. In the last decade the conditions limiting the acquisition of geographical information have changed considerably. Modern communication has brought people in contact with each other. Information about what people are doing and where they live is now available from new sources such as remote sensing, missionaries, government agencies, health teams, NGOs, etc. Topographic maps are being made and aerial surveys are being conducted in new areas of the globe. Portable global positioning systems (GPS) can be taken into the field to locate people and villages on a global scale. The processing of all this information has been enhanced by rapid progress in digital electronic technology.
Thus, GIS techniques are more useful than ever before in cultural anthropology and the other social sciences, but it is necessary for social scientists to develop the techniques that are appropriate to their research needs. In 1990 I found it possible to put together readily available microcomputer software to deal with the problem of locating cultural groups and examining their relationship to natural and man-made environments. I discovered that general CAD (computer automated design) and data base software were better suited to dealing with these problems than specialized GIS software.
What is the essence of a GIS? Fundamentally it consists of: (1) one or more data bases of information, (2) a relational data base manager that brings together the data from the different data bases, and (3) a program that draws maps. One does not need the relational data base manager if all the information is kept in a single data base. Having all the data together in one data base actually permits more sophisticated statistical analysis. [2] In political science or sociology, where one might want to make use of preexisting data bases, relational data base management is more appropriate, but having the relational data base manager packaged with a GIS can inhibit statistical analysis. The relational data base management features of commercial GISs are usually not needed for cultural anthropological research done in the field.
There are two types of drawing programs used in modern GISs, the raster type and the vector type. The raster type can use remote sensing data more easily and has become very popular among analysts dealing with information on physical features of the earth (Martin & Bracken, 1993). The vector type precisely describes geometrical figures such as points, lines, circles, and three dimensional surfaces. Vector types of drawing programs are more adapted to describing data that has been collected by traditional ground-based means.
A commercial GIS was not used in the research described below. Instead, a common microcomputer data base, Reflex 2.0, was integrated with a common vector CAD program, AUTOCAD. This had the following advantages: (1) the learning process was shortened because the programs were better known; (2) statistical analysis could be performed on the data independently; (3) simpler versions of the software and pieces of the overall system could be run on smaller, cheaper field computers; and (4) the hardware needs of raster-type GIS software were avoided. The drawing in Figure 1 illustrates the integration of the components in the system that was used.

The cultural feature examined was language. Language is traditionally used as the primary cultural marker of native cultures in Mesoamerica. The new maps covered the eastern sierra of Hidalgo, Mexico, parts of the southern Huasteca, and parts of the northern Sierra de Puebla from 20° to 20° 45' north latitude and from 97° 40' to 98° 30' west longitude. Only vague maps of the location of the different linguistic groups had been previously published. The distributions of five regional languages, Spanish, Ñähñu, Nahua, Totonac, and Tepehua were newly mapped. The project followed these steps: (1) The settlements being studied were located by latitude, longitude, and altitude. (2) The locations were entered into a data base. (3) Data on the language spoken at these places was collected and added to the data base. (4) Map coordinates were calculated from the latitude and longitudes. (5) An intermediate DXF file was written from the data base and then read into AUTOCAD. (6) The locations of the villages where the various languages were spoken were displayed graphically. (7) Language boundaries and other features were drawn electronically on layers over the displayed data. (8) Maps of the language distributions were printed from the computerized drawings.
The field computer was located in a rural village with AC electric power. The original field computer was an XT clone with a coprocessor. It was later replaced by a 286 AT-type microcomputer. All software ran under MS-DOS. An XT lap-top computer without a hard disk was used to take the data base to informants. At a U.S. university, a 286 clone was used to add features to the maps and to print them in final form. Thus, hardware requirements were quite minimal even for the 1990 period in which the data was collected.
A DOS program, Reflex 2.0 managed the data base. It was capable of performing elementary univariate and multivariate statistics and could display the data in a variety of graphs. Some statistical packages like SAS offer geographical displays, but they do not integrate well with the CAD programs that are necessary for adding, drawing, and digitizing other geographical data. By 1998 standards the software and hardware used was minimal and could be acquired on the junk computer market for a pittance, yet the results it produced were unique and very valuable to cultural anthropology. Any data base manager capable of writing text files in a specified form would work quite well with the techniques that were employed.
The data base consisted of one record for each settlement. The information was displayed in Spanish to make it usable by Mexican collaborators. Table 1 lists some of the important fields in the data base records. Reflex 2.0 not utilizing extended or expanded memory in DOS needed only 23% of its available RAM for the 751 settlements that were entered. In fact, one need not worry too much about the data base manager with which such a mapping project starts because modern microcomputer data base managers permit the exchange of data between different programs.
|
|
|
|
| NO | integer | A unique number assigned to the settlement |
| MAPA | text | The map or geographical table from which the settlement name and coordinates were taken |
| LOCALIDAD | text | Full name of the settlement |
| LATDEG | numeric | Degrees of latitude of the settlement, without fractions. North latitude is expressed as a positive number. |
| LATMIN | numeric | Minutes of latitude, with decimal fractions. |
| LONDEG | numeric | Degrees of longitude, without fractions. West longitude is expressed as a negative number |
| LONMIN | numeric | Minutes of longitude, with decimal fractions |
| ALT | numeric | Altitude in meters above sea level |
| LENGUA | text | Code for the languages spoken in the settlement school. See Table 2 |
| YMAP | numeric | Map coordinate calculated in kilometers south to north from map zero. |
| XMAP | numeric | Map coordinate calculated in kilometers west to east from map zero. |
All the known settlements in the region were located. The coordinates were taken from three sources: (1) tables of topographical data, when they could be located in libraries (INEGI, 1987, 1988); (2) tables of coordinates on the back of topographic maps, when they were published; and (3) topographic maps themselves. The main source of coordinate information was the topographic maps. The 1:50,000 series produced by the Dirección General de Geografía of the Instituto Nacional de Estadística, Geografía, e Informática (INEGI) were used. Comparisons of the tables with the maps showed that the maps represented more recent and more accurate photogrammetric studies. A small, paper, square ruler allowing quick interpolation between latitude and longitude was invented to speed up the taking of coordinate data from the topographic maps. It was printed on heavy paper in the field from an AUTOCAD drawing and scaled to the maps. Modern printers have accurate positioning motors and are capable of producing cheap custom drawing instruments of this sort.
Language data were provided by bilingual school teachers from their knowledge of the languages spoken by children in the local schools. Therefore the coordinates of a settlement were assigned to the settlement school, when it could be located on a topographic map. Otherwise, the coordinates of the center of the dwellings was taken as the language location.
When entering coordinates, it is important to consider whether or not they are appropriate to the other data being entered for the points specified. Sometimes a project needs the coordinates of a family dwelling, at other times the coordinates of a village. Perhaps several coordinates for a village might be needed, one for each church in a study of religion for example. Sometimes an area instead of a point is the proper geographical reference, for example, if types of land use are being considered. In vector CAD programs areas can be specified by lines enclosing them. Now that relatively inexpensive GPS satellite receivers are available, they should be used by anthropological field workers when gathering survey data at locations that do not otherwise appear on maps.
The means of acquiring geographical data can make or break a mapping project. One should explore carefully new potentials for gathering and storing geographical data. Cultural data can be drawn from many sources. Sometimes it is available in government reports and archives. Sometimes it is gathered by missionary or development projects. Western cultures have spread the idea of gathering social and cultural data, so organizations all over the world are now accumulating data that is useful for a variety of purposes.
To obtain the language data for this project, repeated sessions were held with bilingual school teachers at regional centers of the Secretaría de Educación Pública (SEP). The teachers looked at lists of settlements and at preliminary maps showing the relative location of the settlements. They identified the languages spoken by the students in the local schools. Their information was entered into the geographic data base. There were always several teachers present during the identification process. One teacher would correct the observations of another if necessary. This added an extra measure of reliability to the data. Thus, a great deal of reliable information was recorded without time consuming travel in the rugged rural areas. Although they lived elsewhere, teachers visited the regional centers quite often for training and coordination. They were energetic travelers in the remote mountain regions. Their enthusiasm and dedication were great assets to the project.
At times the data provided by the teachers was entered directly into the data base with a lap-top computer. A small lap-top without a hard disk was adequate to run the data base manager, Reflex 2.0. At other times printed versions of the data base were annotated during interviews. The settlements on printed versions were sorted into latitude, or longitude, bands so that nearby settlements appeared on the same page to make the identification process more orderly. Preliminary maps were produced by a dot-matrix printer connected to a the non-portable field microcomputer to help the teachers locate contiguous settlements. Xeroxed sections of 1:250,000 topographic maps were also used at times to help the teachers.
The teachers were able to reliably identify languages spoken in 439 of the settlements in the data base. This produced a map of language distribution that had a resolution at least ten times greater than any linguistic map of the region heretofore produced. The increase in resolution resulted from a fusion of new easily accessible sources of data, the bilingual teachers, and the rapid processing of the geographic data on field microcomputers.
The language data were coded in the LENGUA field of the data base. The codes are shown in Table 2. Often, the teachers reported two or more languages spoken by students at a school. This was due either to their being two cultural groups in the area or to multilingualism. Thus, the LENGUA field could contain codes for several languages. For example, the code "EToN" indicated that Spanish, Totonac, and Nahua were spoken in a settlement school. The mapping techniques did not require the codes to be in a particular order. The teachers were asked to give only reliable information and, when they were uncertain, to refer to others who knew the area better. When two or more teachers gave contradictory indications of a language, a group discussion was held with the teachers in order to resolve the ambiguity. In all cases, they were able to decide what the correct language designations should be. The process of entering the coordinates of new settlements into the data base and of obtaining information on the languages was repeated several times.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The standard projection for relatively small-scale maps of this sort is a transverse Mercator projection. A nearby point of latitude and longitude, 20° N by 98° W, was selected as the origin of the map coordinates and the point of tangency of the projection. If an origin-tangent point has latitude LA and longitude LO measured in degrees, the map coordinates XMAP in the horizontal direction and YMAP in the vertical direction are given by the following formulae. The other variables are described in Table 1. Note that west longitude is expressed as a negative number because it is measured from right to left from the prime meridian. [3]
YMAP = 1.85325 ×((60 × LATDEG) + LATMIN - (60 × LA))
XMAP = 1.85325 × cos(LA) × ((60 × LONDEG) + LONMIN
- (60 × LO))
The graphical part of the mapping project was carried out with the program AUTOCAD, run first on the field computer and later on a microcomputer at the university. AUTOCAD is a typical vector CAD program. It stores drawings in the computer as a set of entities such as lines, circles, points, planes, cylinders, etc. These are located and described numerically in a three dimensional space. Maps are just one type of drawing. Map data may be entered directly with a mouse or digitizer or it may be read by the program from other computer files. Both means of entering map data were utilized in the project. The program allows the operator to view and print any part of the drawing from any perspective. Parts of the drawing may be magnified, and view points may be taken from any location. In this mapping project views of only two dimensions were needed. However, the data were entered in three dimensions at little cost in time, the altitude of the settlement being the third dimension. The altitude data can be used at a later date for other displays. There are many features of AUTOCAD that were not used in this project. AUTOCAD is simple to use and the user need not be concerned with features that are not activated. The project started out with Version 2.62 of AUTOCAD and ended up with Version 10, both were early DOS versions of this program that has now evolved into something more sophisticated.
An important step in cultural mapping is transferring the data from a data base to the map. This can be done by writing a file from the data base and reading it into the CAD program. In this project all field data were recorded in the data base, not on the map. The locations of the settlements, their names, numbers, altitudes, and languages were transferred to the map by writing and reading a computer file. Such a file describes rather than draws the objects in the drawing. In general, data base programs do not write specialized CAD files, but CAD programs do read text files that can be written by data base programs. Over the years AUTOCAD has developed a special text file format for this use. It is called "the DXF format" (Omura, 1988) and has been adopted as a standard by many other CAD programs.>
There are other methods of entering data into a map created by a CAD or GIS program. Very recently more powerful portable computers have made it possible to enter language information directly into a digital map in the field. However, such data entry creates problems. For example, if one had not located a village correctly on the map, the coordinates of the language information, if entered directly into the map, would be incorrect. A separate data base permits better control of the data. Having the data base separate from the CAD program has other advantages as well. A separate data base can be managed by much less powerful, even expendable, portable field computers and can be used in the field more flexibly. It also allows much more sophisticated statistical analysis to be performed on the data. Specialized GIS programs that simultaneously manage a data base and a drawing together are often too large, too expensive, too specialized, or too demanding of hardware to be useful in anthropological research away from the conveniences of a modern office.
The best way of understanding how to transfer data in the DXF format is to draw a very simple map and to export it as a DXF text file. The DXF file will reproduce the drawing when given as input. One can then examine the exported file to see where the data is placed and copy the input into the same format. Also, one should consult handbooks on AUTOCAD to understand the general structure of the DXF format. Once the format for the DXF input to the language maps was determined, a "report" form was set up in the database manager to insert the data for each record into the correct locations in a DXF text input file. Figure 2 shows the layout of the form which generated the DXF file transferring the language codes to the map. Other forms were created to transfer other things such as altitude. A single large form could have been written to transfer all the geographic information at once.

After the AUTOCAD program read the drawing information from the DXF file, the pattern of language codes appeared on the video screen. The location of the each settlement was at the lower left corner of its code according to the AUTOCAD convention. Figure 3 shows a section of the map with the language codes displayed. At this time it was necessary to draw the language boundaries. Conventions had to be established, as described below, to locate the most probable place for a boundary. Using these conventions, language boundaries were drawn with a mouse directly in the AUTOCAD program.

An important feature of AUTOCAD, and other CAD programs, is that drawings are separated into "layers" that can be turned on and off before printing. The settlement language codes were placed on a layer called LANG. Layers can be specified in the DXF file, so that the entities go into the drawing on their assigned layers. The DXF layer designator LANG, for the language codes, can be seen in Figure 2. Before printing the final maps the LANG data layer was turned off. Only language boundaries, located on other layers, were displayed. There was a separate layer for each language. The different language distribution maps are actually the same drawing printed with different layers turned on or off.
First, a multi-language map was prepared by putting settlements in which several languages were spoken on the boundary between two of these languages. When more than two languages were spoken in a settlement, a compromise had to be reached. It became apparent that a single map with all the languages could not do justice to the area of influence of a particular language. In fact the languages interpenetrated each other. There were multilingual hamlets and villages. Therefore, separate maps were prepared for the five different languages. In preparing these maps a settlement was included within the language boundary if there was a population of speakers in the school.
An exceptional village, Loma Bonita (20° 26.3' N, 98° 5.2' W), seen in the upper right of Figure 3, contained a population that spoke all five languages Nahua, Ñähñu, Totonac, Tepehua, and Spanish and could not be perfectly harmonized within the language contours. Loma Bonita is a special place deep in the sierra to which peasants migrate to work on coffee plantations. It is a true multiethnic community that was not easily located within a single language area. The remainder of the villages fitted neatly into areas, once the decision was made to draw separate boundaries for each language. Wage labor migration, although extensive in this region, did not confuse the cultural geography at these higher levels of resolution. In general, one can assume that, as one increases the resolution of cultural geographic mapping, the complexities of migration patterns will reveal themselves and inform rather than confuse the anthropological interpretations.
In general, one has to map cultural traits with caution. Not every cultural trait will have a distinct geographical distribution. For example, would one expect to find a geographical separation of American families that drank Coca Cola vs. families that drank Pepsi Cola? Geographical clustering first has to be verified empirically if it is not immediately apparent. The boundary approach used in this project is not always appropriate to the problem under investigation. Three dimensional cluster analysis and/or density maps may be more appropriate to other kinds of data.
Anomalies appeared as the language boundaries were drawn. An isolated village speaking one language would appear amidst an area where another language predominated. These anomalies had to be checked with the informants. If the maps were not produced on a field computer, the process of checking anomalies would have taken very much longer. Map sections containing the anomalies were printed on a field printer. The villages in question were also located in the data base. The bilingual teacher centers were visited with a list of anomalies to be discussed. In most cases the problem was an incorrect identification of the particular settlement. The data base was so large that it contained many settlements with the same name. The informants had to know precisely which settlement was meant when queried for linguistic data. Informants could become confused, so the data had to be collected slowly in a relaxed atmosphere. It was helpful to have several teachers working on the problem during an interview session. In some cases, such as Loma Bonita, the anomalies were real.
After the language boundaries were drawn, other details were added to make the maps more informative: latitude and longitude lines, major towns, a scale of kilometers, and labels. Colors could have been printed with a color printer, but color reproduction was not needed at this level of complexity. In the field, a nine-pin dot-matrix printer gave enough resolution to the maps that were used by the informants. AUTOCAD allows one to enlarge any section of the drawing for printing; therefore, higher resolution printers were not necessary. Finally a laser printer was used when it became available. A 300 dot-per-inch laser printer produced maps on 8.5 x 11 inch paper that had more resolution than a plotter at 12 x 17 inches. The final map of one of the languages, Ñähñu, is shown in Figure 4.

Interesting results emerged from this fairly simple language boundary analysis. The primary reason for these new results was simply the higher resolution of the maps. No one had a perfect picture of exactly where these cultural groups were located before this project was carried out. For example, an ethnic group, the Tepehua, that had been declared to be "dispersed" in the literature turned out to have two definite areas of concentration. The Ñähñu shown in Figure 4 had a "hole" in their home territory that was filled by Spanish speakers, and so forth.
The mapping procedures described above are applicable to many types of field research in the social sciences. Such maps can be produced quickly and economically in the field. Not only did the project move quickly, it also produced maps with resolutions better than anything done before. High resolution cultural mapping can reveal significant features of historical and human environment interactions that low resolution mapping cannot. One can compare it to the Voyager space craft trips to the outer planets. Some general features of the outer planets were known before, but the scientifically valuable knowledge came only after high resolution photos were returned by space craft. Similarly, high resolution cultural mapping opens a window on human adaptation and human history. The advantages of doing the computer mapping with a separate data base and CAD program are: (1) One does not have to invest money in expensive specialized GISs. (2) Data input and output are more flexible. (3) Less computer power is needed at any one time. (4) Operators do not have to learn a completely new system. (5) The separate data base permits spatial analysis to be raised to a more sophisticated level on other statistical and GIS programs.
The GIS that has been described allows the cultural anthropologist to remain close to the source of the field data. It allows him or her to check the data with informants and to work with computers in the field. In employing new technology to solve old problems, cultural anthropology can retain the strengths of its field work principles. It can maintain close contact with its informants and the culture that it is examining.
* James Dow is Professor of Anthropology at Oakland University in Michigan. He can be contacted by email. More information appears on his web page. An earlier version of this paper was presented at the 13th International Congress of Anthropological and Ethnological Sciences held in Mexico City, July 29 - August 5, 1993. Another version appears in the Social Science Computer Review 12(4):479-492 in 1994. The author is grateful for the aid received from school teachers at the SEP centers in Huyacocotla, Ver., Tenango de Doria, Hgo. and Huachinango, Pue. He particularly thanks Prof. Ramiro Cajero José, and Dr. Roberto Garrido G.
1. GIS revenues were over a billion dollars per year in 1992 ("Standards," 1992).
2. This ability to retain all the data in a single data base can be an advantage in larger GIS projects too (Batty 1992).
3. The constant 1.85325 is the number of kilometers in a nautical mile.
Batty, P. (1992). Exploiting relational data base technology in a GIS. Computers and Geosciences 18, 453-462.
Brandt, R., Groenewoudt, B. J., & Kvamme, K. L. (1992). An experiment in archaeological site location: Modeling in the Netherlands using GIS techniques. World Archeology 24, 268-282.
Instituto Nacional de Estadística, Geografía, e Informática. (1987). Síntesis geográfica, nomenclátor y anexo cartográfico del Estado de Puebla. Mexico City: Author.
Instituto Nacional de Estadística, Geografía, e Informática. (1988). Síntesis geográfica, nomenclátor y anexo cartográfico del Estado de Veracruz. Mexico City: Author.
Martin, D., & Bracken, I. (1993). The integration of socioeconomic and physical resource data for applied land management information systems. Applied Geography 13 (1), 45-53.
Omura, G. (1988). Mastering AUTOCAD (Second ed.). San Francisco: Sybex.
Robbins, C. (1984). Computer-assisted mapping. Practicing Anthropology, 6(2), 15.
Smith, B. & Eglowstein, H. (1993). Putting your data on the map. Byte 18, 188-190.
Standards for geographic information systems (GIS) under development. (1992, December). ASTM Standardization News.
White, Douglas R. (1985) . Satellite images, ecology, and geographic information systems. World Cultures 1(3) [Electronic Journal on MS-DOS disk]. Columbia, SC: Analytic Technologies, 306 S. Walker St. (Distributor).
Working group formed to focus on anthropology and remote sensing. (1985) Science 229, 292.
AUTOCAD, Autodesk, Inc. Sold only through authorized vendors. For nearest vendor call 800-964-6432.
REFLEX 2.0, Borland International, Customer Service, P.O. Box 660001, Scotts Valley, CA 95066-0001; 800-331-0877
SAS, SAS Institute Inc., SAS Campus Dr., Cary, NC 27513; 919-677-8000.
SYGRAPH, SYSTAT, Systat Inc., 1800 Sherman Ave., Suite 801, Evanston, IL 60201; 708-864-5670.