<link href="template.css" rel="stylesheet" type="text/css"/> <meta content="urn:uuid:3933952e-46b0-4527-893e-d888a17f5290" name="Adept.expected.resource"/> </head> <body> <table class="cn"><tr><td class="top"><p class="LessonHead">Lesson</p> <p class="ChapterNumber">2</p></td><td class="bottom"><p class="ExerciseMainHeadline"><b>Preview the data</b></p></td></tr></table> <p class="BodyText"><span class="LeadLine">YOU NEED THE RIGHT DATA</span> to solve the problem of where to locate a park. You explored the study area in <a href="Lesson001.xhtml">lesson 1</a>. Now you can proceed more systematically. What data do you have? How useful is it? Is there data that you need but don’t have? Has the problem been stated clearly enough for you to know what data you need?</p> <p class="BodyText">Acquiring, evaluating, and organizing data is a big part of an analysis project. This book doesn’t fully re-create the complexity of the real world, because all the basic data required is provided. But much of the data isn’t project-ready, and that need for further preparation reflects the real world of GIS.</p> <p class="BodyText">The first thing you’ll do in this lesson is draw up a planning document to help keep your tasks in focus. You’ll use this document to list the guidelines for the new park and translate them into specific needs for spatial and attribute data.</p> <p class="BodyText">After you itemize your data requirements in general terms (park data, river data, and so on), you’ll take stock of your source data and investigate its spatial and attribute properties. You’ll also familiarize yourself with metadata, which is the data you have about your data. Before you decide to use a particular dataset, you may want to know things such as who made the data, when, and to what standard of accuracy.</p> <p class="BodyText">Once you have a better working knowledge of your data, you’ll reframe the problem statement. 
GIS is a quantitative technology: you can’t analyze a problem until it’s been stated in measurable terms. Wherever you find the city council’s guidelines to be vague, you’ll replace them with hard numbers.</p> <p class="img"><img alt="" class="img3" src="images/Page-065.jpg"/></p> <p class="pimg"><img alt="" class="img3" src="images/Page-066.jpg"/></p> <p class="H1" id="sec3"><b>Exercise 2a: List the data requirements</b></p> <p class="BodyText">You must relate the guidelines for the new park to data requirements for the project.</p> <p class="H2"><b>Open the data requirements table</b></p> <p class="BodyText">A table has been made in advance to help you keep track of your requirements. It’s an informal document, but it will still be helpful. You can refer to it as the data requirements table.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Open Windows Explorer and navigate to C:\EsriPress\UGIS4\ParkSite\MapsAndMore.</p> <p class="ExerciseStep"><span class="ex">2)</span>Double-click the file DataRequirementsTable.doc to open it in Microsoft<sup>®</sup> Word.</p> <p class="tip"><span class="tip"><img alt="" src="images/bl.jpg"/></span><b>If you don’t have Microsoft Word, open the RTF version of the document in another app, or print the PDF version and fill it out with a pencil.</b></p> <p class="img"><img alt="" class="img3" src="images/Page-067.jpg"/></p> <p class="H2"><b>List the requirements</b></p> <p class="BodyText">In this section, you’ll review the city council’s guidelines (refer to “Park guidelines” in <a href="Lesson001.xhtml">lesson 1</a>, under “Frame the problem”) and describe in a general way the data needed to satisfy them. The specifics of choosing datasets are presented in <a href="Lesson003.xhtml">lesson 3</a>.</p> <p class="BodyText">The first guideline was to find a vacant piece of land at least one-quarter acre in size. 
You can break this down into three requirements:</p> <p class="BulletList"><span class="bl">•</span>Land parcel</p> <p class="BulletList"><span class="bl">•</span>Vacancy</p> <p class="BulletList"><span class="bl">•</span>Size</p> <p class="BodyText">The requirement for a land parcel is already listed in the table. You need spatial data representing parcels so that you can see candidate sites on the map.</p> <p class="BodyText">The second requirement is vacancy, which is a characteristic, or attribute, of a parcel. In a GIS dataset, vacancy is often listed with other descriptions of land use (commercial, residential, industrial, and so on). In general terms then, you’re looking for a land-use attribute.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In row 2 of the table, under Attribute Data, type (or write) <span class="Step-strong">land use</span>.</p> <p class="BodyText">The third requirement is that the park be one-quarter acre or larger. Like vacancy, acreage is an attribute, although it is one that can be calculated by the software. Because ArcGIS Pro can convert one unit of area to another, you don’t even have to start with acres—any measurement of parcel size will suffice.</p> <p class="ExerciseStep"><span class="ex">2)</span>In row 3, under Requirement, type <span class="Step-strong">a quarter acre or more</span>. Under Attribute Data, enter <span class="Step-strong">area</span>.</p> <p class="BodyText">The second guideline under “Park guidelines” is that the park be within the Los Angeles city limits. This sounds like spatial data, and you’ll treat it that way for now. 
(It could be an attribute, too, because a field in a table might store the name of the city in which each parcel is recorded.)</p> <p class="ExerciseStep"><span class="ex">3)</span>Fill in row 4 as you think it should look, and then check the figure.</p> <p class="img"><img alt="" class="img3" src="images/Page-068.jpg"/></p> <p class="BodyText">The third guideline is that the park be as close as possible to the Los Angeles River.</p> <p class="ExerciseStep"><span class="ex">4)</span>In row 5, for the requirement, enter <span class="Step-strong">near LA River</span>. Under Spatial Data, enter <span class="Step-strong">rivers</span>.</p> <p class="BodyText">Using spatial datasets of parcels and rivers, you can measure the distance from any given parcel to the river.</p> <p class="BodyText">The fourth guideline is that the park not be in the vicinity of an existing park; in other words, it should be located away from existing parks.</p> <p class="ExerciseStep"><span class="ex">5)</span>Fill out row 6 as you think it should look.</p> <p class="BodyText">The fifth guideline also needs to be broken down. You need a neighborhood (spatial data) that has the following:</p> <p class="BulletList"><span class="bl">•</span>high population density (attribute data) and</p> <p class="BulletList"><span class="bl">•</span>lots of children (attribute data).</p> <p class="BodyText">Neighborhoods tend not to have formal boundaries, so you’re probably not going to find them as such in a spatial dataset. As a proxy, or substitute, you’ll use a set of small, standardized areas defined by the US Census Bureau: either the tracts or block groups you looked at in <a href="Lesson001.xhtml">lesson 1</a>.</p> <p class="ExerciseStep"><span class="ex">6)</span>In row 7, enter <span class="Step-strong">in a neighborhood</span> as the requirement. 
Enter <span class="Step-strong">census unit</span> for the spatial data.</p> <p class="ExerciseStep"><span class="ex">7)</span>In row 8, enter <span class="Step-strong">densely populated</span> for the requirement and <span class="Step-strong">population density</span> for the attribute data.</p> <p class="ExerciseStep"><span class="ex">8)</span>In row 9, enter <span class="Step-strong">lots of kids</span> for the requirement. For the attribute data, enter <span class="Step-strong">age</span>.</p> <p class="BodyText">The sixth guideline is that the park be in a lower-income neighborhood. You don’t need to repeat the spatial requirement for a neighborhood from step 6.</p> <p class="ExerciseStep"><span class="ex">9)</span>In row 10, enter <span class="Step-strong">lower income</span> for the requirement and <span class="Step-strong">income</span> for the attribute data.</p> <p class="BodyText">The last park guideline is to serve as many people as possible. For this guideline, you need a population attribute.</p> <p class="ExerciseStep"><span class="ex">10)</span>In row 11, enter <span class="Step-strong">serving the most people</span> as the requirement and <span class="Step-strong">population</span> as the attribute data.</p> <p class="img"><img alt="" class="img3" src="images/Page-069.jpg"/></p> <p class="BodyText">Eventually, you’ll want to make a map of potential sites, and you may need some additional data for cartographic purposes. For example, political boundaries and roads put maps in a familiar context. Physical relief creates texture, and imagery provides realistic detail.</p> <p class="ExerciseStep"><span class="ex">11)</span>In rows 12 to 15, enter <span class="Step-strong">final map</span> for the requirement. Under Spatial Data, list the four examples just mentioned, one per row: political boundaries, roads, physical relief, and imagery.</p> <p class="img"><img alt="" class="img3" src="images/Page-070.jpg"/></p> <p class="ExerciseStep"><span class="ex">12)</span>Save and minimize the table. 
You’ll continue to use it in the next exercise.</p> <p class="H1" id="sec4"><b>Exercise 2b: Examine the data</b></p> <p class="BodyText">Now you can see what data you actually have on hand. To do this, you’ll work in the Catalog pane. In <a href="Lesson001.xhtml">lesson 1</a>, you used the Catalog pane to manage your maps and data folders. The Catalog pane is great for going back and forth between your map and your data (which is what you do most of the time).</p> <p class="H2"><b>Get started</b></p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Start ArcGIS Pro, if necessary, and open your LARiverParkSite project. You’ll continue working with your project document from exercise 1b.</p> <p class="ExerciseStep"><span class="ex">2)</span>Display the Catalog pane. The Catalog pane is generally docked to the right of the map (sometimes hidden as a tab).</p> <p class="tip"><span class="tip"><img alt="" src="images/bl.jpg"/></span><b>If you close the Catalog pane or can’t find it, you can always open it by clicking the Catalog Pane button on the View tab.</b></p> <p class="H2"><b>Insert a new map from the Catalog pane</b></p> <p class="BodyText">Recall that you can insert a map from the ribbon, but this time you’ll add it from the Catalog pane.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In the Catalog pane, expand the Maps folder <img alt="" class="img4" src="images/l2_Ico_007a.jpg"/>. Note that the map(s) you’ve created in previous lessons are listed here.</p> <p class="ExerciseStep"><span class="ex">2)</span>Right-click Maps in the Catalog pane and click New Map in the context menu. A new map is added to the list and opens to the map view.</p> <p class="ExerciseStep"><span class="ex">3)</span>Right-click the new map item and click Rename. 
Name the new map <span class="Step-strong">Lesson2</span>.</p> <p class="ExerciseStep"><span class="ex">4)</span>In the Catalog pane, expand Folders by clicking the arrow to the left of it (or double-click Folders). Here you should see the project folder that was created when you started the project (LARiverParkSite) as well as the ParkSite folder that you connected to in <a href="Lesson001.xhtml">lesson 1</a>.</p> <p class="H2"><b>Survey the SourceData folder</b></p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Expand the ParkSite folder.</p> <p class="ExerciseStep"><span class="ex">2)</span>Expand the SourceData folder.</p> <p class="ExerciseStep"><span class="ex">3)</span>Expand everything you can under SourceData.</p> <p class="img"><img alt="" class="img3" src="images/Page-071.jpg"/></p> <p class="BodyText">It’s a long list of items. You may have to scroll down or maximize the application to see everything. Each item is a piece of geographic data or a data container. The icons signify the type of data, as illustrated in the sidebar “Representing the real world as data.”</p> <p class="BodyText">Under the SourceData folder are three folders and a geodatabase <img alt="" class="img4" src="images/l2_Ico_007b.jpg"/>:</p> <p class="BulletList"><span class="bl">•</span>The census folder contains three feature classes of census data in shapefile (.shp) format.</p> <p class="BulletList"><span class="bl">•</span>The City of LA folder contains three shapefiles and a stand-alone table in dBASE (.dbf) format.</p> <p class="BulletList"><span class="bl">•</span>The ParkData folder contains a shapefile.</p> <p class="BulletList"><span class="bl">•</span>The geodatabase contains 10 feature classes in geodatabase format. 
The feature classes are thematically organized in containers called <i>feature datasets</i> <img alt="" class="img4" src="images/l2_Ico_007c.jpg"/>.</p> <p class="BodyText">In the next sections, you’ll preview a lot of this data to make sure you have the features and attributes you listed in the data requirements table.</p> <div class="box1"> <p class="BlueBoxTitleh">Representing the real world as data</p> <p class="BlueBoxMidParagraph">How would you create an information system to organize and manage the huge variety of geographic stuff in the world? One approach is to think of all that stuff in terms of discrete objects.</p> <p class="BlueBoxH1"><b>The discrete-object view of the world</b></p> <p class="BlueBoxMidParagraph">If you conceive of geography in terms of objects, you can sort these objects by similarities. Shape is a fundamental sorting principle: every object can be drawn—in two dimensions—as either a point, a line, or a polygon. Theme, or type, is another principle: every object can be classified as a school, a road, a park, or something else.</p> <p class="BlueBoxMidParagraph">Applying these sorting principles of shape and theme, you can come up with collections of things you would recognize on a map: schools represented as points, roads represented as lines, parks represented as polygons, and so on.</p> <p class="img"><img alt="" class="img3" src="images/Page-072a.jpg"/></p> <p class="BlueBoxMidParagraph">Each object in a collection has a unique location, specified by a pair of spatial coordinates (for points) or a list of coordinate pairs (for lines and polygons).</p> <p class="img"><img alt="" class="img3" src="images/Page-072b.jpg"/></p> <p class="BlueBoxMidParagraph">Besides a unique location, every object has a set of facts that pertain to it: a name, a description, or whatever bits of information have been gathered about it. 
These facts are the object’s attributes.</p> <p class="img"><img alt="" class="img3" src="images/Page-072c.jpg"/></p> <p class="BlueBoxMidParagraph">In ArcGIS, a collection of such objects—with a common shape, common theme, and common attributes—is called a <i>feature class</i>. An individual object in the collection is a <i>feature</i>. The feature class is the basic storage unit for GIS data created according to the discrete-object view of the world, commonly called the <i>vector data model</i>.</p> <p class="BlueBoxMidParagraph">Feature classes can be stored in various file formats, notably the geodatabase and the shapefile. The geodatabase format is newer and more highly developed.</p> <p class="img"><img alt="" class="img3" src="images/Page-072d.jpg"/></p> <p class="BlueBoxH1"><b>The continuous-surface view of the world</b></p> <p class="BlueBoxMidParagraph">Although it’s a powerful model, the discrete-object view is not an intuitive way to think of certain kinds of geographic information, such as elevation or temperature, that don’t have shapes or boundaries and that cover the world everywhere. It’s quite possible to represent these phenomena as features (for example, contour lines represent elevation on topographic maps), but a more natural way to think of them is in terms of continuous expanses, or surfaces.</p> <p class="BlueBoxMidParagraph">The most common way to model a geographic surface is with a matrix of square cells, or pixels. Each cell represents a unit of area, such as a square meter, and stores a single piece of geographic information—typically, a measured or estimated value—at that location.</p> <p class="img"><img alt="" class="img3" src="images/Page-073a.jpg"/></p> <p class="BlueBoxMidParagraph">This way of modeling surfaces is called the <i>raster data model</i>. 
It’s commonly used for elevation and its derivatives (slope, aspect); for temperature, precipitation, and land cover; for statistical data, such as densities and means; and especially for imagery.</p> <p class="BlueBoxMidParagraph">The raster dataset is the basic storage unit for GIS data created according to the continuous-surface view of the world. Raster datasets can be stored in geodatabases or in various standard image file formats, such as TIFF and JPEG.</p> <p class="img"><img alt="" class="img3" src="images/Page-073b.jpg"/></p> <p class="BlueBoxMidParagraph">Feature classes and raster datasets are complementary. In many maps, raster datasets are used for background display, whereas feature classes are used for foreground display and analysis.</p> </div> <div class="box"> <p class="BlueBoxTitle">Acquiring data</p> <p class="BlueBoxLastParagraph">In any GIS project, acquiring good data is a big part of your job. ArcGIS<sup><b>SM</b></sup> Online provides datasets representing many types of geography. You can access this data from a web browser (<a href="http://www.arcgis.com">www.arcgis.com</a>) or directly from ArcGIS Pro by switching from the Project tab on the Catalog pane to the Portal tab. You can search for particular types of data by keyword; you can also search for ArcGIS Online groups, such as the Living Atlas, that curate a variety of authoritative content. For spatial data, you can use ArcGIS<sup><b>®</b></sup> Open Data open-source software. Spatial data is also widely available from government agencies, educational institutions, and commercial vendors. 
All these sources may supplement data collected and managed by your own organization.</p> </div> <p class="H2"><b>Preview parcels</b></p> <p class="BodyText">First, you’ll preview the parcels data, which comes from the County of Los Angeles.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Switch to the Imagery with Labels basemap using the Basemap button on the Map tab.</p> <p class="ExerciseStep"><span class="ex">2)</span>In the Catalog pane, under the City of LA folder, click Parcels.shp to highlight it.</p> <p class="img"><img alt="" class="img3" src="images/Page-074.jpg"/></p> <p class="BodyText">The polygon icon <img alt="" class="img4" src="images/l2_Ico_010.jpg"/> signifies a polygon feature class in shapefile format.</p> <p class="ExerciseStep"><span class="ex">3)</span>Right-click Parcels.shp and click View Metadata.</p> <p class="BodyText">A new Catalog tab is displayed and shows metadata, or data documentation. What you see here is the Details view, an overview of the dataset. Complete metadata includes a description of the data, when and how it was created, attribute information, and so on.</p> <p class="img"><img alt="" class="img3" src="images/Page-075.jpg"/></p> <p class="ExerciseStep"><span class="ex">4)</span>Scroll down through the Details. 
From the top, you see the following:</p> <p class="BulletList"><span class="bl">•</span>The dataset name (City of Los Angeles Parcels) and file type (shapefile)</p> <p class="BulletList"><span class="bl">•</span>A thumbnail image</p> <p class="BulletList"><span class="bl">•</span>Tags that make the data searchable</p> <p class="BulletList"><span class="bl">•</span>A summary of the intended use of the data</p> <p class="BulletList"><span class="bl">•</span>A description of the data content</p> <p class="BulletList"><span class="bl">•</span>Credits attributing where the data came from</p> <p class="BulletList"><span class="bl">•</span>Use limitations related to the license agreement</p> <p class="BulletList"><span class="bl">•</span>Extent of the dataset in latitude and longitude</p> <p class="BulletList"><span class="bl">•</span>Scale range of the dataset from maximum to minimum</p> <p class="ExerciseStep"><span class="ex">5)</span>Switch from the Details view to the Preview view in the lower-left corner of the Parcels metadata. This view allows a visual inspection of the data. The view can be panned and zoomed.</p> <p class="ExerciseStep"><span class="ex">6)</span>Close the Catalog tab (currently displaying the metadata).</p> <p class="ExerciseStep"><span class="ex">7)</span>Add the Parcels shapefile to the Lesson2 map. The map will zoom to the extent of the shapefile.</p> <p class="ExerciseStep"><span class="ex">8)</span>Zoom in until you can easily distinguish features.</p> <p class="tip"><span class="tip"><img alt="" src="images/bl.jpg"/></span><b>The Parcels dataset is large, so the response may be a bit slow. 
Shapefiles can also be considerably slower than other formats.</b></p> <p class="img"><img alt="" class="img3" src="images/Page-076a.jpg"/></p> <p class="BodyText">This is the spatial data you need, showing individual parcel boundaries.</p> <p class="ExerciseStep"><span class="ex">9)</span>With the Explore tool enabled on the Map tab, click any parcel to identify it. The results will be displayed in a pop-up window showing all the attributes of the feature.</p> <p class="img"><img alt="" class="img3" src="images/Page-076b.jpg"/></p> <p class="BodyText">Depending on exactly where you click, you may identify one or more features. The number of identified features will be displayed on the lower left of the pop-up window. You can use the arrow buttons at the bottom to move through the features.</p> <p class="BodyText">The vacancy attribute you need isn’t here, but you do have an area attribute in unspecified units. Once you find out what the units are (which you’ll do in <a href="Lesson003.xhtml">lesson 3</a>), you can convert them to acres. The figure provides some visual context for the size of a one-quarter-acre parcel compared with a five-acre park.</p> <p class="img"><img alt="" class="img3" src="images/Page-077.jpg"/></p> <p class="ExerciseStep"><span class="ex">10)</span>Close the pop-up window.</p> <p class="H2"><b>Preview the table of vacant parcels</b></p> <p class="BodyText">Since you don’t have a vacancy attribute in the parcels shapefile, you’ll look for it elsewhere.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In the Catalog pane, under the City of LA folder, right-click VacantParcels.dbf and click Add to Current Map.</p> <p class="BodyText">This stand-alone table has information about parcels, but no polygons or any other association with spatial features. 
Stand-alone tables are added to the Contents pane in a special group labeled Standalone Tables.</p> <p class="ExerciseStep"><span class="ex">2)</span>Open the table by right-clicking it in the Contents pane and clicking Open. Each of the 29,461 records represents a vacant parcel within the city of Los Angeles.</p> <p class="img"><img alt="" class="img3" src="images/Page-078.jpg"/></p> <p class="ExerciseStep"><span class="ex">3)</span>Read the column headings.</p> <p class="BodyText">OID (object identifier) is a sequential number created and managed by ArcGIS automatically. AIN (assessor identification number) is a user-managed identifier. The UseCode field identifies the parcel use. The CityCode field identifies the city in which the parcel is located.</p> <p class="ExerciseStep"><span class="ex">4)</span>Close the VacantParcels attribute table.</p> <p class="H2"><b>Preview cities</b></p> <p class="BodyText">Row 4 of the data requirements table lists cities as needed spatial data. You have this data: you used it in <a href="Lesson001.xhtml">lesson 1</a>, when you made a definition query on Los Angeles.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In the Contents pane, turn off the Parcels layer.</p> <p class="ExerciseStep"><span class="ex">2)</span>In the Catalog pane, under Folders > SourceData > ESRI.gdb, under Boundary, add the City_ply feature class to the Lesson2 map.</p> <p class="ExerciseStep"><span class="ex">3)</span>Right-click the City_ply layer in the Contents pane and click Zoom To Layer. 
You zoom out to a view of all the features in the feature class, which is the entire United States.</p> <p class="ExerciseStep"><span class="ex">4)</span>Turn off the City_ply layer.</p> <p class="ExerciseStep"><span class="ex">5)</span>Add City_pt from the Catalog pane (Folders > SourceData > ESRI.gdb > Boundary).</p> <p class="ExerciseStep"><span class="ex">6)</span>Zoom to the City_pt layer.</p> <p class="BodyText">The main difference between the two feature classes is that City_pt represents cities as points rather than polygons (hence the names City_pt and City_ply). There’s also a difference in spatial extent, because City_pt includes a feature for Attu Station in Alaska, a place that is so far “west” that it is actually in the Eastern Hemisphere.</p> <p class="ExerciseStep"><span class="ex">7)</span>Zoom in on the data to see individual features.</p> <p class="BodyText">Why would the same features, such as these cities, be represented with two different types of shapes (points here and polygons in City_ply)? It’s because each shape is appropriate for maps of different scales. For a national map, you would show cities as points. For a local map, you might show them as polygons. For your requirement, which is to make sure that potential park sites lie inside the city limits, you need features that have boundaries—and hence, polygons rather than points.</p> <p class="ExerciseStep"><span class="ex">8)</span>Turn off the City_pt layer.</p> <p class="H2"><b>Preview the LA River</b></p> <p class="BodyText">In row 5 of the data requirements table, you have rivers as needed spatial data. In <a href="Lesson001.xhtml">lesson 1</a>, you added the River feature class from the ESRI geodatabase. 
You also have another feature class to look at: LARiver.shp in the City of LA folder.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Add the LARiver shapefile to the Lesson2 map.</p> <p class="ExerciseStep"><span class="ex">2)</span>Zoom to the layer (right-click LARiver and click Zoom To Layer).</p> <p class="ExerciseStep"><span class="ex">3)</span>Open the attribute table, and scroll to the bottom of the table.</p> <p class="BodyText">In fact, the Los Angeles River is the only river in the feature class, but it’s composed of 265 separate features (the FID, or feature identifier, of the first record is 0, so the last record’s FID is 264). Why so many? As noted in <a href="Lesson001.xhtml">lesson 1</a>, the answer has to do with attributes.</p> <p class="ExerciseStep"><span class="ex">4)</span>Review the table to see the attributes, and then scroll up and notice the different values in the Capacity, Discharge, and Protection fields. Many of the rows at the top even have empty attribute values.</p> <p class="BodyText">For your purposes, it doesn’t really matter what these attributes mean. The point is, if the river were represented as a single feature, it would also have just one row in the attribute table. That would mean that only a single value could be stored for each attribute—fine for the river name (which doesn’t change), but a problem for anything you might want to measure or describe at different locations along the river: flow, depth, water chemistry, navigability, or anything else. The creators of this data wanted to gather facts about the river at different places. To do that, they had to define the river as a spatially connected series of individual features.</p> <p class="BodyText">You don’t have that need. All you want is the spatial data. 
If you end up using this feature class in your analysis (rather than the River feature class in ESRI.gdb), you’ll probably combine the 265 features into one.</p> <p class="ExerciseStep"><span class="ex">5)</span>Turn off LARiver and close its attribute table.</p> <p class="H2"><b>Preview parks</b></p> <p class="BodyText">In row 6 of the data requirements table, you need spatial data representing parks. You already know you have parks data: in <a href="Lesson001.xhtml">lesson 1</a>, you symbolized and labeled the Parkland feature class. There’s also a shapefile named Parks in the City of LA folder.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Add the Parks layer to the Lesson2 map.</p> <p class="ExerciseStep"><span class="ex">2)</span>Zoom to the layer.</p> <p class="ExerciseStep"><span class="ex">3)</span>Open the attribute table.</p> <p class="BodyText">The table preview shows that there are 337 records (features) and just a few attributes.</p> <div class="box"> <p class="BlueBoxTitle">Software-managed attributes</p> <p class="BlueBoxMidParagraph">Every shapefile feature class has FID and Shape attributes that are created and managed by the software. The FID attribute stores a unique number for each feature. The Shape attribute stores the geometry type. Behind the scenes, it also links each feature to coordinates that define its spatial location. Measurement attributes, such as length and area, can be calculated for shapefiles, but if the values change—because of a spatial edit, for example—the software doesn’t update them automatically.</p> <p class="BlueBoxMidParagraphL">A geodatabase feature class has up to four software-managed attributes. Like a shapefile, it has a feature identifier (called OBJECTID instead of FID) and a Shape attribute. The Shape_Length attribute stores the lengths of line and polygon features. This attribute doesn’t exist for point features. The Shape_Area attribute stores the internal areas of polygon features. 
It doesn’t exist for point or line features. Shape_Length and Shape_Area are automatically kept up to date by the software.</p> </div> <p class="ExerciseStep"><span class="ex">4)</span>Turn off the Parks layer and close its attribute table.</p> <p class="ExerciseStep"><span class="ex">5)</span>In the Catalog pane under the ParkData folder, add NewParks.shp to the Lesson2 map.</p> <p class="ExerciseStep"><span class="ex">6)</span>Zoom to the NewParks.shp layer.</p> <p class="ExerciseStep"><span class="ex">7)</span>Open the attribute table.</p> <p class="BodyText">This shapefile has just two features. One is Los Angeles State Historic Park, and the other is Rio de Los Angeles State Recreation Area. Note the absence of length and area attributes. You can create them if you want—for example, the Parks shapefile has them—but they don’t exist by default because this is a shapefile.</p> <p class="ExerciseStep"><span class="ex">8)</span>View the metadata for NewParks.shp (right-click it in the Catalog pane) and read the summary.</p> <p class="BodyText">The data represents two newly developed parks. In <a href="Lesson003.xhtml">lesson 3</a>, when you choose a parks feature class for the analysis, you’ll have to make sure that it incorporates these two parks.</p> <p class="ExerciseStep"><span class="ex">9)</span>Turn off the NewParks.shp layer and close its attribute table.</p> <p class="BodyText">There is also a third park, Vista Hermosa Park, which has been completed but is not in NewParks.shp. It’s located just north of downtown and is one more park to keep track of.</p> <p class="ExerciseStep"><span class="ex">10)</span>Click the Locate tool <img alt="" class="img4" src="images/l2_Ico_016a.jpg"/> on the Map tab, in the Inquiry group, and search for Vista Hermosa Park in the Locate pane. 
Then select Vista Hermosa Park, Los Angeles.</p> <p class="img"><img alt="" class="img3" src="images/Page-080.jpg"/></p> <p class="BodyText">The park location will be marked on the map as a <img alt="" class="img4" src="images/l2_Ico_016b.jpg"/> symbol.</p> <p class="ExerciseStep"><span class="ex">11)</span>Zoom in for a closer look at the park.</p> <p class="ExerciseStep"><span class="ex">12)</span>Bookmark it as <span class="Step-strong">Vista Hermosa Park</span>.</p> <p class="ExerciseStep"><span class="ex">13)</span>Close the Locate pane.</p> <p class="H2"><b>Preview census units</b></p> <p class="BodyText">In row 7 of the data requirements table, you decided to use census units as a proxy for neighborhoods.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In the Catalog pane, under the census folder, add tracts.shp to the map and zoom to the layer.</p> <p class="img"><img alt="" class="img3" src="images/Page-081.jpg"/></p> <p class="BodyText">The data covers Los Angeles County. The tracts to the north are much bigger than the ones to the south. That’s because census tracts are designed to have a fairly consistent population range, and the northern part of the county, with mountains and desert, is less densely populated.</p> <p class="ExerciseStep"><span class="ex">2)</span>Zoom in somewhere on the southern part of the city.</p> <p class="ExerciseStep"><span class="ex">3)</span>Add block_groups.shp to the map.</p> <p class="ExerciseStep"><span class="ex">4)</span>Add block_centroids.shp to the map. If necessary, zoom in further to see individual points.</p> <p class="BodyText">Tracts are subdivided into block groups, and block groups are subdivided into blocks. A block centroid is a block represented spatially as a point rather than a polygon. (That’s not so strange—you’ve seen the same thing with cities.) 
You can learn more about census units in the sidebar “Fundamentals of US Census geography.”</p> <p class="BodyText">Either block groups or tracts will satisfy your spatial data requirement for a neighborhood. Because of their point geometry, block centroids won’t.</p> <p class="BodyText">Thus far, you’ve confirmed that you have the spatial and attribute data listed in rows 1–7 of the data requirements table. In three cases (rivers, parks, and census units), you’ll have to choose between feature classes. You’ll tackle that problem in <a href="Lesson003.xhtml">lesson 3</a>. You still have more data requirements to consider and more data to preview. You must also review your requirements for specificity. You’ll do this in the next exercise.</p> <p class="ExerciseStep"><span class="ex">5)</span>Save your ArcGIS Pro project.</p> <div class="box1"> <p class="BlueBoxTitleh">Fundamentals of US Census geography</p> <p class="BlueBoxMidParagraph">The US Census Bureau reports data by various geographic units. The top-to-bottom relationship shown here represents containment: the nation contains states, states contain counties, counties contain tracts, tracts contain block groups, and block groups contain blocks. Many other nonnesting reporting units are not shown.</p> <p class="img"><img alt="" class="img3" src="images/Page-084.jpg"/></p> <p class="BlueBoxMidParagraph">Census tracts are relatively small subdivisions of a county. They typically have between 1,000 and 8,000 inhabitants and vary in size. They are designed to be fairly homogeneous with respect to demographic and economic conditions.</p> <p class="BlueBoxMidParagraph">A block group is a cluster of blocks within a tract. A block group typically has between 600 and 3,000 inhabitants.</p> <p class="BlueBoxMidParagraph">A census block (commonly an ordinary city block) is an area bounded by visible features, such as streets or railroad tracks, or by invisible boundaries, such as city limits. 
A block centroid is a census block represented as a point rather than a polygon. A centroid is located in the geographic center of the block it represents and has the attributes of that block.</p> <p class="BlueBoxMidParagraph">The Census Bureau conducts a new census every 10 years. The latest one was conducted in 2010. Professional demographers estimate values for the intervening years.</p> </div> <p class="H1" id="sec5"><b>Exercise 2c: Reframe the problem statement</b></p> <p class="BodyText">Some of the city council’s park guidelines are specific and measurable:</p> <p class="BulletList"><span class="bl">•</span>On a vacant land parcel one-quarter acre or larger</p> <p class="BulletList"><span class="bl">•</span>Within the LA city limits</p> <p class="BodyText">Others are vague:</p> <p class="BulletList"><span class="bl">•</span>As close as possible to the LA River (Is there a maximum allowed distance from the river? If so, what is it?)</p> <p class="BulletList"><span class="bl">•</span>Not in the vicinity of an existing park (How close is “in the vicinity”?)</p> <p class="BulletList"><span class="bl">•</span>In a densely populated neighborhood with lots of children (How densely populated? How many children?)</p> <p class="BulletList"><span class="bl">•</span>In a lower-income neighborhood (How is “lower income” defined?)</p> <p class="BulletList"><span class="bl">•</span>Serving as many people as possible (How big an area does a park “serve”?)</p> <p class="BodyText">You can’t do the analysis until you eliminate the vagueness.</p> <p class="H2"><b>Define “proximity to the LA River”</b></p> <p class="BodyText">Unless you set a maximum distance limit, every vacant parcel in Los Angeles becomes a potential park candidate. That’s absurd and could waste a lot of data processing time. You’ll set one-half of a mile as an arbitrary outer limit. That stretches the idea of proximity somewhat, but it’s just a cutoff point. 
Hopefully, you’ll find some good locations that are closer than that.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>From Windows Explorer, navigate to C:\EsriPress\UGIS4\ParkSite\MapsAndMore and open the DataRequirementsTable.doc (or the .rtf file if you don’t have Microsoft Word).</p> <p class="ExerciseStep"><span class="ex">2)</span>In row 5 of the data requirements table, in the Defined As column, enter <span class="Step-strong">&lt;= 0.5 miles</span>.</p> <p class="BodyText">The symbol &lt;= means “less than or equal to.”</p> <p class="img"><img alt="" class="img3" src="images/Page-083.jpg"/></p> <p class="H2"><b>Define “away from other parks”</b></p> <p class="BodyText">How far must a candidate site be from existing parks? In open-space planning, a quarter mile is often used to define a convenient walking distance. (That’s typically about a five-minute walk.) Following that standard, you can say that a site is not in the vicinity of an existing park if the site’s border is at least a quarter mile away from the border of the nearest park.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In row 6 of the data requirements table, in the Defined As column, enter <span class="Step-strong">&gt;= 0.25 miles</span>.</p> <p class="BodyText">This measure is a simplification because it’s based on straight-line distance rather than on the distance actually walked along streets.</p> <p class="H2"><b>Define a “densely populated” neighborhood</b></p> <p class="BodyText">As you make the rest of the requirements concrete, you’ll also make sure that you have the appropriate data.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Open the attribute table of the tracts.shp layer and scroll across its attributes.</p> <p class="img"><img alt="" class="img3" src="images/Page-085.jpg"/></p> <p class="BodyText">As shown in the figure, the population density is in the POPDENS_CY field, as noted in <a href="Lesson001.xhtml">lesson 1</a>. 
This attribute stores population per square mile for the current year, in this case 2015. Close the attribute table when finished.</p> <p class="ExerciseStep"><span class="ex">2)</span>Open the attribute table of block_groups.shp and scroll across its attributes.</p> <p class="BodyText">The table has many of the same fields as the tracts.shp table, but there is no population density attribute. This won’t be a problem because population density is just total population divided by area. Because you have total population, you need only the area of each block group, which is something that ArcGIS Pro can calculate automatically. So either shapefile satisfies your need: the tracts file already has population density, and you can derive it for block_groups.</p> <p class="BodyText">You still need a definition of “densely populated.” To keep it simple, you can call a neighborhood densely populated if its population density exceeds that of the city of Los Angeles. The population density of Los Angeles as of the year 2015 was 8,474.7 people per square mile. 
Rounding up to a round number, use 8,500 people per square mile as your threshold.</p> <div class="box"> <p class="BlueBoxTitle">Threshold values</p> <p class="BlueBoxMidParagraph">To find the population density of Los Angeles, as well as other threshold values for the analysis, we used online US Census Bureau data, especially the QuickFacts page at <a href="http://www.census.gov/quickfacts">http://www.census.gov/quickfacts</a>.</p> </div> <p class="ExerciseStep"><span class="ex">3)</span>In row 8 of the data requirements table, in the Defined As column, enter <span class="Step-strong">&gt;= 8,500 per sq mi</span>.</p> <p class="H2"><b>Define “lots of children”</b></p> <p class="BodyText">Again, you’ll look for attributes and then set a threshold.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Open the attribute table of the block_groups layer and locate the POP18UP_CY attribute.</p> <p class="img"><img alt="" class="img3" src="images/Page-086.jpg"/></p> <p class="BodyText">This is the population age 18 or older for the year 2015. If you define a child as a person under 18 (which is reasonable), you can subtract the values in this field (POP18UP_CY) from those in TOTPOP_CY to get the number of children.</p> <p class="BodyText">Because neighborhoods vary in size and population, you can make more valid comparisons if you base your threshold on a ratio rather than on an absolute number. In Los Angeles, 22.2 percent of the population is under 18 years old. You’ll therefore define a neighborhood as having “lots of children” if it meets or exceeds this value, rounded to 22 percent. 
(You can derive the percentage of children from your data using simple arithmetic.)</p> <p class="ExerciseStep"><span class="ex">2)</span>In row 9 of the data requirements table, in the Defined As column, enter<span class="Step-strong"> >= 22%</span>.</p> <p class="ExerciseStep"><span class="ex">3)</span>In the Attribute Data column, replace the entry “age” with <span class="Step-strong">age under 18</span>.</p> <p class="H2"><b>Define “lower income”</b></p> <p class="BodyText">Now look at your income attributes.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>In the attribute table of block_groups, scroll all the way to the right. The last two attributes might be income measures. To find out, you’ll look at the metadata—not the item descriptions you looked at before, but the full data documentation.</p> <p class="ExerciseStep"><span class="ex">2)</span>Click the Project tab (on the far left end of the ribbon), and click Options.</p> <p class="ExerciseStep"><span class="ex">3)</span>Click Metadata in the left column and change the Metadata style to North American Profile of ISO19115 2003.</p> <p class="img"><img alt="" class="img3" src="images/Page-087a.jpg"/></p> <p class="ExerciseStep"><span class="ex">4)</span>Click OK and return to the project using the <img alt="" class="img4" src="images/l2_Ico_023.jpg"/> button.</p> <p class="BodyText">Changing the metadata style gives you access to the full set of metadata from the dataset. See the sidebar “Metadata” for more information.</p> <p class="ExerciseStep"><span class="ex">5)</span>If necessary, open the Catalog pane and browse to ParkSite > SourceData > census.</p> <p class="ExerciseStep"><span class="ex">6)</span>Right-click block_groups.shp and click View Metadata.</p> <p class="img"><img alt="" class="img3" src="images/Page-087b.jpg"/></p> <p class="BodyText">The metadata is divided into three sections that can be expanded or collapsed. 
(They are expanded by default.)</p> <div class="box1"> <p class="BlueBoxTitleh">Metadata</p> <p class="BlueBoxMidParagraph">Metadata is a description of what is known about a dataset. It serves two important purposes. First, it vouches for the integrity of data by explaining things such as how, when, and by whom the data was created. Second, it makes the data searchable. Metadata includes tags that identify essential properties of the data (for example, “rivers,” “Los Angeles,” and “2010”) and other descriptions that make it possible to find specific datasets among large inventories of spatial data.</p> <p class="BlueBoxMidParagraph">Metadata may be kept according to one of several official standards. Data created by government agencies, commercial data vendors, and many large enterprises typically conforms to one of these standards. Data created by small organizations or by individuals commonly does not. In ArcGIS, metadata can be displayed in a style that is suited to a particular standard. The default style is the Item Description, which displays a thumbnail image of the data and a small amount of important information. This style is suited to metadata that is not kept to an official standard. It can also be used to provide a filtered, summary view of metadata that is kept to an official standard. Anyone who creates and shares data should at least maintain metadata at the Item Description level.</p> <p class="BlueBoxMidParagraph">To see the full metadata for a dataset that is kept to an official standard, you must change the metadata style in ArcGIS Pro, under Project Options. 
All the styles, apart from Item Description, are similar, and all afford access to the full set of metadata—no matter what standard they conform to—although they may present the information slightly differently.</p> <p class="img"><img alt="" class="img3" src="images/Page-088.jpg"/></p> </div> <p class="ExerciseStep"><span class="ex">7)</span>Confirm that block_groups.shp is selected on the left side of the Catalog tab. Click Topics and Keywords to collapse it. Right-click block_groups.shp, and click View Metadata.</p> <p class="img"><img alt="" class="img3" src="images/Page-089a.jpg"/></p> <p class="ExerciseStep"><span class="ex">8)</span>Collapse the next several headings (Citation, Citation Contacts, and so on) until you come to the Fields heading.</p> <p class="BodyText">This heading contains the metadata that you’re interested in.</p> <p class="ExerciseStep"><span class="ex">9)</span>Scroll through the Fields data.</p> <p class="ExerciseStep"><span class="ex">10)</span>Scroll down until you see the MEDHINC_CY and AVGHINC_CY fields and read the description.</p> <p class="img"><img alt="" class="img3" src="images/Page-089b.jpg"/></p> <p class="BodyText">Note that MEDHINC_CY is described as 2015 Median Household Income, and AVGHINC_CY is 2015 Average Household Income.</p> <p class="BodyText">Both are good possibilities. Median income is a statistical midpoint: it marks the value that half the households are above and half are below. You’ll adopt this measure because it’s less sensitive to extreme values. (A millionaire in a low-income neighborhood might significantly change the average income but not the median income.)</p> <p class="BodyText">According to the US Census Bureau, the median household income for the city of Los Angeles for the years 2011–2015 is $50,205. 
Rounding off, you’ll call a neighborhood “lower income” if the median household income is $50,000 or less.</p> <p class="ExerciseStep"><span class="ex">11)</span>In row 10 of the data requirements table, in the Defined As column, enter <span class="Step-strong">&lt;= $50,000</span>.</p> <p class="ExerciseStep"><span class="ex">12)</span>In the Attribute Data column, replace “income” with <span class="Step-strong">median hh income</span>.</p> <p class="H2"><b>Define “serving the most people”</b></p> <p class="BodyText">Finally, you want to know which potential site serves the most people. Anyone can come to a park, so for this criterion you want to count all the people nearby, regardless of their demographic profile. The attribute you need for this element is total population, which you have in both the tract and block group feature classes.</p> <p class="BodyText">You’ll treat this guideline as a preference. If half a dozen sites meet your other requirements, you’ll prefer those serving more people overall to those serving fewer. Eventually, this preference may have to be subjectively weighed against others. For example, which is better: a park closer to the river that serves fewer people or a park farther from the river that serves more people? (How much closer? How many more people?)</p> <p class="BodyText">To define the size of the area served by a park, you’ll apply the standard of easy walking distance discussed earlier, and say that a park serves anyone who lives within a quarter mile of it. “Serving the most people” therefore means having the largest population within a quarter-mile radius.</p> <p class="ExerciseStepFirst"><span class="ex">1)</span>Open the block_centroids table and browse across its attributes.</p> <p class="BodyText">This table also has a total population attribute (POP2010). You can’t use block centroids as your spatial data for neighborhoods (you need polygons rather than points), but you can conveniently use them to sum population. 
Given a distance of a quarter mile around the park, ArcGIS Pro can find the block centroids, or points, that fall within this distance and sum their population values.</p> <p class="ExerciseStep"><span class="ex">2)</span>In row 11 of the data requirements table, in the Defined As column, enter <span class="Step-strong">&lt;= 0.25 miles</span>.</p> <p class="BodyText">Your data requirements table should look like the figure.</p> <p class="img"><img alt="" class="img3" src="images/Page-091.jpg"/></p> <p class="BodyText">You can now state the problem in measurable terms that allow you to solve it with GIS tools. Someone might take issue with your interpretation of the city council’s park guidelines, but that’s fine: you’ll always be happy to improve your methodology. For now, you can state your project analysis requirements as follows.</p> <p class="BodyText">You want to locate a site for a new park on a land parcel that is</p> <p class="BulletList"><span class="bl">•</span>vacant,</p> <p class="BulletList"><span class="bl">•</span>a quarter acre or more in size,</p> <p class="BulletList"><span class="bl">•</span>within the LA city limits,</p> <p class="BulletList"><span class="bl">•</span>within a half mile of the LA River (preferring closer sites),</p> <p class="BulletList"><span class="bl">•</span>at least a quarter mile from the nearest park,</p> <p class="BulletList"><span class="bl">•</span>in a census unit in which</p> <p class="BulletList2"><span class="bl2">•</span>the population density is 8,500 or more people per square mile,</p> <p class="BulletList2"><span class="bl2">•</span>at least 22 percent of the population is under 18 years old, and</p> <p class="BulletList2"><span class="bl2">•</span>median household income is $50,000 or less, and</p> <p class="BulletList"><span class="bl">•</span>among parcels that satisfy all the other conditions, surrounded by the largest total population within a quarter-mile radius.</p> <p class="BodyText">You’ve formalized 
the data requirements in a table and confirmed that the essential data is available. In some cases, multiple datasets contain suitable features and attributes. In the next lesson, you’ll compare these datasets and choose which ones to use in the analysis.</p> <p class="ExerciseStep"><span class="ex">3)</span>Close the Lesson2 map and any open tables.</p> <p class="ExerciseStep"><span class="ex">4)</span>Save your project.</p> <p class="ExerciseStep"><span class="ex">5)</span>Continue to the next lesson or close ArcGIS Pro. Save your changes if prompted.</p> </body> </html>