Friday, November 20, 2015

Network Analysis

Goal and Objectives:

The goal of this assignment was to learn how to perform network analysis, and use model builder to automate the process. In order to apply network analysis to the over-arching theme of frac sand mining, we assessed the cost incurred by several counties in Wisconsin as a result of increased truck traffic from overland shipping of frac sand between mine and rail terminals.

Although the calculations were performed using hypothetical numbers, the workflow is still the same as if the calculations were performed with proper values.

Methods:

In order to estimate the damage incurred per county, it was first necessary to determine the routes between the mines and rail terminals. This was done using the  closest facility analysis, a technique that uses a street network dataset to determine which facility is closest to a specific incident, and is also used for everything from emergency service dispatching to pizza delivery. to input the streets network dataset, and the Add Locations tool to load the locations of the mines and rail terminals. The mines were added as "incidents", and the rail terminals were added as "facilities". Next, the solve tool was run to calculate the route. The "Select Data" and "Copy Data" tools were used to add the calculated routes to a geodatabase. Before any further calculation could be performed, it was necessary to project the routes into a projected coordinate system, so their lengths wouldn't be calculated in decimal degrees.
Figure 1: The completed data flow model.

 In order to calculate the total road length per county, I used the "Intersect" tool to separate the routes into individual sections per county. The Summary Statistics tool was used to obtain a table featuring the total length of the routes per county. These lengths were output in meters, however, so some calculations were needed before cost-per-county could be determined. First, the Add Field tool was used to add a float column named "Distance_Miles". Next, I used the Calculate Field tool to write an equation converting the distances from meters to miles and adding the calculated values to "Distance_Miles". Once the distances were calculated, I added another float field "Annual_Cost" to hold the estimated cost value. I used the Calculate Field tool to write an equation so "Annual_Cost" = "Distance_Miles" * 50 (hypothetical number of trips per year) * 2 (to and from the terminal) * $0.022 (using a hypothetical cost per truck mile of 2.2 cents).

Results:

Figure 2: Haul distance per County.
The table showed that five of the counties had routes longer than 100 miles, two counties had routes totaling between 50 to 100 miles, with 10 counties having under 50 miles (Figure 2). Three of the counties have mines or rail terminals almost exactly on their borders, limiting their haul distances to 2.57, 1.28, and 0.96 miles, respectively (Figures 2,5).

Figure 3: Hypothetical Annual Cost per County
The annual cost per county is directly related to the haul distance per county (Figures 2,3.

Figure 4: Hypothetical Annual Cost per County.
Figure 5: Frac sand mine locations, rail terminal locations, and the routes between the two.



Discussion:

It was extremely important to use a float field for the calculations, as the default long integer cannot hold decimal values. Other students were having computational difficulties as a result of choosing the wrong field type.

The data showed very different results from what I was expecting, as counties' roads weren't only suffering damage from traffic from mines within the county, but also from mines in surrounding counties. The extra traffic on Chippewa County's roads from mines in Barron County potentially double the cost incurred by Chippewa County (Figure 5).

Conclusions:

This lab provided an opportunity for me to learn how to perform network analysis. Although the results are completely hypothetical, the workflow is accurate.

Sources:

Mine location information, County Boundaries, and was acquired from the Wisconsin Department of Natural Resources.
http://midamericafreight.org/wp-content/uploads/FracSandWhitePaperDRAFT.pdf

Sunday, November 8, 2015

Geocoding Frac Sand Mines

Goal and Objectives:

The goal of this lab was to geocode a subset of frac sand mine locations in Wisconsin. This was to gain an understanding of data normalization and the accuracy of data collected by others.

Methods:

Before geocoding could occur, the addresses for the mines needed to be normalized (Figure 1). This involved adding additional columns to the excel spreadsheet, and copying different portions of the address field to them. This was done so the geocoder tool would be able to properly locate the mine's positions.

After all of the addresses were normalized, I added the excel sheet to the geocoding tool, and matched the fields to match the geocoding style. The geocoder then matched the addresses to their corresponding locations according to Esri's "World Geocoding Service". All of the addresses matched locations in the "World Geocoding Service", but weren't exactly accurate, so manual re-positioning of all of the points was necessary. In order to do this, the "address inspector" was used to "Unmatch" the point from its geocoded location, and the correct point was selected using the "Pick Address from Map" tool. Public Land Survey System (PLSS) information was used with the address information to identify the true locations of the mines.

Next, I compared my results to my colleagues. First, I used the "merge" tool to combine all of their shapefiles into one shapefile. I then used the "near" tool to determine the distances between each of my points and the closest point collected by my colleagues.

Results:
Figure  1: The table before normalization

Figure 2: The normalized table

The resulting table from the normalizing procedure was easier for the computer and persons to interpret (Figures 1,2) .

The initial geocoding results were extremely erroneous, with one of the initial results being located in Scotland, rather than its true location in Wisconsin (Figure 3).
Figure 3: One of the points was erroneously located in Scotland.
When comparing my results with those of my classmates, the differences between our resulting locations varied between 3 meters to 9 kilometers (Figure 4). 
Figure 4: A table showing the distances between my point locations
 and my classmates' point locations.


Discussion:

different types of errors
how can we know what points are correct?

Conclusion: