Map merge: How USGS integrates geospatial systems

 

Connecting state and local government leaders

While geographic information systems have become well established in the federal government, the current challenge for agencies is to integrate their data with that from their counterparts in other jurisdictions.


From NASA to NOAA, and the Department of Agriculture to the Department of Defense, federal agencies and departments are collecting geospatial data at a pace that would choke many server farms. The data isn't just being amassed: It's being analyzed to guide troops in unfamiliar terrain, to track the spread of disease and to decipher crime patterns across the law enforcement enterprise.

While geographic information systems (GIS) have become well established in the federal government, the current challenge for agencies is to develop tools and techniques to connect their geospatial programs with their counterparts in other jurisdictions.

While some agencies have been coordinating their data collection and analyses, as a general rule data collected by one agency can rarely be integrated easily and effectively with data collected by another. 

Steps are now being taken to change that.  In fact, the Federal Geographic Data Committee (FGDC) – the main federal unit charged with integrating federal geospatial efforts – in its National Spatial Data Infrastructure Plan 2014-2016, said its primary goal is to "develop capabilities for national shared services."

More specifically, the FGDC, hosted at the U.S. Geological Survey, has been tasked with ensuring, "that spatial data from multiple sources (federal, state, tribal, regional and local governments; academia; and the private sector) are available and easily integrated to enhance understanding of our physical, natural, and cultural world.”

According to agency geospatial leaders and technology experts, there are two major hurdles to achieving those goals: neither the resolution of collected data nor the formats for creating metadata are compatible across data fields.

USGS takes the lead

Not surprisingly, the lead organization researching geospatial data integration and developing tools to enable such integration is USGS. In fact, the agency has staked its future on the goal of geospatial interoperability, having created the Center of Excellence for Geospatial Information Science (CEGIS) to lead the effort.

The Center was established "to conduct, lead and influence the research and innovative solutions required by the National Spatial Data Infrastructure and the emerging GeoSpatial and GeoSemantic Web," said USGS.

According to Lynn Usery, director of CEGIS, the initial impetus for finding ways to integrate geospatial data was the USGS National Map – a collaborative effort among USGS and other federal, state and local partners to improve and deliver topographic information across the nation, including orthoimagery (aerial photographs), elevation, geographic names, hydrography, boundaries, transportation, structures and land cover.

The first hurdle Usery's team faced was the different resolutions of collected data.

"The reason we started the [CEGIS data integration] project was that when we were developing the national map, we realized that the different layers of the national map – hydrography, transportation, contours and all those things – were all actually compiled and generated separately,” said CEGIS’s Usery. That meant that when the layers were put together, it might look as if they didn’t match up.  

Usery’s team then tried to determine exactly what it means for a dataset to be integrated. “If you take two datasets and you put them together, how does the user perceive that as being integrated?" he asked.

According to Usery, the team found that if the resolution of two datasets – say transportation data superimposed on image data – was within about 6.4 meters, users would perceive the data as being integrated.

But unless both datasets were already geocoded using the same projection system, getting the two datasets to align correctly can be a major problem.  For each data set, an application needs to be created to perform the integration.  In the case of integrating transportation data with underlying imagery, said Usery, USGS worked with researchers at the University of Southern California. 

"USC was already doing some work in this area and had developed some algorithms to automatically locate intersections on the images. Then [they could] use those intersections as control points to get vector data to the images," Usery explained.  USGS provided data and small grants to the research team at USC. 

In similar fashion, USGS provided support to another group to integrate hydrography data with contour layers in the National Map.

Integrating metadata

Of course, integrating data sets collected by agencies would be much faster, easier and less expensive, if the data sets were created using standardized metadata – the data about the data, including geographic coordinates and object labels.

To address the problem, CEGIS is considering a “semantic” approach that allows data to be used across application and agency boundaries.

"We are primarily looking at ontology and semantics as a way to integrate data across a variety of organizations and different kinds of data layers," said Usery.  The team is trying a semantic web approach, he said. They built an ontology for all of the data and then built the semantics around that so they can use those semantics to actually integrate with other data sets.

As it is now, government agencies at all levels – as well as the private sector – apply different labels to features, spatial concepts and other data.  As a result, while an agency might be able to populate a map layer with data points from another agency, the analytic potential is very limited if the metadata doesn't share the same architecture. 

Accordingly, CEGIS is working to develop a single integrated language or ontology for describing geospatial data. 

"We're taking geospatial data and building an ontology for the data based on all of our features, relationships and interrelationships of features, and then we structure the data using RDF – resource description framework," said Usery. 

"If our data is structured in that form and other data is structured as RDF we can actually bring the data sets together."

In fact, USGS did just that with data it brought in from the Environmental Protection Agency, which was also in RDF format.  "We just ran an automatic query to locate all of the EPA pollution sites within five miles a local firehouse," said Usery.

Dealing with legacy data

Not surprisingly, the biggest snag to implementing a robust scheme for organizing metadata is the existence of large amounts of legacy data.

According to Usery, , USGS has not converted all of its data to RDF.  "Our data resides in GIS format and it works very well," he said.  "There are lots of procedures designed around those things and we can't just completely change over and lose all the legacy developments that we've done around GIS platforms." 

Instead, the team has developed a tool that allows analysts to take any section of vector data sets and convert it to RDF.

While most integration of federal geospatial data is currently happening through ad hoc arrangements between agencies, the FGDC has developed a process for managing data sets to encourage their availability to, and usability by, all sectors of government.

The FGDC's Geospatial Data Asset Management Plan – issued in March 2014 – requires agencies and departments to adopt standardized processes for "racking, maintaining, expanding and aligning" geospatial data assets.  "At a national level," said the report, "this approach is intended to overcome the single agency, stovepipe model by applying consistent policy, improved organization, better governance and public engagement to deliver outstanding results.”

By the end of 2016, the process is expected to result in the collection of searchable and downloadable datasets that agencies, departments and the public can access via the GeoPlatform.gov portal, which already offers more than 95,000 datasets.

While the goals of the FGDC are ambitious, according to Usery, there's a long way to go to meet them.  "All agencies that use geospatial data try to comply with the FGDC standards," he said.  "That's been the case for many years.  Beyond that I don't think there's a lot that goes on."

For now, Usery said, at USGS most efforts to integrate geospatial data continue to be between agencies that share an immediate interest in the specific data.  "We try to leverage data from other agencies and not have to collect all the data ourselves," he said. 

X
This website uses cookies to enhance user experience and to analyze performance and traffic on our website. We also share information about your use of our site with our social media, advertising and analytics partners. Learn More / Do Not Sell My Personal Information
Accept Cookies
X
Cookie Preferences Cookie List

Do Not Sell My Personal Information

When you visit our website, we store cookies on your browser to collect information. The information collected might relate to you, your preferences or your device, and is mostly used to make the site work as you expect it to and to provide a more personalized web experience. However, you can choose not to allow certain types of cookies, which may impact your experience of the site and the services we are able to offer. Click on the different category headings to find out more and change our default settings according to your preference. You cannot opt-out of our First Party Strictly Necessary Cookies as they are deployed in order to ensure the proper functioning of our website (such as prompting the cookie banner and remembering your settings, to log into your account, to redirect you when you log out, etc.). For more information about the First and Third Party Cookies used please follow this link.

Allow All Cookies

Manage Consent Preferences

Strictly Necessary Cookies - Always Active

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data, Targeting & Social Media Cookies

Under the California Consumer Privacy Act, you have the right to opt-out of the sale of your personal information to third parties. These cookies collect information for analytics and to personalize your experience with targeted ads. You may exercise your right to opt out of the sale of personal information by using this toggle switch. If you opt out we will not be able to offer you personalised ads and will not hand over your personal information to any third parties. Additionally, you may contact our legal department for further clarification about your rights as a California consumer by using this Exercise My Rights link

If you have enabled privacy controls on your browser (such as a plugin), we have to take that as a valid request to opt-out. Therefore we would not be able to track your activity through the web. This may affect our ability to personalize ads according to your preferences.

Targeting cookies may be set through our site by our advertising partners. They may be used by those companies to build a profile of your interests and show you relevant adverts on other sites. They do not store directly personal information, but are based on uniquely identifying your browser and internet device. If you do not allow these cookies, you will experience less targeted advertising.

Social media cookies are set by a range of social media services that we have added to the site to enable you to share our content with your friends and networks. They are capable of tracking your browser across other sites and building up a profile of your interests. This may impact the content and messages you see on other websites you visit. If you do not allow these cookies you may not be able to use or see these sharing tools.

If you want to opt out of all of our lead reports and lists, please submit a privacy request at our Do Not Sell page.

Save Settings
Cookie Preferences Cookie List

Cookie List

A cookie is a small piece of data (text file) that a website – when visited by a user – asks your browser to store on your device in order to remember information about you, such as your language preference or login information. Those cookies are set by us and called first-party cookies. We also use third-party cookies – which are cookies from a domain different than the domain of the website you are visiting – for our advertising and marketing efforts. More specifically, we use cookies and other tracking technologies for the following purposes:

Strictly Necessary Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Functional Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Performance Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Social Media Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Targeting Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.