Have a problem with information overload now? Just wait a while

 

Connecting state and local government leaders

It's not hard to understand why. The Web, which two years ago had an estimated 21T worth of static HTML pages, is doubling in size each year. We are inundated with e-mails, online documents, photographs, images and videos.

It's not hard to understand why. The Web, which two years ago had an estimated 21T worth of static HTML pages, is doubling in size each year. We are inundated with e-mails, online documents, photographs, images and videos.According to a study by the University of California at Berkeley, at www.sims.berkeley.edu/how-much-info, the world 'produces between 1 exabyte and 2 exabytes of information each year'about 250M for every man, woman and child on earth.' An exabyte is a billion gigabytes.The study also said photographs are accumulating at an annual rate of 410 petabytes (410 million gigabytes), while video files add up to 300P annually.It's impossible to browse so much information, but can't it be summarized better? Web search engines return too many irrelevant hits and put too much of a burden on the user, who typically has to know key words, browse returned documents and then enter new words to refine the search.Cross-language document searches'for example, using English key words to find relevant documents in Chinese'are very difficult. Finding a certain piece of nontext material, such as a speech or video file, can turn into a major research project.The Information Access Division of the National Institute of Standards and Technology's IT Laboratory is looking for better ways to access unstructured multimedia and other complex information: text, Web pages, images, video, voice, audio, and 2-D and 3-D graphics. NIST doesn't create such technologies, but we do contribute to them by developing performance metrics, evaluation methods, test suites and standards. We also work with industry to speed up commercial transition.Our human language technology program focuses on the huge growth of non-English content, focused information as distinct from lists of documents, and human-computer interfaces.Since 1992, NIST has held annual text-retrieval conferences to evaluate systems and algorithms developed by participating organizations. They have three goals: to improve the state of the art in Web retrieval, to locate answers rather than merely documents and to bridge language barriers.NIST is working with the Defense Advanced Research Projects Agency on several human language projects. One involves summarization, or reducing the number of words that must be read to understand the main points. Our goal is a reliable, comprehensive evaluation program for summarization tools.Another project involves tracking broadcast news. We are trying to develop ways to evaluate technologies that claim to search and organize multilingual text from broadcast news media.Since 1988, the division has been benchmarking speech-to-text transcription systems. We started with speech read aloud at various vocabulary sizes; then we moved into spontaneous speech, radio and TV speech, telephone conversations and, finally, interactive meetings with unlimited vocabularies. NIST is working with DARPA to drive down the transcription error rates while producing useful metadata.A multimedia project is evaluating MPEG-7, a metadata standard for content-based, audio-visual information access. We are also developing metrics to test compliance of MPEG-7 encoders.A program in user interaction was motivated by the enormous growth in Web information and applications, as well as by the government's mandate to make digital information more accessible.One area deals with commercial software usability. Although poor usability drives up the total cost of owning productivity software, it is not generally considered as a cost itself.Industry approached NIST to help buyers gauge usability in their procurement decisions, which would cut the cost of training and maintenance. The result is the Common Industry Format for reporting usability results.A related project deals with evaluating the usability of Web sites in a consistent way. NIST's Common Industry Format Testing, Evaluation and Reporting project will set up benchmarks for Web usability evaluation and eventually create a 'usability ground truth' for each Web site. This ground truth could serve as a benchmark for comparing different accessibility evaluation methods.Other programs are developing protocols and software to evaluate the accuracy of biometric identifiers such as fingerprints and facial recognition. As more individual biometrics are collected, much more data will have to be stored and retrieved, both to identify persons and to control their access to facilities. High accuracy here is critical; low-accuracy biometric systems require extra information to confirm identity and should be avoided.Eventually, as computing becomes pervasive throughout our environment, there will be multimodal interfaces for speech and visual input, multiple kinds of sensors, and new mobile devices and collaboration technologies. We will need ways to handle this information overload, too.

The Web, which two years ago had an estimated 21T worth of static HTML pages, has been doubling in size each year.

'Martin Herman

Henrik G. DeGyor

Many knowledge workers feel as if they are drowning in information.















Evaluation program








Multimedia sources















Martin Herman is chief of NIST's Information Access Division in Gaithersburg, Md.
X
This website uses cookies to enhance user experience and to analyze performance and traffic on our website. We also share information about your use of our site with our social media, advertising and analytics partners. Learn More / Do Not Sell My Personal Information
Accept Cookies
X
Cookie Preferences Cookie List

Do Not Sell My Personal Information

When you visit our website, we store cookies on your browser to collect information. The information collected might relate to you, your preferences or your device, and is mostly used to make the site work as you expect it to and to provide a more personalized web experience. However, you can choose not to allow certain types of cookies, which may impact your experience of the site and the services we are able to offer. Click on the different category headings to find out more and change our default settings according to your preference. You cannot opt-out of our First Party Strictly Necessary Cookies as they are deployed in order to ensure the proper functioning of our website (such as prompting the cookie banner and remembering your settings, to log into your account, to redirect you when you log out, etc.). For more information about the First and Third Party Cookies used please follow this link.

Allow All Cookies

Manage Consent Preferences

Strictly Necessary Cookies - Always Active

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data, Targeting & Social Media Cookies

Under the California Consumer Privacy Act, you have the right to opt-out of the sale of your personal information to third parties. These cookies collect information for analytics and to personalize your experience with targeted ads. You may exercise your right to opt out of the sale of personal information by using this toggle switch. If you opt out we will not be able to offer you personalised ads and will not hand over your personal information to any third parties. Additionally, you may contact our legal department for further clarification about your rights as a California consumer by using this Exercise My Rights link

If you have enabled privacy controls on your browser (such as a plugin), we have to take that as a valid request to opt-out. Therefore we would not be able to track your activity through the web. This may affect our ability to personalize ads according to your preferences.

Targeting cookies may be set through our site by our advertising partners. They may be used by those companies to build a profile of your interests and show you relevant adverts on other sites. They do not store directly personal information, but are based on uniquely identifying your browser and internet device. If you do not allow these cookies, you will experience less targeted advertising.

Social media cookies are set by a range of social media services that we have added to the site to enable you to share our content with your friends and networks. They are capable of tracking your browser across other sites and building up a profile of your interests. This may impact the content and messages you see on other websites you visit. If you do not allow these cookies you may not be able to use or see these sharing tools.

If you want to opt out of all of our lead reports and lists, please submit a privacy request at our Do Not Sell page.

Save Settings
Cookie Preferences Cookie List

Cookie List

A cookie is a small piece of data (text file) that a website – when visited by a user – asks your browser to store on your device in order to remember information about you, such as your language preference or login information. Those cookies are set by us and called first-party cookies. We also use third-party cookies – which are cookies from a domain different than the domain of the website you are visiting – for our advertising and marketing efforts. More specifically, we use cookies and other tracking technologies for the following purposes:

Strictly Necessary Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Functional Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Performance Cookies

We do not allow you to opt-out of our certain cookies, as they are necessary to ensure the proper functioning of our website (such as prompting our cookie banner and remembering your privacy choices) and/or to monitor site performance. These cookies are not used in a way that constitutes a “sale” of your data under the CCPA. You can set your browser to block or alert you about these cookies, but some parts of the site will not work as intended if you do so. You can usually find these settings in the Options or Preferences menu of your browser. Visit www.allaboutcookies.org to learn more.

Sale of Personal Data

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Social Media Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.

Targeting Cookies

We also use cookies to personalize your experience on our websites, including by determining the most relevant content and advertisements to show you, and to monitor site traffic and performance, so that we may improve our websites and your experience. You may opt out of our use of such cookies (and the associated “sale” of your Personal Information) by using this toggle switch. You will still see some advertising, regardless of your selection. Because we do not track you across different devices, browsers and GEMG properties, your selection will take effect only on this browser, this device and this website.