Stories
Slash Boxes
Comments

SoylentNews is people

Submission Preview

Link to Story

NASA's JPL Joins DARPA on "Memex" Deep Web Search

Accepted submission by takyon at 2015-05-28 12:15:32
Software
NASA's Jet Propulsion Laboratory (JPL) is teaming up [nasa.gov] with the Defense Advanced Research Projects Agency (DARPA) on its Memex "deep Web" search project [darpa.mil]:

The Defense Advanced Research Projects Agency (DARPA) has been developing tools as part of its Memex program that access and catalog this mysterious online world. Researchers at NASA's Jet Propulsion Laboratory in Pasadena, California, have joined the Memex effort to harness the benefits of deep Web searching for science. Memex could, for example, help catalog the vast amounts of data NASA spacecraft deliver on a daily basis.

"We're developing next-generation search technologies that understand people, places, things and the connections between them," said Chris Mattmann, principal investigator for JPL's work on Memex. Memex checks not just standard text-based content online but also images, videos, pop-up ads, forms, scripts and other ways information is stored to look at how they are interrelated. "We're augmenting Web crawlers to behave like browsers -- in other words, executing scripts and reading ads in ways that you would when you usually go online. This information is normally not catalogued by search engines," Mattmann said.

Additionally, a standard Web search doesn't get much information from images and videos, but Memex can recognize what's in this content and pair it with searches on the same subjects. The search tool could identify the same object across many frames of a video or even different videos.

The video and image search capabilities of Memex could one day benefit space missions that take photos, videos and other kinds of imaging data with instruments such as spectrometers. Searching visual information about a particular planetary body could greatly facilitate the work of scientists in analyzing geological features. Scientists analyzing imaging data from Earth-based missions that monitor phenomena such as snowfall and soil moisture could similarly benefit. Memex would also enhance the search for published scientific data, so that scientists can be better aware of what has been released and analyzed on their topics. The technology could be applied to large NASA data centers such as the Physical Oceanography Distributed Active Archive Center, which makes NASA's ocean and climate data accessible and meaningful. Memex would make PDF documents more easily searchable and allow users to more easily arrive at the information they seek. Awareness of existing publications also helps program managers to assess the impact of spacecraft data.

JPL had previously been involved with DARPA's XDATA project [darpa.mil]. Memex [wikipedia.org] is inspired by a 1945 article by Vannevar Bush [theatlantic.com] in The Atlantic Monthly.


Original Submission