Los Alamos National Laboratory
Lab Home  |  Phone
 
 
News
News Bulletin

Quick Links

Other Publications

Questions, news tips: send an
e-mail to lanltoday@lanl.gov
or call (505) 667-6103.



Los Alamos researchers create 'Map of Science'

By James E. Rickman

March 11, 2009

Data provides high-resolution picture of scientists’ information retrieval habits

Laboratory scientists have produced the world's first Map of Science—a high-resolution graphic depiction of the virtual trails scientists leave behind when they retrieve information from online services. The research, led by Johan Bollen of the Research Library (STBPO-RL), appears this week in PLoS ONE (the Public Library of Science).

“This research will be a crucial component of future efforts to study and predict scientific innovation, as well novel methods to determine the true impact of articles and journals,” Bollen said.

While science is of tremendous societal importance, it is difficult to probe the often hidden world of scientific creativity. Most studies of scientific activity rely on citation data, which takes a while to become available because both the cited publication and the publication of a particular citation can take years to appear. In other words, citation data observes science as it existed years in the past, not the present.

Bollen and colleagues from the Lab and the Santa Fe Institute collected usage-log data gathered from a variety of publishers, aggregators, and universities spanning a period from 2006 to 2008. Their collection totaled nearly 1 billion online information requests. Because scientists typically read articles online well before they can be cited in subsequent publications, usage data reveal scientific activity nearly in real-time. Moreover, because log data reflect the interactions of all users—such as authors, science practitioners, and the informed public—they do not merely reflect the activities of scholarly authors.

Whenever a scientist accesses a paper online from a publisher, aggregator, university, or similar publishing service, the action is recorded by the servers of these Web portals. The resulting usage data contains a detailed record of the sequences of articles that scientists download as they explore their present interests. After counting the number of times that scientists, across hundreds of millions of requests, download one article after another, the research team calculated the probability that an article or journal accessed by a scientist would be followed by a subsequent article or journal as part of the scientists’ online behavior. Based on such behavior, the researchers created a map that graphically portrays a network of connected articles and journals.

Bollen and colleagues were surprised by the map’s scope and detail. Whereas maps based on citations favor the natural sciences, the team’s maps of science showed a prominent and central position for the humanities and social sciences, which, in many places, acted like interdisciplinary bridges connecting various other scientific domains. Sections of the maps were shaped by the activities of practitioners who read the scientific literature but do not frequently publish in its journals.

The maps also revealed unexpected relations between scientific domains that point to emerging relationships that are capturing the collective interest of the scientific community—for instance a connection between ecology and architecture.

“We were surprised by the fine-grained structure of scientific activity that emerges from our maps,” said Bollen.

According to Bollen, future work will focus on issues involved in the sustainable management of large-scale usage data, as well the production of models that explain the online behavior of scientists and how it relates to the emergence of scientific innovation. This information will help funding agencies, policy makers, and the public to better understand how best to tap the ebb and flow of scientific inquiry and discovery.

The research team includes Bollen, Herbert Van de Sompel, Ryan Chute, and Lyudmila Balakireva of the Digital Library Research and Prototyping Team and Aric Hagberg, Luis Bettencourt and Marko A. Rodriguez of Applied Mathematics and Plasma Phycis (T-5), and the Center for Nonlinear Studies. Bettencourt also is part of the Santa Fe Institute.

Bollen and colleagues received funding from the Andrew W. Mellon foundation to examine the potential of large-scale usage data. The study is part of the MESUR (Metrics from Scholarly Usage of Resources) project of which Bollen is the principal investigator. The MESUR usage database is now considered the largest of its kind.

Other Headlines


reserchers monitor fish

Robinson and Fresquez monitor air, soil, animals, plants

Data looks very favorable as the Lab continues working to mitigate its impact on the environment

Fresquez and his team are sampling lakes and rivers to determine the source and migration of potential contaminants.

read more

Currents, the Laboratory's monthly employee magazine, highlighting people in the workplace.

Operated by Los Alamos National Security, LLC for the U.S. Department of Energy's NNSA

Inside | © Copyright 2008-09 Los Alamos National Security, LLC All rights reserved | Disclaimer/Privacy | Web Contact