skip navigation
  • Ask a LibrarianDigital CollectionsLibrary Catalogs
  •   Options
The Library of Congress > Digital Preservation > Partners > Collecting & Preserving Content
Digital Preservation
  • Digital Preservation Home
  • Importance of Digital Preservation
  • What the Library Is Doing
  • Partners
  • Achievements
  • What You Can Do
  • News & Events
  • Contact Us

Related Resources

  • Digital Collections & Programs
  • Digital Formats Sustainability
  • Section 108 Study Group
  • Standards at the Library of Congress
  • U.S. Copyright Office
  • Web Capture
  • World Digital Library

 RSS | Blog

 Podcasts

 Email Updates

About Digital Preservation

View a short presentation about digital preservation.

 Try it out

Partners

The Library of Congress has formed a growing network of preservation partners both in the United States and abroad to help save digital information that would otherwise be lost.
Partners Home | Collecting & Preserving Content | Digital Preservation Research | Organizational Alliances | Technical Tools and Services | U.S. States | Index of Partners
Project Descriptions | Meetings | Interviews

Page: 1 | 2 | 3

Peter Lyman, University of California at Berkeley

Following is an interview with Peter Lyman, professor in the School of Information Management and Systems, University of California at Berkeley. Lyman is the author of the essay "Archiving the World Wide Web." In August 2001, the Council on Library and Infornmation Resources commissioned six "environmental scans" at the request of the Library of Congress. The essays represented the informed views of the "state of the art" by recognized experts.

Your survey published in 2003 states that the surface Web in 2002 was 167 terabytes. In a March 2003 survey (not yours) it was stated that the "estimated annual production of materials in Web-ready formats by the year 2007 is projected to be too large to estimate by many analysts." Do you believe there is validity in the statement that we are approaching a point where it will not be possible to estimate the amount of information?

Yes, in principle. In practice there are some interesting issues. One is: What do we mean by "information"? A large amount of "information" that is stored is actually data, such as weather and space satellite pictures or video camera content, which cannot actually be used without rendering by software. Two is: We can always estimate how much storage technology is needed to hold it, because we have industry production numbers that are very accurate. This is what we did, but the problem is that it ignores the quality and utility of information, since all of Shakespeare online takes up much less space than a few pictures – but which is more important?

You estimate that the amount of new information stored on paper, film, magnetic and optical media doubled in 1999-2002. Do you see that increase getting even larger after 2002 or will we reach a plateau in the increase of information?

I believe it will continue to expand at this rate at least, doubling every three years. Remember that there was a global recession during that period, for one, and that the Third World is just beginning to produce much more information (look at cinema production for example) and to keep accurate numbers as well. Note also the growth of the Web, which shows that "the means of production" of information is in everyone's hands. The growth of phone calls through mobile technology will also increase the amount of exchange of information – if you count this as information (but it is usually not stored).

Page: 1 | 2 | 3

 Next interview

About | Site Map | Contact | Accessibility | Legal | USA.gov