One tool that I have used in the past to capture static HTML web sites
is HTTrack (http://www.httrack.com/), an open source offline browsing
utility.  Depending on what you are trying to capture it may work for
you.

 

Cory Nimer

Manuscripts Cataloger/Metadata Specialist

1108 HBLL

Brigham Young University

Provo, UT  84602

(801) 422-6091

 

________________________________

From: Archives & Archivists [mailto:[log in to unmask]] On
Behalf Of Sokolow, Daniel
Sent: Tuesday, July 25, 2006 10:02 AM
To: [log in to unmask]
Subject: Re: Capturing websites

 

That all depends on what you're trying to accomplish.  The simplest way
of doing it is using Adobe Acrobat (the full version, not just the
Reader.)  Using Acrobat's WebCapture feature, you can enter a URL, tell
the program how many levels to capture, and it will produce a PDF with
all the pages and images.  I just tested it, and all the links within
the document will still work.  Acrobat retails for about $250 solo, and
is included with Adobe's Creative Suite 2 for a lot more money.

 

The non-archival nature of the document is that once captured, you no
longer have the HTML behind a webpage.  To properly capture that part,
you'd need to save the HTML code page by page, and I don't know of a
utility that will do that.  There are web archiving technologies out
there that will preserve the pages as web pages, and I suspect save the
HTML as well, but I'm unfamiliar with them.

 

DS

 

______________________________________ 
Daniel Sokolow, Archives Coordinator 
David Taylor Archives 
North Shore-Long Island Jewish Health System 
155 Community Drive 
Great Neck, NY 11021 
mailto:[log in to unmask] 

 

	 

	
________________________________


	From: Archives & Archivists
[mailto:[log in to unmask]] On Behalf Of Molly Alexander
	Sent: Tuesday, July 25, 2006 11:22 AM
	To: [log in to unmask]
	Subject: Capturing websites

	I am posting this for my supervisor who is not on the listserv.
You can respond to the listserv or you can e-mail me directly at
[log in to unmask] and I will pass it along.

	Molly Alexander

	Archivist

	Association of American Medical Colleges 

	 

	Does anyone know of a product which will allow us to capture
websites of 3 levels down or less for the purpose of documenting them?
This would not necessarily need to agree with archival conventions, but
would allow us to store the basic pages of a website to a file, and
would allow a sense of understanding the context of the information (at
least for the main pages). Something which could be stored on DVD would
be preferable. Thanks in advance for any advice which you might offer.

_____________________________________________________________________


The information contained in this electronic e-mail transmission

and any attachments are intended only for the use of the individual

or entity to whom or to which it is addressed, and may contain

information that is privileged, confidential and exempt from

disclosure under applicable law. If the reader of this communication

is not the intended recipient, or the employee or agent responsible

for delivering this communication to the intended recipient, you

are hereby notified that any dissemination, distribution, copying

or disclosure of this communication and any attachment is strictly

prohibited. If you have received this transmission in error, please

notify the sender immediately by telephone and electronic mail,

and delete the original communication and any attachment from any

computer, server or other electronic recording or storage device

or medium. Receipt by anyone other than the intended recipient is

not a waiver of any attorney-client, physician-patient or other

privilege. Thank you.


A posting from the Archives & Archivists LISTSERV List sponsored by the
Society of American Archivists, www.archivists.org. For the terms of
participation, please refer to
http://www.archivists.org/listservs/arch_listserv_terms.asp. 

To subscribe or unsubscribe, send e-mail to [log in to unmask]
In body of message: SUB ARCHIVES firstname lastname *or*: UNSUB ARCHIVES
To post a message, send e-mail to [log in to unmask] 

Or to do *anything* (and enjoy doing it!), use the web interface at
http://listserv.muohio.edu/archives/archives.html 

Problems? Send e-mail to Robert F Schmidt <[log in to unmask]>

A posting from the Archives & Archivists LISTSERV List sponsored by the Society of American Archivists, www.archivists.org.
For the terms of participation, please refer to http://www.archivists.org/listservs/arch_listserv_terms.asp.

To subscribe or unsubscribe, send e-mail to [log in to unmask]
      In body of message:  SUB ARCHIVES firstname lastname
                    *or*:  UNSUB ARCHIVES
To post a message, send e-mail to [log in to unmask]

Or to do *anything* (and enjoy doing it!), use the web interface at
     http://listserv.muohio.edu/archives/archives.html

Problems?  Send e-mail to Robert F Schmidt <[log in to unmask]>