How To Easily Archive Web Pages Using MHT Files

Posted Oct 28, 2009 | by Rich Menga  

If there was any universal immutable truth to the internet, it’s that things vanish from it all the time. Those pages you bookmarked last year? They may be gone. Those forums posts that contained a wealth of useful information? They may be gone as well.

There are several different ways to archive web pages.

You could use ScreenGrab for Firefox. But the problem is that you can text-search anything in an image.

You could use PDF Creator and "print" pages to PDF. This does allow text searching, but the PDF rarely looks anything like the original page and any images present look "off."

What truly works are MHT files. I’ve mentioned this before but have a few extra goodies to make it even easier.

What’s the difference between an MHT and a regular "Save Page As.."? The MHT is an actual single-file archive that contains all the code and images. It’s a great way to archive web pages that contain information you want to save.

Firefox does not have native ability to read or save MHT files, however with UnMHT, you can. It will even read MHTs saved by Internet Explorer, and IE will also read MHTs saved by Firefox. In addition to that, UnMHT has the ability to save all open tabs at once – something that IE 8 doesn’t do.

See video below for details on how it all works.

Which Of These Traits Applies To YOUR Computing Life?...

2 Responses to “How To Easily Archive Web Pages Using MHT Files”

  1. eli says:

    I love this about Opera. It natively does MHT files and I use them all the time, they’re very useful. If you do a SAVE AS in Opera, the default file type to save as is a MHT file. Awesome!

  2. [...] saving as an mht file will. Then archive it on your hard disk. Rich Menga of PCMech has explained how to archive web pages using MHT files very clearly together with a [...]

Leave a Reply