(2002-02-22) a

Excellent discussion on Hack The Planet about building a Personal Web Archive. Aaron Swartz points to his Python-based "Archiver Proxy" and to a Zope product called zzKnowMan. Dave McCusker points to some of the potential complexities, especially if your tool builds a search index in addition to just saving copies, and notes his understanding of the Buy Build Avoid concept: "At work, I often have the opportunity to consider using an index to make something sophisticated work. We resist doing this every time, however, because I explain it will be hard to cope in any reasonable low latency time when (not if) corruption occurs. So we avoid the effects of entropy partly by avoiding the maintenance of systems we don't want to process when corrupt. (Things that don't exist don't get corrupt, and don't cost any time to create either.)"

