Iterasi, as you probably know, is a perfect tool for saving web pages online. You just click a button and an exact replica of the current web page (with images) is saved forever in your account. You may even schedule Iterasi web crawlers to grab pages at pre-defined intervals.
Some of you may have been reluctant to try Iterasi since it required a toolbar (IE & Firefox only) but fortunately, that’s no longer the case. You can drag this new bookmarklet and archive web pages from any web browser including Google Chrome, Safari and Opera without requiring the toolbar.
Your Personal Wayback Machine
Iterasi, probably inspired by the popular Wayback Machine of Internet Archive, has also created a public timeline of Internet websites (as they change) but here the web archiving is instant and happens on-demand.
In the case of Internet Archive, pages appear approximately six months after they get crawled and there’s no guarantee that your site will ever get indexed in the WayBack Machine.
See a comparison between Iterasi and Internet Archive for the Google homepage.
Iterasi Archives and Duplicate Content
With these new features, Iterasi has added a very useful Social Bookmarking layer to the Internet Archive technology but there’s a concern as well similar to my experience with Clipmarks.
Every page saved on Iterasi is an exact replica of another web page but these “mirror images” are served to Google in a plain text format. Here’s an example – this is the original page, this is what you see on Iterasi and this is how Google bots will see the page.
The situation is better than Clipmarks because archived documents link to the source page but still, this can lead to duplicate content issues especially for small or less popular websites.
*Internet Archive blocks all web crawlers (see robots.txt) from indexing their content.
0 comments:
Post a Comment