February 12, 2004, by Léon Krijnen
About AtomZ Tweet
About four years ago I discovered AtomZ, wrote a column (in Dutch, over here) about it, and used the search engine on the old Watchking site, that disappeared into the virtual Nirvana last year.
I restarted to use AtomZ again a month or so ago, to index the old columns i wrote between 1995 and 1999.
Stuff from '99 on until now is saved in the MySQL database, parsed by MoveableType in its monthly archives.
AtomZ is a perfect free tool if you want to index 500 pages or less of old stuff that, for one way or antoher, you don't want to import in your weblog. More than 500 pages are not indexed, unless you pay for it.
All you have to do is sign up for a free account, fire up the index process, and you're ready. All they ask from you is a neat little logo on your results page.
It's a template driven system, so you can point the templates to your own blog style sheet, or edit them in whatever manner you like. It's fast too, and you can choose from different forms and ways to present the results.
You can have the indexer work on your whole site, but I've used it this way: put all files in a directory, in my case called cyberspace. Go to the control panel of your AtomZ account and exclude alle other directories from the spider, for example by means of a wild card and include /cyberspace als the only directory to be indexed.
Important: when the indexer starts to index there has to be an index file in the directory, containing links to all the files you'd like to have indexed.
Say you don't want the directory to be visited or browsed, because you want it only to be possible to visit the files in question by means of an AtomZ result.
Solution: let the indexer index, remove the index file from the directory, and check that directory browsing is not allowed in your httpd.config.
Have almost 500 files, but no index.html? Allow directory browsing temporarily in httpd.config, browse the directory, save the browser page as index.html and activate the indexer.
Posted: February 12, 2004 10:15 PM (363 words). Tweet