RSS feed fixed

Somehow I mistakenly deleted the html stripping in the RSS feed, which was causing all kinds of erroneous ampersands. The bug has been fixed, and things should be back to normal.

Posted by cameron on November 18, 2002 at 04:46 PM
Blogdex in action

For those interested parties, you can watch Blogdex do its work through the agent logs. I use these to keep track of progress and errors, but there's no reason it shouldn't be public. In the future I'll be trying to expose as much of the internal operation of Blogdex as possible.

Blogdex Agent Logs

Posted by cameron on November 04, 2002 at 08:09 PM
Search added

Search functionality is now available. I'm using MySql's full-text search methods which are quick but offer very little in the way of performance tuning. Results for multiple word queries are forced to be boolean OR across whatever dataset is being searched. The search page can be found here:

Blogdex Search

I'm considering moving the database to MySql 4.0 which offers a much richer query language, or just doing the search myself. But in the mean time, it should be fast and easy to find the sites you've been looking for!

Posted by cameron on November 03, 2002 at 10:11 PM
Paging Mr. Blogdex..

I've installed paging links on all of the parts of the site that display lists of links (i.e. the front page, tracking pages, and blog lists). Most of the data stored in the databases should now be available.

I also rebuilt much of the HTML generation code which was a last minute hack to get the site up. Most of the pages load much more quickly now. Some of the more popular sites are loading slowly in the tracking page, which I think must be an unoptimized table join or something. I'll try and figure that one out in the morning.

I think I have 95% of the old site rebuilt now, so hopefully by the weekend I'll be working on new features.

Posted by cameron on November 01, 2002 at 02:35 AM