Sitescooper Latest Changes

These are the latest changes to sitescooper and its site files. Note that you can download each file independently from here.
Who / When What
akkana site_samples/science/new_scientist_news.site
2005-11-10 Use RSS instead of html feed, because of stories that weirdly don't show up in Plucker
  
akkana site_samples/
2005-11-10 opinion/pulpit.site, opinion/slate.site, palmsized/the_register_rss.site, tech/slashdot_top.site: Updates for sites that have changed
  
akkana site_samples/
2005-11-10 business/lazarus_at_large.site, opinion/alanmiller.site, tech/paulgraham.site: New sites from me
  
akkana site_samples/tech/pcmag_firstlooks.site
2005-11-10 New site from Goh Boon Nam
  
akkana lib/Sitescooper/URLProcessor.pm
2005-11-10 Add application/*xml to allowed types, for newer RSS sites
  
barrygonzaga site_samples/bsd/openbsd_journal.site
2005-08-08 update url, remove postprocess magic, update email
  
barrygonzaga site_samples/regional_philippines/ctc-movies-metro.site
2005-08-08 Add clickthecity.com Metro Manila Movie Guide; note: huge site
  
barrygonzaga site_samples/palmsized/inq7-mobile.site
2005-08-08 3 level inq7.net site
  
barrygonzaga site_samples/regional_philippines/
2005-08-08 inq7.site, pdi.site: replace pdi.site with inq7.site
  
barrygonzaga site_samples/linux/gwn.site
2005-08-08 add logo imageurl; update author email
  
barrygonzaga site_samples/business/businessweek.site
2005-08-08 Reflect web site title, update author email
  
barrygonzaga site_samples/palmsized/
2005-08-08 ny_times.site, salon.site: remove nonworking site
  
akkana lib/Sitescooper/Main.pm
2005-07-06 Add << ^^ >> links at end of story as well as beginning
  
akkana site_samples/
2005-07-06 lib/layouts.site, humor/jon_carroll.site, news/wired_news/wired_news_politics.site, opinion/salon.site, science/new_scientist_news.site, tech/newsforge.site: Some updates for sites that have changed.
  
akkana site_samples/regional_boston/bostonglobe.site
2005-07-06 New site: Boston Globe City & Region sections. From Bruce Zohn
  
akkana site_samples/
2005-07-06 news/USNews.site, news/newsweek_intl.site, tech/pcmag_images.site: Updates from BoonNam Goh
  
akkana site_samples/science/new_scientist_news.site
2005-01-26 Changes to track the recent site changes
  
akkana site_samples/regional_israel/
2005-01-17 haaretz.site, jpost-columns.site, jpost-international.site, jpost-israel.site, jpost-me.site, jpost-opinion.site: David Resnick : Jerusalem Post and Haaretz site files
  
akkana site_samples/regional_uk/bbc_news_sci_tech.site
2005-01-17 Add ContentsDiff
  
akkana site_samples/sport/GSR/
2005-01-17 GSR_Appearance_Mods.site, GSR_Bike.site, GSR_General_Disc.site, GSR_Owners.site, GSR_Performance_Mods.site, GSR_Stories.site, GSR_Technical.site, GSR_Tips-n-Tricks.site: Delmer Wells : GSR motorcycle information sites
  
akkana site_samples/opinion/slate.site
2005-01-05 Anthony Foglia : New site, Slate
  
akkana site_samples/linux/slashdot.site
2005-01-05 B. M. Sleight : minor changes to pick up ask.slashdot.org it.slashdot.org
  
akkana site_samples/weblog/kevin_sites.site
2005-01-05 New site from Delmer Wells : Kevin's War Blog
  
akkana site_samples/tech/pcmag_images.site
2005-01-05 Goh Boon Nam: Update to track site changes and grab images better
  
akkana site_samples/business/the_economist.site
2005-01-05 Goh Boon Nam: Remove Subscription-only pages which cause problem to Plucker
  
akkana site_samples/
2005-01-05 humor/dave_barry.site, linux/debian_weekly_news.site, news/wired_news/wired_news_tech.site, tech/newsforge.site, tech/the_register.site, weblog/riverbend.site: Updates to track changes in the web sites
  
akkana site_samples/weblog/riverbend.site
2004-06-22 Fixed StoryStart
  
akkana site_samples/linux/
2004-06-03 kc_debian_hurd.site, kc_gimp.site: Remove no longer extant debian, hurd and gimp kernel cousins
  
akkana site_samples/regional_australia/yourmovies_canberra.site
2004-05-20 Your Movies, Canberra: from Ken Russell
  
akkana site_samples/news/USNews.site
2004-05-14 Update from Goh Boon Nam
  
akkana site_samples/
2004-05-14 science/archaeology_org.site, science/grahamhancock.site, tech/slyck.site: New sites from Ken Russell
  
akkana site_samples/palmsized/the_register_rss.site
2004-05-14 New palmsized register from Ken Russell
  
akkana site_samples/palmsized/
2004-05-14 the_register.site, the_register_rss.site: Rename palmsized The Register to The Register RSS, so as not to conflict with the non-palmsized Register
  
akkana site_samples/
2004-05-14 news/atlantic.site, tech/slashdot_top.site: New sites
  
akkana site_samples/opinion/salon.site
2004-05-14 Comment out StoryToPrintableSub -- it was causing errors
  
akkana site_samples/
2004-04-27 linux/desktoplinux.site, science/smithsonian.site, tech/joelonsoftware.site, tech/newsforge.site, weblog/riverbend.site, weblog/where_is_raed.site: New sites, from me
  
akkana site_samples/lib/layouts.site
2004-04-27 Fix BBC news information
  
akkana site_samples/
2004-04-27 linux/kernel_traffic.site, opinion/i_cringely.site, tech/the_register.site: Update URL, content start, and other minor fixes
  
akkana site_samples/news/yahoo/
2004-04-26 yahoo_business.site, yahoo_entertainment.site, yahoo_politics.site, yahoo_tech.site, yahoo_top_stories.site: Re-adding yahoo sites, fixed thanks to Jonathan Becker
  
akkana site_samples/comics/
2004-04-26 boondocks.site, doonesbury.site, tedrall.site: New comics from Ignatz Sol
  
akkana site_samples/
2004-04-25 news/newsweek_intl.site, tech/pcmag_images.site: Updates from Goh Boon Nam
  
akkana site_samples/humor/dave_barry.site
2004-04-25 Update from Alan Hoyle : fix story start, end, headline
  
cwerner site_samples/opinion/pulpit.site
2004-04-23 New site for Bob Cringely's weekly column: The Pulpit. This is the same site scooped by i_cringely.site, except that he old i_cringely site did a 2 level scoop that attempted to get a set of columns, whereas the new one gets a single column and only on Fridays. The old one can probably be removed, but I didnt want to mess with it in case someone is relying on it.
  
cwerner default_isilox.ixl, sitescooper.cf, doc/site_params.html, lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm
2004-03-22 Improved support for isiloXC: 1. Added a new param to sitescooper.cf "ISiloDefaultIxlFile" that points to an .ixl file in the file system. This means that users can change the iSiloX options by using the iSiloX GUI tool to create a new document, change all the options, then save as a .ixl file. The and tags of the document are stripped and replaced by sitescooper but the rest is used for generating the isilox pdb. More details are given in the comments in sitescooper.cf. The most common likely use for this is to allow the users of -isilox to specify global settings for things like image depth, color, inclusion, dithering etc, and perhaps for category too. 2. Added a new site param called "ExtraISiloIxlTags", to allow ixl settings specific to a site. Updated doc/site_params.html, so see this for more details. This is a little different in that the user has to specify a set of top-level tags for the .ixl file. These get appended to the generated file thus overriding the defaults (or overriding the global options if the new config param is used). This takes advantage of the fact that isilox tolerates the tags appearing more than once by simply taking the last tag and ignoring earlier copies (or at least its xml parser does). So you can set general options in your .ixl file and override specific options in the .site files. The fact that you have to override the whole tag such as means that you can't override, say bitdepth separately from dithering, but its still pretty powerful. And simpler and more durable (ie resitant to changes in isilox) than adding a bunch of new site params. : Modified Files: : sitescooper/sitescooper.cf sitescooper/doc/site_params.html : sitescooper/lib/Sitescooper/Main.pm : sitescooper/lib/Sitescooper/SCF.pm : Added Files: : sitescooper/default_isilox.ixl
  
jmason lib/Sitescooper/
2004-02-19 Robot.pm, StoryURLProcessor.pm: some glitches in RSS output fixed; now does not search for sub-stories after html_to_text conversion
  
jmason site_samples/science/new_scientist_news.site
2004-02-18 New Scientist News site updated
  
akkana site_samples/
2004-02-16 cinema/ebert_1min.site, cinema/roger_ebert.site, humor/dave_barry.site: Contributions from Alan Hoyle, alanh at email.unc.edu
  
jmason lib/Sitescooper/
2004-02-13 Main.pm, SCF.pm: added patch from Robert Fuhge, robert.fuhge.at.epost.de, assign categories to Plucker documents using the Category: line in the site file
  
jmason site_samples/tech/risks.site
2004-02-13 updated risks.site to use new 'mobile device' rendering
  
akkana site_samples/business/the_economist.site
2004-02-11 The Economist, from BoonNam Goh
  
akkana site_samples/news/
2004-02-11 newsweek.site, newsweek_intl.site: Newsweek updates from BoonNam Goh
  
jmason site_samples/security/
2004-02-07 crypto_gram.site, crypto_gram.site: cryptogram site fixed
  
jmason lib/Sitescooper/Robot.pm
2004-01-31 handle undef headlines
  
jmason lib/Sitescooper/Robot.pm
2004-01-31 oops; RSS output headline was not being HTML-encoded correctly
  
akkana site_samples/
2003-11-15 tech/computer_world.site, news/newsweek_intl.site: Contributions from BoonNam Goh
  
barrygonzaga site_samples/linux/gwn.site
2003-11-04 add Gentoo Weekly News
  
akkana site_samples/
2003-10-31 news/Newsweek.site, news/NewsweekIntl.site, regional_israel/jpost.site: Remove inconsistently named files
  
akkana site_samples/news/
2003-10-31 newsweek.site, newsweek_intl.site: Newsweek, from Goh Boon Nam
  
akkana site_samples/regional_israel/jerusalem_post.site
2003-10-31 Jerusalem Post, from David Resnick
  
akkana site_samples/tech/wiredmag.site
2003-10-31 Previous commit only got one specific date. So I've substituted my own Wired site file, which doesn't get entire stories yet, but it does get Wired every day.
  
akkana site_samples/tech/wiredmag.site
2003-10-31 One issue of Wired Magazine, from richard_html2pdb at yahoo dot com
  
akkana site_samples/tech/pcmag_images.site
2003-10-31 Update from Goh Boon Nam: Get full-sized images
  
akkana site_samples/news/
2003-10-31 Newsweek.site, NewsweekIntl.site: Newsweek updates (US and Intl) from BoonNam Goh
  
akkana site_samples/regional_israel/jpost.site
2003-10-31 Jerusalem Post, from David Resnick
  
akkana site_samples/news/
2003-10-29 Newsweek.site, USNews.site: New sites contributed by BoonNam Goh
  
hubidubi site_samples/regional_hungary/linuxonline_hu.site
2003-09-17 new site file for linuxonline.hu
  
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-07-08 fixed some wierdness in error messages
  
jmason site_samples/science/new_scientist.site
2003-06-25 fixed NS site
  
jmason lib/Sitescooper/Main.pm
2003-06-25 bug fixed
  
jmason sitescooper.pl, lib/Sitescooper/Main.pm, site_samples/science/new_scientist.site
2003-06-25 bug on win32, noted by Robert P. Nix
  
jmason site_samples/culture/world_new_york.site
2003-06-11 fixed and re-added World New York site
  
jmason site_samples/science/new_scientist.site
2003-06-11 added headline support for New Scientist
  
jmason site_samples/
2003-06-11 business/economist.site, business/stocksmart.site, business/wsj.site, cinema/coaxialnews.site, cinema/coolnews.site, cinema/forcenet.site, comics/girls_and_sports.site, comics/horrorscope.site, comics/i_need_help.site, comics/new_breed.site, comics/pops_place.site, comics/wildwood.site, culture/plastic.site, games/bluesnews.site, humor/alexei_sayle.site, humor/dave_barry.site, humor/ditherati.site, linux/linuxtoday.site, linux/linuxworld.site, linux/mandrakeforum.site, linux/mysql_newsletter.site, linux/weekly_news.site, news/gallup_poll.site, news/world_new_york.site, news/wired_news/wired_news_top_stories.site, news/yahoo/yahoo_business.site, news/yahoo/yahoo_entertainment.site, news/yahoo/yahoo_health.site, news/yahoo/yahoo_oddly_enough.site, news/yahoo/yahoo_politics.site, news/yahoo/yahoo_public_opinion.site, news/yahoo/yahoo_science.site, news/yahoo/yahoo_sports.site, news/yahoo/yahoo_technology.site, news/yahoo/yahoo_top_stories.site, news/yahoo/yahoo_world.site, odd/morbid_fact_du_jour.site, odd/snopes.site, opinion/salon_archives.site, opinion/tbtf.site, opinion/tbtf_log.site, opinion/unblinking.site, palm/memoware.site, palm/palmguru.site, palm/palminfocenter.site, palm/pdalive.site, palm/pencomputing.site, palmsized/beyond2000-pda.site, regional_australia/abc_news_online.site, regional_australia/fairfax_it.site, regional_california/mercury_center.site, regional_california/la_times/latimes_local.site, regional_california/la_times/latimes_nat.site, regional_california/la_times/latimes_oc.site, regional_california/la_times/latimes_science.site, regional_california/la_times/latimes_tech.site, regional_california/la_times/latimes_world.site, regional_croatia/KSET_monthly.site, regional_francais/libe_portrait_du_jour.site, regional_francais/libe_rebonds.site, regional_francais/sia_fr.site, regional_germany/bundesregierung.site, regional_germany/de_excite.site, regional_germany/de_heute.site, regional_germany/de_zdnet.site, regional_germany/de_zeit/de_zeit_media.site, regional_hungary/hirek.site, regional_israel/jerusalem_post.site, regional_north_carolina/cats_cradle.site, regional_north_carolina/charlotte_observer.site, regional_north_carolina/news_observer.site, regional_philadelphia/phillynews.site, regional_seattle/seattletimes.site, regional_spain/es_zdnet.site, regional_spain/marca_soccer.site, regional_spain/marca_sports.site, regional_uk/digiguide_tv_listings.site, regional_uk/times_britain.site, regional_uk/times_world.site, science/cosmiverse.site, science/nasa2go.site, science/sciam.site, security/securityportal.site, sport/fox_sports.site, tech/beyond2000.site, tech/mit_tech_review.site, weblog/joel_on_software.site, weblog/tsluts.site: removed all sites that now give HTTP errors when used
  
jmason sitescooper.pl, lib/Sitescooper/LWPHTTPClient.pm, lib/Sitescooper/Main.pm, site_samples/web/alistapart.site, site_samples/web/asktog.site, site_samples/web/webmonkey.site
2003-06-11 added -timeout parameter
  
jmason rss-to-site.pl
2003-06-11 another patch from Adrian Colley
  
jmason site_samples/
2003-06-11 cinema/ebert_answer_man.site, cinema/ebert_features.site, cinema/ebert_great_movies.site, cinema/roger_ebert.site, opinion/nro.site: updated sites from John Straw
  
jmason site_samples/regional_germany/
2003-06-10 de_cert.site, de_cyberkino.site, de_gazette.site, de_heise_mobil.site, de_heise_tp.site, de_heute.site, de_pdassi_news.site, de_pdassi_software.site, de_spiegel.site, de_stern.site, de_tagesschau.site, de_teltarif.site, de_tvspielfilm.site, mobile2day.site, palmfaq_de.site, pda_debitel_net.site, windows2000faq.site, zdnet_news.site, bundesregierung.site: a whole lot of new regional_germany sites from Stefan Schwingeler
  
jmason lib/Sitescooper/Main.pm, site_samples/comics/thismodernworld.site, site_samples/security/crypto_gram.site
2003-06-10 patch for Plucker; now able to handle big images. also added thismodernworld site. patch from Adrian Colley
  
jmason lib/Sitescooper/
2003-06-09 Main.pm, StoryURLProcessor.pm: remove non-required hashing
  
jmason sitescooper.pl, lib/Sitescooper/Main.pm, lib/Sitescooper/Robot.pm
2003-06-09 description now encoded; RSS 1.0 the default
  
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-06 added SubStoryPermalink conf setting so that permalinks are picked up
  
jmason lib/Sitescooper/
2003-06-06 Robot.pm, SCF.pm: added SubStoryId conf setting so that permalinks are picked up
  
jmason lib/Sitescooper/URLProcessor.pm
2003-06-05 relative links became relative to sitescooper.org; fixed
  
jmason site_samples/tech/pcmag_images.site
2003-06-05 updated PC Magazine site from Goh Boon Nam
  
jmason lib/Sitescooper/Robot.pm
2003-06-05 oops, forgot escaping in description tags
  
jmason lib/Sitescooper/Main.pm
2003-06-04 guid fix; use the real URL as much as poss
  
jmason lib/Sitescooper/LinksURLProcessor.pm
2003-06-04 remove HTML comments before looking for links
  
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-04 added -maxstories support for substory mode
  
jmason lib/Sitescooper/StoryURLProcessor.pm
2003-06-04 lib/Sitescooper/
  
jmason lib/Sitescooper/Main.pm
2003-06-03 fixed invalid RSS
  
jmason lib/Sitescooper/Robot.pm
2003-06-03 rss with -dump works
  
jmason lib/Sitescooper/
2003-06-03 Main.pm, Robot.pm, SCF.pm, StoryURLProcessor.pm, URLProcessor.pm: can now extract 'sub-stories' from within a story page
  
jmason lib/Sitescooper/
2003-05-31 Main.pm, Robot.pm: added -rss switch for RSS output
  
hubidubi site_samples/regional_hungary/hirek.site
2003-05-28 Site URL update
  
jmason lib/Sitescooper/Main.pm
2003-04-29 fix for plucker from Rik Wehbring
  
barrygonzaga site_samples/regional_philippines/pdi.site
2003-03-31 update and clean
  
jmason site_samples/regional_australia/abc_news_online.site
2003-03-03 added ABC News Online site from Wayne Osborn
  
hubidubi site_samples/linux/mysql_newsletter.site
2003-02-26 Site file for MySQL monthly newsletter
  
hubidubi site_samples/
2003-02-06 regional_hungary/freebsd_hu.site, regional_hungary/hup_hu.site, regional_hungary/linuxforum_hu.site, linux/footnotes.site: some site logo improvements
  
jmason site_samples/tech/pcmag_images.site
2003-01-22 added pcmag_images.site from Goh Boon Nam
  
jmason site_samples/weblog/eckes.site
2003-01-15 added eckes.site
  
jmason lib/Sitescooper/CacheSingleton.pm, lib/Sitescooper/DirCacheFactory.pm, lib/Sitescooper/PerSiteDirCache.pm, site_samples/regional_hungary/freebsd_hu.site
2003-01-15 freebsd_hu.site from Hubidubi
  
jmason lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm, site_samples/languages/php_net.site, site_samples/linux/debian_weekly_news.site, site_samples/linux/footnotes.site, site_samples/regional_hungary/hirek.site, site_samples/regional_hungary/hup_hu.site, site_samples/regional_hungary/linux_hu.site, site_samples/regional_hungary/linuxforum_hu.site, site_samples/regional_hungary/metro_hu.site, site_samples/regional_hungary/pdamania_hu.site
2002-11-15 many site updates from Hubidubi
  
barrygonzaga site_samples/regional_philippines/pdi.site
2002-11-03 -fix "letters" story url -fix "business" story url -fix "business" stories
  
barrygonzaga site_samples/humor/
2002-10-29 bofh-2k+1.site, bofh-2k.site: add description, clean up bad bold/italic markups, replaced
with

..

  
barrygonzaga site_samples/humor/
2002-10-28 bofh-2k+1.site, bofh-2k.site: add bofh 2k and 2k+1
  
jmason site_samples/sport/cnn_sports.site
2002-09-03 added cnn_sports site
  
jmason site_samples/linux/weekly_news.site
2002-09-03 updated weekly_news.site
  
jmason lib/Sitescooper/
2002-07-15 Main.pm, URLProcessor.pm: applied bugfix from Bernd Rellermeyer
  
barrygonzaga site_samples/sport/mobilebikes.site
2002-05-06 cycling newsletter
  
barrygonzaga site_samples/palmsized/the_register.site
2002-05-06 cleanup
  
barrygonzaga site_samples/
2002-05-06 bsd/openbsd_journal.site, news/gallup_poll.site, palm/palminfocenter.site, palm/pdalive.site, palmsized/salon.site, business/businessweek.site: obscured email address
  
barrygonzaga site_samples/palmsized/ny_times_handheld.site
2002-05-06 site restricted
  
barrygonzaga site_samples/regional_philippines/pdi.site
2002-05-06 - obscured email address - cleanups
  
barrygonzaga site_samples/palmsized/ny_times_handheld.site
2002-05-06 obscured email address
  
jmason site_samples/lib/layouts.site
2002-01-25 updated BBC layout from Akkana's site
  
jmason site_samples/regional_uk/digiguide_tv_listings.site
2002-01-22 Digiguide site re-submitted from Andy Carlson
  
jmason site_samples/linux/linuxtoday.site
2002-01-22 updated linuxtoday
  
jmason site_samples/
2002-01-22 science/new_scientist_news.site, security/hacker_news_network.site: hackernews gone
  
jmason site_samples/comics/
2002-01-21 better_half.site, between_friends.site, crock.site, curtis.site, dinette_set.site, edge_city.site, girls_and_sports.site, grin_and_bear_it.site, horrorscope.site, i_need_help.site, katzenjammer_kids.site, lockhorns.site, mallard_fillmore.site, moose_and_molly.site, new_breed.site, piranha_club.site, pops_place.site, redeye.site, rhymes_with_orange.site, safe_havens.site, sam_and_silo.site, six_chix.site, theyll_do_it_every_time.site, trudy.site, tumbleweeds.site, zippy_the_pinhead.site: re-added fixed comics from Yoon Fui Thean
  
jmason site_samples/
2002-01-19 admin/sitescooper_archive.site, bsd/oreillynet_bsd.site, business/cnn_financial.site, business/cnnfn.site, cinema/filmink-online.site, palmsized/cnn.site, regional_seattle/seattle_p_i.site, weblog/tim_oreilly.site: fixed some redirected links; removing duplicate CNN sites
  
jmason site_samples/
2002-01-19 business/hottips.site, linux/linuxplaza.site, opinion/feed.site, regional_germany/de_spiegel.site, regional_north_carolina/weather24_raleigh.site: more dead sites pruned
  
jmason site_samples/
2002-01-18 languages/aspwire.site, languages/news_perl_org.site, languages/perlmonth.site, languages/sqlwire.site, languages/vbwire.site, opinion/simson_garfinkel.site, tech/sendmail_net.site: removed lots of dead sites
  
jmason site_samples/
2002-01-18 business/financial_times.site, business/fox_market_wire.site, business/the_standard.site, business/the_street.site, cinema/cinescape.site, comics/better_half.site, comics/between_friends.site, comics/crock.site, comics/curtis.site, comics/dinette_set.site, comics/girls_and_sports.site, comics/grin_and_bear_it.site, comics/horrorscope.site, comics/i_need_help.site, comics/katzenjammer_kids.site, comics/lockhorns.site, comics/mallard_fillmore.site, comics/moose_and_molly.site, comics/new_breed.site, comics/piranha_club.site, comics/pops_place.site, comics/redeye.site, comics/rhymes_with_orange.site, comics/safe_havens.site, comics/sam_and_silo.site, comics/six_chix.site, comics/theyll_do_it_every_time.site, comics/trudy.site, comics/tumbleweeds.site, comics/zippy_the_pinhead.site, games/oswalds_6th_floor.site, humor/modern_humorist.site, languages/perlnews.site, linux/mandrake_pda.site, news/csmonitor.site, news/my_excite.site, palm/palmgear.site, palmsized/mercury_center_mobile.site, palmsized/the_standard.site, regional_chicago/chicago_tribune_arts_and_entertainment.site, regional_chicago/chicago_tribune_books.site, regional_chicago/chicago_tribune_cars.site, regional_chicago/chicago_tribune_commentary.site, regional_chicago/chicago_tribune_editorials.site, regional_chicago/chicago_tribune_friday.site, regional_chicago/chicago_tribune_good_eating.site, regional_chicago/chicago_tribune_health_and_family.site, regional_chicago/chicago_tribune_home_and_garden.site, regional_chicago/chicago_tribune_jobs.site, regional_chicago/chicago_tribune_kidnews.site, regional_chicago/chicago_tribune_magazine.site, regional_chicago/chicago_tribune_metro_chicago.site, regional_chicago/chicago_tribune_metro_dupage.site, regional_chicago/chicago_tribune_metro_lake.site, regional_chicago/chicago_tribune_metro_mchenry.site, regional_chicago/chicago_tribune_metro_northwest.site, regional_chicago/chicago_tribune_metro_southwest.site, regional_chicago/chicago_tribune_new_homes.site, regional_chicago/chicago_tribune_real_estate.site, regional_chicago/chicago_tribune_tempo.site, regional_chicago/chicago_tribune_transportation.site, regional_chicago/chicago_tribune_travel.site, regional_chicago/chicago_tribune_tv_week.site, regional_chicago/chicago_tribune_woman_news.site, regional_chicago/chicago_tribune_your_money.site, regional_chicago/chicago_tribune_your_place.site, regional_croatia/DHMZ_Hrvatska_danas.site, regional_croatia/DHMZ_Hrvatska_sutra.site, regional_croatia/DHMZ_Jadran.site, regional_croatia/DHMZ_Zagreb_danas.site, regional_croatia/DHMZ_Zagreb_sutra.site, regional_denmark/politiken.site, regional_denmark/valutakurser.site, regional_francais/01_informatique.site, regional_francais/afp.site, regional_francais/cinenouba.site, regional_germany/de_br_news.site, regional_germany/de_dwelle.site, regional_germany/de_kalenderblatt.site, regional_ireland/irish_times.site, regional_north_carolina/wral-tv.site, regional_philippines/manila_bulletin.site, regional_spain/telebasket_nba_spanish.site, regional_spain/telebasket_spain.site, regional_toronto/globe_and_mail_business.site, regional_uk/digiguide_tv_listings.site, security/securityfocus.site, sport/EurosportTV.site, sport/cnn_sports.site, sport/telebasket_nba.site, sport/thatsracin.site, sport/uk_sports_com.site, tech/cnet.site, tech/geeknews.site, tech/techweb.site: removed sites which now give HTTP 404s
  
jmason site_samples/
2002-01-18 regional_california/ocregister.site, science/hotair_features.site, sport/sportingnews.site: moved broken sites to 'broken' dir
  
jmason site_samples/
2002-01-18 linux/kde-dev-news.site, linux/rhad_rumor_mill.site, news/gallup_poll.site, opinion/idler.site, opinion/jaundiced_eye.site, opinion/slate_todays_papers.site, regional_denmark/sslug-nyheder.site, regional_francais/libe_q.site, science/spaceref.site, tech/rcfoc.site, web/webreference_experts.site: thoroughly outdated dead sites removed
  
jmason site_samples/bsd/daemonnews.site
2002-01-18 removed broken site
  
jmason site_samples/humor/jon_carroll.site
2002-01-18 added jon_carroll.site from Jan Lund Thomsen
  
jmason site_samples/regional_germany/de_tecchannel.site
2002-01-14 added de_tecchannel.site from Michael Schubart
  
jmason site_samples/
2002-01-14 cinema/imdb_studio_briefing.site, weblog/jason_pettus.site: added sites from Jan Lund Thomsen
  
jmason site_samples/regional_denmark/geekculture.site
2002-01-07 added sites from Jan Lund Thomsen
  
jmason lib/Sitescooper/Main.pm
2002-01-07 committed patch from Akkana to silence Plucker warning
  
jmason site_samples/regional_japan/jp_daily_yomiuri_english.site
2002-01-04 added jp_daily_yomiuri_english.site from Michael Schubart
  
jmason site_samples/regional_japan/jp_japan_times/
2002-01-02 jp_japan_times_business.site, jp_japan_times_news.site: added jp_japan_times sites from Michael Schubart
  
jmason lib/Sitescooper/Main.pm
2001-12-30 added fix for iSilo on win2k
  
jmason site_samples/comics/calvin_and_hobbes.site, t/html/newstories/index.html, t/html/newstories/1/page1_1.html, t/html/newstories/2/page2.html
2001-12-16 updated calvin and hobbes site from Gary Paulson
  
jmason site_samples/regional_denmark/politiken_daily_summary.site, t/html/scdiff2.html
2001-12-04 added politiken_daily_summary.site from Jan Lund Thomsen
  
jmason site_samples/humor/alexei_sayle.site, t/html/scdiff2.html
2001-12-04 added Alexei Sayle site from Jan Lund Thomsen
  
jmason lib/Sitescooper/CacheSingleton.pm, lib/Sitescooper/PerSiteDirCache.pm, t/html/http_redirect/front/currentdate/index.html, t/html/newstories/index.html, t/html/newstories/1/page1_1.html, t/html/newstories/2/page2.html
2001-12-04 backed out prev change; already fixed in CVS
  
jmason lib/PDA/PilotInstall.pm, lib/Sitescooper/CacheSingleton.pm, t/html/http_redirect/front/currentdate/index.html
2001-12-04 added fixes for problems reported by Andy Carlson
  
alastair site_samples/regional_australia/fairfax_it.site
2001-12-03 Fixed to work with the latest Fairfax site changes.
  
alastair site_samples/tech/zzz.site
2001-12-03 Updated site to include ContentsDiff (d'oh!)
  
jmason t/html/newstoriesdiff/index.html
2001-12-02 ..
  
jmason sitescooper.pl, lib/Sitescooper/Main.pm, t/html/newstoriesdiff/index.html
2001-12-02 added Torsten Uhlmann's isilo-X support patch
  
jmason lib/Sitescooper/Robot.pm, site_samples/regional_denmark/politiken.site
2001-11-25 updated politiken, from Claus Hindsgaul
  
jmason site_samples/linux/kc_kde.site
2001-11-12 updated kc_kde from Torsten Uhlmann
  
jmason site_samples/comics/family_circus.site
2001-10-31 family_circus.site from Thean Yoon Fui
  
barrygonzaga site_samples/palm/palminfocenter.site
2001-10-26 removed advertisement from contents fixed (P|p)olls.php link in contents
  
jmason default_templates.html, lib/Sitescooper/Main.pm, lib/Sitescooper/PerSiteDirCache.pm, lib/Sitescooper/URLProcessor.pm, site_samples/palmsized/the_guardian_palmsized.site
2001-10-06 fixed bug using -fromcache with shared cache
  
jmason site_samples/
2001-10-02 regional_uk/the_guardian.site, science/new_scientist.site: a few site updates
  
jmason site_samples/science/new_scientist.site
2001-10-02 updated newscientist site
  
jmason sitescooper.cf, lib/Sitescooper/DirCacheFactory.pm, lib/Sitescooper/Main.pm
2001-10-02 added __OUTPUTFORMAT__ support
  
jmason site_samples/science/sciam.site
2001-10-02 updated sciam site to honor caching
  
jmason lib/PDA/PilotInstall.pm
2001-09-27 fixed PDA::PilotInstall to work with later palm desktops and activeperls
  
jmason lib/PDA/PilotInstall.pm
2001-09-25 fixes from Tim Steele
  
barrygonzaga site_samples/palmsized/the_register.site
2001-09-21 used rss file, palm-friendly site is/was not updated regularly.
  
barrygonzaga site_samples/palmsized/beyond2000-pda.site
2001-09-21 used by2k's palm edition.
  
barrygonzaga site_samples/regional_philippines/pdi.site
2001-09-20 fixed sites erroneous links
  
barrygonzaga site_samples/palmsized/cnn.site
2001-09-19 added contents logo, removed duplicate
's on stories.
  
barrygonzaga site_samples/regional_philippines/
2001-09-19 manila_bulletin.site, pdi.site: renamed/moved category, regional_philippines *not* regional_phillipines.
  
jmason site_samples/
2001-09-17 bsd/openbsd_journal.site, palm/palminfocenter.site, palmsized/cnn.site, palmsized/ny_times_handheld.site, palmsized/the_register.site: site files from Barry Dexter A. Gonzaga
  
jmason site_samples/palmsized/the_guardian_palmsized.site
2001-09-14 Guardian site updated by Stewart C. Russell (stewart /at/ ref.collins.co.uk)
  
jmason site_samples/business/businessweek.site
2001-09-06 oops, forgot busweek
  
jmason site_samples/
2001-09-05 palm/pdalive.site, palmsized/ny_times.site, palmsized/salon.site, news/gallup_poll.site, palm/palminfocenter.site: added sites from Barry Dexter A. Gonzaga
  
jmason site_samples/regional_denmark/politiken.site
2001-08-27 added Politiken site from Claus Hindsgaul
  
jmason lib/Sitescooper/UserAgent.pm
2001-08-20 fixed http auth support
  
jmason site_samples/regional_toronto/
2001-08-18 globe_and_mail_columnists.site, globe_and_mail_national.site, globe_and_mail_thearts.site, globe_and_mail_toronto.site: globe+mail sites updated by Michael Graham (magog@the-wire.com)
  
jmason site_samples/regional_california/
2001-08-17 la_times.site, latimes_nat.site, latimes_oc.site, la_times/la_times_frontpage.site, la_times/latimes_local.site, la_times/latimes_nat.site, la_times/latimes_oc.site, la_times/latimes_science.site, la_times/latimes_tech.site, la_times/latimes_world.site: added new LA Times sites from Mark Beckman (mbeckman at jps.net), and reorged them into a directory
  
jmason site_samples/comics/
2001-08-16 flash_gordon.site, prince_valiant.site: Yoon Fui Thean: comics update
  
jmason site_samples/
2001-06-28 business/cnn_financial.site, news/cnn_mobile.site, science/sciam.site, sport/cnn_sports.site: added SciAm site from Marko, and some CNN sites from David's PODS system translated by Marko
  
jmason lib/Sitescooper/Main.pm
2001-06-28 added support for escaped-hashes in site files from Jeff Hecker
  
jmason site_samples/opinion/unblinking.site
2001-06-21 fixed typo
  
jmason TODO
2001-06-20 added Manila Bulletin site from Eric Pareja
  
jmason sitescooper.cf, doc/running.html
2001-06-19 fixed doco a little
  
jmason lib/Sitescooper/LWPHTTPClient.pm, lib/Sitescooper/Main.pm, lib/Sitescooper/SCF.pm, lib/Sitescooper/URLProcessor.pm, lib/Sitescooper/Util.pm, site_samples/tech/firstmonday.site
2001-06-16 added First Monday site, and worked around webserver bug
  
jmason Makefile
2001-06-11 fixed MANDIR in sitescooper make install
  
jmason site_samples/tech/the_register.site
2001-06-08 added sites
  
jmason site_samples/regional_germany/
2001-06-08 de_heise.site, de_sueddeutsche.site, de_sz/de_sz.site, de_sz/de_sz_drei.site, de_sz/de_sz_politik.site, de_sz/de_sz_sport.site, de_sz/de_sz_wissen.site, de_zeit/de_zeit.site, de_zeit/de_zeit_alternate.site, de_zeit/de_zeit_kultur.site, de_zeit/de_zeit_leben.site, de_zeit/de_zeit_media.site, de_zeit/de_zeit_politik.site, de_zeit/de_zeit_reisen.site, de_zeit/de_zeit_wirtschaft.site, de_zeit/de_zeit_wissen.site: new de_sz, de_zeit and de_heise sites from Peter Marschall
  
jmason site_samples/regional_germany/de_sz/
2001-06-08 de_sz.site, de_sz.site-halbwegs-ok, de_sz_bay.site, de_sz_bayern.site, de_sz_berlin.site, de_sz_beruf.site, de_sz_drei.site, de_sz_feuill.site, de_sz_feuilleton.site, de_sz_hochschule.site, de_sz_immobilien.site, de_sz_kultur.site, de_sz_literatur.site, de_sz_medien.site, de_sz_meinung.site, de_sz_muenchen.site, de_sz_nche.site, de_sz_pano.site, de_sz_panorama.site, de_sz_politik.site, de_sz_reise.site, de_sz_sonder.site, de_sz_sonderbeilage.site, de_sz_sport.site, de_sz_streifl.site, de_sz_streiflicht.site, de_sz_verkehr.site, de_sz_verm.site, de_sz_vier.site, de_sz_wirt.site, de_sz_wirtschaft.site, de_sz_wissen.site, de_sz_wochenende.site: new de_sz and de_zeit sites from Peter Marschall
  
jmason sitescooper.pl, lib/Sitescooper/Main.pm, lib/Sitescooper/PerSiteDirCache.pm, site_samples/languages/use_perl.site
2001-06-05 added mod to not copy up .cvsignore
  

(Scooped by sitescooper. Go back to the sitescooper page)