WebSeitz/wikilog
Publish And Subscribe
Whenever any Form of Government becomes destructive of these ends, it is the Right of the People to alter or to abolish it, and to institute new Government, laying its foundation on such principles and organizing its powers in such form, as to them shall seem most likely to effect their Safety and Happiness.

(backlinks off) (map off)
(search off)
last edited by BillSeitz on Jul 28, 2008 5:28 pm

aka

In the context, this is about handling a reader's interest in multiple blogs. Specifically, having a mechanism so that the reader gets alerted to changes (new postings).

[Russ Lipton]'s page talks about various points, including the cognitive issue of browsing hundreds of subscribed blogs. (See for ideas on that.)

Why is anything resembling real-time necessary? Would it be such a bad thing if everyone just grabbed stuff once a day?


polling (not really )

An typically has a single global setting (sometimes not even user-editable) controlling how frequently it polls every subscribed blog to see whether it's changed.

Some aggregators do an [GET], which can involve grabbing a fair amount of data if the blog includes its full content.

Some aggregators (I believe) just do an [HEAD] first, so the blog server responds with the date-time of the last change - then the aggregator compares that to the last time it got data, and if the last change is more recent it then does a full [GET].

If an aggregator passes as part of its request the date-time it most recently got data, theoretically the blog server can decide to send no data if there's been no change. That reduces/avoids the bandwidth concern, but adds some computational overhead to each such request. But probably the best trade-off. See

At the opposite end of the cooperativeness scale, 's had horrible problems with .


Another approach might be to get last-update info from a separate aggregator [Ping Service], like .

readers should probably check here before asking individual sites for updated . This would improve scalability (vs current practice of grabbing every chosen site's every 30-60 min regardless of whether it's been updated).

How handle a world of non-centralized weblogs.com sites? Assuming that each blog picks a single ping-site to ping, they could store that site's [URL] in (a) a link tag (just like many sites point to their feed) and (b) an channel property.

An issue is what to do when you've been offline for more than 3 hours (or haven't been running your ping-checker). I guess:

[Phil Ringnalda] describes the apparently-common practice of pinging multiples services.


True

, - using or (over only?) - http://www.thetwowayweb.com/soapmeetsrss and walkthrough

directions for

Some -base cloning by .

[DJAdams] did something similar with . There's a spec

I could see this making sense for use while the reader's machine is online, then with a batch catch-up process after a period offline. You'd want the update messages dumped into your for prioritizing.

While it's not an issue for engines that render upstream to the server separately from saving content, simpler systems there are lots of incremental changes to items (esp on a wiki, vs a weblog). Does it really make sense to shove all those "saves" out to the network? No, it makes sense to use some sort of periodicity/batching model. One would be to have the author trigger an updated-ping to his subscibers when he thinks it's appropriate (I manually ping ). Another would be to have an agent check for changes on a time period (half an hour?), then generate updated-ping messages.

Will we hit a point where there are 10million+ blogs? And 10million+ blog readers? If everyone's sucking in lots more stuff than they can or want to read (so their can rate/filter it), what does that imply for bandwidth scalability? Will -s run -s? Or will writers start to look for ways to limit the number of semi-readers they get? "If I subscribe to your feed then you get real-time pings, otherwise you can only read/poll once a day."

Maybe this isn't a big deal if people use some existing features? ()


Other contexts

on wanting it for

module

[Web Sphere] [Java Messaging System] support

Traditional enterprise messaging like [TibCo].

Shrook's Distributed Checking --, 2003/07/21 16:06 [GMT]
described here - basically a cluster of servers tracking last-updated times for many feeds

See : | | | | | | | | | | | | | | | |


 




Bill Seitz, fluxent at gmail dot com, Weblog