Archive >  2007 >  October >  17 Previous / Next

Scripting News, the weblog started in 1997 that bootstrapped the blogging revolution.

NY Times topics in OPML, the mother lode? Permanent link to this item in the archive.

A picture named shovel.jpgAmyloo was digging around the NY Times code weblog and found this OPML file, weighing in at a monstrous 3.3MB that contains some mysterious but rich data about the NY Times and a guide to using the Times to cover special topics that I don't think anyone outside the Times knew existed, but there it is, in a public folder, so lets have a look.

1. There are 10522 top-level headlines. There's no structure to the OPML, it's absolutely flat.

Here's an HTML rendering of the list: timestopics.html.

2. It's a subscription list. Each item has four attributes, type, title, htmlUrl and xmlUrl.

3. The htmlUrl for each element points to a page of stories for the topic. For example, here's a page of stories about table tennis. On that page is a link to an RSS 2.0 feed containing the same information.

4. The xmlUrl links for at least some of the elements are broken, the error appears to be very simple, if you replace the ampersand with a question mark, it works.

If you look around at the topics you'll see it's an incredibly rich set of data. Here are just some of the topics that begin with the letter T: Tableware, Taste, Tattoos, Tax Credits, Tax Evasion, Taxation, Taxicabs and Taxicab Drivers, Tea, Teachers and School Employees, TED Conference News, Teflon, Telephones and Telecommunications, Television, Television Sets, Table Tennis, Terra Cotta, Terrorism, Tests and Testing, Textbooks, Thanksgiving Day.

NY Times metadata Permanent link to this item in the archive.

A picture named accordion.gifIf you do a View Source on a NY Times story, you'll see that there's lots of metadata in the HTML, including keywords for most of the of the stories.

Behind the keywords is a taxonomy that I haven't seen, but would like to. I asked them to make this public, both at my meeting there last Thursday and in a phone talk this morning. I think there could be a lot of value in the Times taxonomy, it might even set a standard.

In the meantime, I wrote a script last night that tracks the keywords in NY Times stories as they flow through the nytimesriver application. Here's a report that's updated once per hour.

Obviously it would be interesting to be able to click on the keywords to see what articles reference each of the keywords. And it would also be nice to have a cumulative list and a daily list. Right now all we have is the cumulative version.

But it's still pretty interesting, bordering on fascinating to think of the possibilities if they provide the framework behind these keywords.

When the pros try to figure out how what they do will continue to make sense after the Internet achieves all its promise, this may be an example. The metadata is generated by librarians, and we don't as yet have our own librarians in the blogosphere (though some might disagree). And it's possible that after a release of the taxonomy that something like Wikipedia may happen, with the public taking over maintenence of the taxonomy. No one knows what will happen, but one thing seems clear, there can be value in a news organization beyond the reporting and editing it does.

Unsung flow-builders Permanent link to this item in the archive.

Over the last week, I've been writing about the disconnect between flow and rank. Paradoxically, sites that are ranked high don't always deliver a lot of hits when they link to you.

On the flipside, there are some sites that are rarely on Top 100 lists, or talked about very much, that deliver substantial flow. Two of them stand out, one a veteran site, and the other a relative newcomer.

A picture named phillies.gif1. Daring Fireball is a thoughtful blog written by John Gruber that focuses on the Macintosh. Since I've returned to the Mac in 2005, and have been writing more about Mac issues, I've started getting links from this site, and when I do, they usually send between 1000 and 2000 readers my way. And they're generally interesting people with useful information and ideas. I follow Gruber on Twitter and have learned that he is a Phillies fan and therefore disappointed this year. His posts are interesting there too, and irreverent, which I like of course.

2. A Digg-like memetracker, is in the same league as TechMeme, about 1000 hits for a highly ranked piece. I don't know much about the site, I'm not a regular reader, and I don't know much about the people who visit from this site.

iPhone SDK coming in Feb Permanent link to this item in the archive.

Apple announced that there will be an SDK for the iPhone.


Funny sign at Web 2.0 Permanent link to this item in the archive.

A picture named eatAolsLunch.jpg

Thanks to Bijan Sabet!

Nokia N810 Permanent link to this item in the archive.

A picture named n810.gif

Just read about this on Engadget.

I know there's a Nokia breakfast in SF starting at 8AM, which I will not be able to make, but as an N800 user, if this product really is coming, I can see two thing right off the bat that address major problems with the previous model. 1. Nokia makes good keyboards, but the old model doesn't have one. On-screen keyboards are a pain, even relatively good ones like the one in the iPhone, but the one in the N800 is not particularly good. 2. The other notable feature is the screen resolution, which looks pretty fantastic.

Anyway, I've asked my contacts at Nokia for info as soon as it's available, but it seems like the Engadget guys are on top of it. If you have any more info, please post a comment here. Thanks.


Nokia did announce the N810 (data sheet pdf).

Here's a high-res picture.

A video showing the N810 in action.

Flors: What the N810 means for maemo developers.

Apple's iPhone SDK announcement Permanent link to this item in the archive.

Note: There was no permalink for the story on Apple's news website, here's the full text, with permalink. DW

Let me just say it: We want native third party applications on the iPhone, and we plan to have an SDK in developers’ hands in February. We are excited about creating a vibrant third party developer community around the iPhone and enabling hundreds of new applications for our users. With our revolutionary multi-touch interface, powerful hardware and advanced software architecture, we believe we have created the best mobile platform ever for developers.

It will take until February to release an SDK because we’re trying to do two diametrically opposed things at once—provide an advanced and open platform to developers while at the same time protect iPhone users from viruses, malware, privacy attacks, etc. This is no easy task. Some claim that viruses and malware are not a problem on mobile phones—this is simply not true. There have been serious viruses on other mobile phones already, including some that silently spread from phone to phone over the cell network. As our phones become more powerful, these malicious programs will become more dangerous. And since the iPhone is the most advanced phone ever, it will be a highly visible target.

Some companies are already taking action. Nokia, for example, is not allowing any applications to be loaded onto some of their newest phones unless they have a digital signature that can be traced back to a known developer. While this makes such a phone less than “totally open,” we believe it is a step in the right direction. We are working on an advanced system which will offer developers broad access to natively program the iPhone’s amazing software platform while at the same time protecting users from malicious programs.

We think a few months of patience now will be rewarded by many years of great third party applications running on safe and reliable iPhones.


P.S.: The SDK will also allow developers to create applications for iPod touch. [Oct 17, 2007]


Last update: Wednesday, October 17, 2007 at 7:42 PM Pacific.

Dave Winer, 52, pioneered the development of weblogs, syndication (RSS), podcasting, outlining, and web content management software; former contributing editor at Wired Magazine, research fellow at Harvard Law School, entrepreneur, and investor in web media companies. A native New Yorker, he received a Master's in Computer Science from the University of Wisconsin, a Bachelor's in Mathematics from Tulane University and currently lives in Berkeley, California.

"The protoblogger." - NY Times.

"The father of modern-day content distribution." - PC World.

One of BusinessWeek's 25 Most Influential People on the Web.

"Helped popularize blogging, podcasting and RSS." - Time.

"The father of blogging and RSS." - BBC.

"RSS was born in 1997 out of the confluence of Dave Winer's 'Really Simple Syndication' technology, used to push out blog updates, and Netscape's 'Rich Site Summary', which allowed users to create custom Netscape home pages with regularly updated data flows." - Tim O'Reilly.

Dave Winer Mailto icon

My most recent trivia on Twitter.

Comment on today's
Scripting News

On This Day In: 2006 2005 2004 2003 2002 2001 2000 1999 1998 1997.

October 2007
Sep   Nov

Lijit Search
Things to revisit:

1.Microsoft patent acid test.
2.What is a weblog?
3.Advertising R.I.P.
4.How to embrace & extend.
5.Bubble Burst 2.0.
6.This I Believe.
7.Most RSS readers are wrong.
8.Who is Phil Jones?
9.Send them away.
10.Negotiate with users.
11.Preserving ideas.
12.Empire of the Air.
13.NPR speech.
14.Russo & Hale.
15.Trouble at the Chronicle.
15.RSS 2.0.
16.Checkbox News.
17.Spreadsheet calls over the Internet.
18.Twitter as coral reef.
19.Mobs of the blogosphere.
20.Advice for Campaigns.
21.Social Cameras.
22.The Next Big Thing.
23.It's time to open up networking, again.
24.Am I competing?
25.Time to shake up conferences?
26.Bloggers working with journalists.

Teller: "To discover is not merely to encounter, but to comprehend and reveal, to apprehend something new and true and deliver it to the world."

Click here to see a list of recently updated OPML weblogs.

Click here to read blogs commenting on today's Scripting News.

Morning Coffee Notes, an occasional podcast by Scripting News Editor, Dave Winer.

KitchenCam 1.0

Click here to see an XML representation of the content of this weblog.

Click here to view the OPML version of Scripting News.

© Copyright 1997-2007 Dave Winer.

Previous / Next