<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Life in the startup lane &#187; REST</title>
	<atom:link href="http://tomchikoore.com/category/rest/feed/" rel="self" type="application/rss+xml" />
	<link>http://tomchikoore.com</link>
	<description></description>
	<lastBuildDate>Tue, 31 Jan 2012 23:58:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>All I want for the new year is HTML 5</title>
		<link>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/</link>
		<comments>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/#comments</comments>
		<pubDate>Sat, 03 Jan 2009 04:27:57 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=74</guid>
		<description><![CDATA[While there have been many wishes and predictions for 2009, mine is simple, it’s HTML 5.  The adoption of HTML 5 specs by browsers, rendering engines and content publishers in this coming year will make 2009 a good year for me. As I have written in the past (here and here), content extraction is an [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">While there have been many wishes and predictions for 2009, mine is simple, it’s <a title="HTML 5" href="http://dev.w3.org/html5/spec/Overview.html" target="_blank">HTML 5</a>.  The adoption of HTML 5 specs by browsers, rendering engines and content publishers in this coming year will make 2009 a good year for me. As I have written in the past (<a title="A case for standardizing blog templates" href="http://tomchikoore.com/?p=21" target="_blank">here </a>and <a title="Yahoo! BOSS - the answer for web semantic analysis-based applications??" href="http://tomchikoore.com/?p=58" target="_blank">here</a>), content extraction is an often overlooked challenge that gets in the way of deriving web content semantics. This is an issue that often gets overlooked but for those of us who are passionate about extracting web content semantics, we understand how much it gets in the way of making much of the good work being done now even better. As we have seen recently, this is not an issue that is challenging only the small players, some of the major applications that rely on content extraction such a Google Alerts are seeing a degradation in content quality as they provide articles that have keyword hits in the navigations bars, ads and other non-content related text on web pages.</span></p>
<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">HTML 5 had taken steps in specifying how web content (e.g. news story, blog entry) should be represented in a page. The specification has attempted to structure a web page by separating different parts of a web page such as headers, footers, navigation, content etc.  The elements of HTML 5 that will help with content extraction are &lt;section&gt; and &lt;article&gt;.</span></p>
<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">The &lt;section&gt; element is described in the HTML 5 specification as follows,</span></p>
<blockquote><p><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">&#8220;The </span><code><span style="font-size: 10pt; color: #000000;">section</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element represents a generic document or application section. A section, in this context, is a thematic grouping of content, typically with a header, possibly with a footer.</span></em></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"><em>Examples of sections would be chapters, the various tabbed pages in a tabbed dialog box, or the numbered sections of a thesis. A Web site&#8217;s home page could be split into sections for an introduction, news items, contact information.&#8221;</em></span></p>
</blockquote>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;">Having an HTML element that groups content is very welcome.  The &lt;section&gt; element can be used to contain content such as a news article.  HTML 5 has gone one step further to make this possible by introducing the &lt;article&gt; element which the specification described as follows,</span></p>
<blockquote>
<p class="example" style="margin: auto 0in;"><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">&#8220;The </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element represents a section of a page that consists of a composition that forms an independent part of a document, page, or site. This could be a forum post, a magazine or newspaper article, a Web log entry, a user-submitted comment, or any other independent item of content.</span></em></p>
<p class="note" style="margin: auto 0in;"><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">An </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element is &#8220;independent&#8221; in that its contents could stand alone, for example in syndication. However, the element is still associated with its ancestors; for instance, contact information that applies to a parent </span><code><span style="font-size: 10pt; color: #000000;">body</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element still covers the </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> as well.&#8221;</span></em></p>
</blockquote>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> A structured implementation of the &lt;section&gt; and &lt;article&gt; elements by content publishers will go a long way in making content extraction simpler thereby providing for a small step in making web content semantic analysis easier.</span></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;">The HTML 5 specification has been out there for some time, its time for rendering engines to start implementing some of the new semantic oriented elements in the specification (some rendering engines have already started implementing parts of the specification). 2009 sounds like a good year for rendering engines, content publishers and content generation software to come together and help chart the course for web semantic analysis-based applications.</span></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"><strong>NOTE:</strong> HTML 5 contains other descriptive elements that help with the expression of semantics of textual data. I will get to those in future posts. </span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Yahoo! BOSS &#8211; the answer for web semantic analysis-based applications??</title>
		<link>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/</link>
		<comments>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/#comments</comments>
		<pubDate>Fri, 26 Sep 2008 08:57:24 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Blogroll]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=58</guid>
		<description><![CDATA[A couple of weeks ago, I attended the Yahoo Open Hack Day at the Yahoo Campus in Sunnyvale, CA.  At Open Hack Day, Yahoo opened up all their technologies for a few chosen hackers to play with and evaluate for a weekend.  The technology that I was most interested in was BOSS (Build your Own [...]]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago, I attended the Yahoo Open Hack Day at the Yahoo Campus in Sunnyvale, CA.  At Open Hack Day, Yahoo opened up all their technologies for a few chosen hackers to play with and evaluate for a weekend.  The technology that I was most interested in was BOSS (Build your Own Search Service). BOSS is &#8220;Yahoo!&#8217;s open search web services platform&#8221;.  Simply put, this means Yahoo has opened up its web index for anyone to use using the BOSS API.  This is unprecedented and opens up a ton of opportunities to advance some of the topics that I have discussed on this blog, primarily <a title="NLP: Unstructured thinking for unstructured data" href="http://tomchikoore.com/?p=22">NLP: Unstructured thinking for unstructured data</a> and <a title="2008 Web Search is still in 1979" href="http://tomchikoore.com/?p=27">2008 Web Search is still in 1979</a>.</p>
<p>As I have said in the past, the goal of the semantic web is still a long ways to be realized. However, rather than wait for every website owner to build semantic web conforming website (or retrofit their past content to be semantic web compliant), we should seek to derive web semantics at the application level using a whole new set of applications, web semantic analysis-based applications. Yahoo&#8217;s BOSS can be one of the missing components that pushes the ball forward towards this goal.</p>
<p>Surprisingly, one of the challenges of deriving web semantics is as simple as programmatically identifying and extracting the content from a web page (I have talked about this in a previous post: <a title="A case for standardizing blog templates" href="http://tomchikoore.com/?p=21">A case for standardizing blog templates</a>).  Before semantic analysis can be performed on a web page, the proper content must be extracted fom the web page first. As humans, when we look at a web page, we can readily distinguish the &#8220;main content&#8221; of a web page from  navigation bar, header, links or ads.  This is not so easy for computer programs to accomplish.  At <a title="Filtrbox - Know what you dont know" href="http://www.filtrbox.com">Filtrbox</a>, we have developed algorithms to accomplish this with a very high success rate only because we have devoted time and resources into the algorithms because they are core to our business.  Other application developers wishing to leverage web content semantics may not have the time and resources to build such algorithms because that is not core to their business. This is where Yahoo BOSS comes into the picture. We know that Yahoo has built its massive Web index by indexing the &#8220;main content&#8221; extracted from web pages.  Yahoo has  invested time and resources to solve the content extraction problem. In addition, they have built a massive infrastructure to index and store web content.  Therefore, instead of re-inventing the wheel, developers of applications that leverage web semantics can take advantage of Yahoo&#8217;s content extraction through the Yahoo BOSS API. However, Yahoo needs to open up a little more for this to be possible.</p>
<p>Here is where Yahoo needs to open up: Although Yahoo currently performs content extraction and content indexing, unfortunately the Yahoo BOSS API is not geared towards applications that analyze web data semantics.  The Yahoo BOSS API in its current form is geared towards web searches.  It is keyword query-based and returns at least TITLE, URL and ABSTRACT/EXCERPT.  Unfortunately, to move towards web semantic analysis-based applications, the ABSTRACT/EXCERPT alone is not enough.  Instead, the Yahoo BOSS API should return the WHOLE &#8220;main content&#8221; (not links,ads and navigation etc) of a web page.  Returning the whole content enables applications to perform semantic analysis on the data from millions of web pages that is stored in Yahoo&#8217;s web index, thereby adding value to the data and moving the ball forward towards unlocking the hidden value in web data using web semantic analysis-based applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Filtrbox vs. RSS readers/aggregators</title>
		<link>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/</link>
		<comments>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/#comments</comments>
		<pubDate>Tue, 23 Sep 2008 19:28:55 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=39</guid>
		<description><![CDATA[(NOTE: This article was originally published at my old blog) One of the questions that I am often asked is how Filtrbox is different from traditional RSS readers and aggregators.  The following are the major differences:   Closed Search Domain vs. Open Search Domain When using traditional RSS aggregators, the user supplies the list of [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-family: Arial;"><span style="font-size: small;">(NOTE: This article was originally published at my old blog)</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">One of the questions that I am often asked is how Filtrbox is different from traditional RSS readers and aggregators.<span>  </span>The following are the major differences:</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Closed Search Domain vs. Open Search Domain</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">When using traditional RSS aggregators, the user supplies the list of RSS feeds. This means that the domain of information gathered by a traditional RSS reader/aggregator is limited to the RSS feeds that are known to the user.<span>  </span>I call this a closed search domain. However, in an environment such the one we have today where thousands of new content sources are being created on a daily basis and anyone can potentially become a publisher, it is unrealistic to put the burden on the user to keep up with the thousands of new content sources that are sprouting up each day.<span>  </span>Filtrbox takes this burdensome responsibility away from the user and discovers the new content sources for the user because Filtrbox’s search domain covers all the new content sources. I call this an open search domain. The user can also add RSS feeds to the search domain, thereby guaranteeing that their RSS feeds of interest are searched.<span> </span>This approach leads to the user discovering new content sources.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Publisher centric vs. Content centric</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">Traditional RSS readers/aggregators present to the user all the content that is published by a specific publisher regardless of whether the user is interested in the content or not. Thus, the traditional RSS readers/aggregators implement a publisher centric information consumption model. On the other hand, Filtrbox implements a content centric information consumption model.<span>  </span>Rather than deliver to the user all the content published by a specific publisher, whether its relevant or not, Filtrbox allows the user to filter for the content that they are interested in from ANY publisher by providing contextual keywords. The content centric model implemented by Filtrbox greatly reduces information overload because each piece of content is examined and filtered for contextual relevance before it is delivered to the user.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>No filtering vs. Contextual relevance filtering</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">As indicated above, traditional RSS aggregators do not filter the content.<span>  </span>All content published by a publisher in the user’s closed search domain is delivered to the user regardless of whether it is relevant or not.<span>  </span>Filtrbox applies algorithms that filter content from an open search domain of publishers for contextual relevance.<span>  </span>Filtrbox uses multiple factors to determine the contextual relevance of content and assigns a score called FiltrRank.<span>  </span>The most important feature of the algorithm is that the contextual relevance algorithm learns from a Filtrbox user’s implicit interests and applies the implicit interest to future contextual relevance filtering. This means that the content delivered to the user is content that that specific user is interested in and not content other people are interested in.<span>  </span>Contextual relevance filtering plays a large part in the reduction of information overload.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Beyond RSS</strong></span></span></p>
<div class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">Unlike traditional RSS readers/aggregators, Filtrbox consumes content delivery formats beyond RSS. Filtrbox is capable of consuming both standard and proprietary content delivery formats.</span></span></div>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A case for standardizing blog templates</title>
		<link>http://tomchikoore.com/2008/02/04/a-case-for-standardizing-blog-templates/</link>
		<comments>http://tomchikoore.com/2008/02/04/a-case-for-standardizing-blog-templates/#comments</comments>
		<pubDate>Mon, 04 Feb 2008 21:06:29 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Agile Software Development]]></category>
		<category><![CDATA[Enterprise 2.0]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.wordpress.com/?p=21</guid>
		<description><![CDATA[Alex Isikold of AdaptiveBlue has published a great post on “How YOU can make the web more structured”.  A section of this post, &#8220;Standardizing Blog Templates Across Platforms&#8221;, really resonates with me.  Isikold is suggesting that blogging platforms such as WordPress and TypePad standardize their templates.  Why is this important?  To help answer this question, [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-family:Arial;"><a href="http://alexiskold.wordpress.com">Alex Isikold</a> of <a href="http://www.adaptiveblue.com/">AdaptiveBlue</a> has published a great post on <a href="http://tomchikoore.wordpress.com/wp-admin/post-new.php">“How YOU can make the web more structured”</a>.<span>  </span>A section of this post, &#8220;Standardizing Blog Templates Across Platforms&#8221;, really resonates with me.<span>  </span>Isikold is suggesting that blogging platforms such as WordPress and TypePad standardize their templates.<span>  </span>Why is this important?</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">To help answer this question, here is the Web 2.0 school of thought that I subscribe to:<span>  </span>Let’s start off with an enterprise database analogy. The basic assumption is that blogs are nothing but a data store.<span>  </span>While information in a blog makes for an interesting read, it is about as interesting as reading data in a text column in a relational database.<span>  </span>While the data in a single text column may have a lot of meaning, its meaning and usefulnes is enhanced when the data is combined with other columns in the same table in database, or with other tables in the same database, or even with data in other databases. The wealth of data is hidden in its interconnections with other data. In order to harvest the wealth of data in databases, applications are built on top of the databases that reference and make relational semantic inferences between the data in the database(s).<span>  </span>Today, blogs are the database(s). What is lacking are the applications that harvest the wealth of information stored in the blogs.<span>  </span>These are the applications that the next wave of Web 2.0 companies (including myself) are working on.</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">The pace of these next generation applications is being hindered by the lack of a consistent structure (standard) in blog data. What Isikold is bringing attention to is that unlike relational databases, which adhere to relational database management system standard (characterized by a simple TABLE/COLUMN/ROW+SQL structure that has been consistent over the years), blogs have no such standard. The structure of blogs is currently left up to the blogging platforms such a WordPress, Typepad etc. Blogging standards today are akin to having Oracle, SQL Server, MySQL each using a different standard for storing and retrieving information. Not only a different a standard for each of the databases, but a different standard for each version of each database.<span>  </span>Exacerbating the problem further, each of the different databases being customizable by anyone and anyone can change the standard to a standard of their liking. If these databases were is such a state, it would be very difficult to write any applications that leverage data from these databases. ODBC and JDBC standards would be very unreliable, if not useless.<span>  </span>Such is the state of the blogosphere today when one looks at it from a data interface perspective.</span><span style="font-family:Arial;"> </span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">As many of you know, I am currently devoted to work on the layer of applications that leverages the data in blogs and beyond in order make such data more useful to users.<span>  </span>The lack of standardization (as described above) makes it difficult to identify the content in blogs.<span>  </span>Content identification is important because an application needs to be able to identify the difference between actual blog post text and some other text on the blog so that analyses and inferences can be established appropriately.<span>  </span>I have been monitoring the different types of templates in an attempt to predict template patterns for the different blogging platforms (mainly WordPress, TypePad, Blogger, MovableType).<span>  </span>I came to the conclusion that pattern prediction is only successful to a certain point due to the following </span></p>
<blockquote><p><span style="font-family:Arial;">1) the original templates from the blogging platform vendor consists of multiple major and minor versions that do not have a predictable consistency in the template content tagging and </span></p></blockquote>
<blockquote><p><span style="font-family:Arial;">2) there are modified/hand coded templates floating out there which are totally unreliable. </span></p></blockquote>
<p><span style="font-family:Arial;">As a result of these observations, I have resorted to writing my own content identification algorithms that include a combination of template pattern predictor algorithms and NLP based semantic blog post text identification algorithms.<span>  </span>While this has served me well up to now, a blog template standard will be very beneficial not only to myself but many people who have not figured out how get past the problem. </span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Isikold is suggesting that a standard be adopted with the goal of giving blog templates a consistent structure.<span>  </span>This means the adoption of a template standard that identifies the different types of data on the different parts of bogs post. Isikold is suggesting that on a blog post, the template should make it easy to identify the </span><span style="font-family:Arial;">blog post text, the side bar, the name of the author, the data that blog post was published, the tags for the blog post content and the blog posts comments.<span>  </span>I believe an adoption of this simple template will go a long way in helping to bring the next wave of Web 2.0 applications to market faster.  I support a blog template standard.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/02/04/a-case-for-standardizing-blog-templates/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Correct RSS date format</title>
		<link>http://tomchikoore.com/2008/02/04/correct-rss-date-format/</link>
		<comments>http://tomchikoore.com/2008/02/04/correct-rss-date-format/#comments</comments>
		<pubDate>Mon, 04 Feb 2008 19:19:47 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Agile Software Development]]></category>
		<category><![CDATA[Enterprise 2.0]]></category>
		<category><![CDATA[Enterprise software]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.wordpress.com/?p=20</guid>
		<description><![CDATA[If you see a date like “01/02/07” in an RSS feed, what do you do?  You write a blog post about it.  The applications that I am working on are reliant on some calculations using RSS dates.  I have noticed that the RSS date specification is probably the most taken for granted part of the [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-family:Arial;">If you see a date like “01/02/07” in an RSS feed, what do you do?<span>  </span>You write a blog post about it.</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">The applications that I am working on are reliant on some calculations using RSS dates.<span>  </span>I have noticed that the RSS date specification is probably the most taken for granted part of the RSS spec.<span>  </span>It is taken for granted because many consumers of RSS program around the date inconsistencies so there is not much of an outcry.<span>  </span>However, when you see a date like 01/02/07, you have to stop and say something.</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">To those developers generating RSS feeds, please take a look at the RSS date format specifications as per the RSS specification.<span>  </span>I will summarize it here:</span><span style="font-family:Arial;"> </span></p>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">The RSS date must conform to the <a href="http://www.faqs.org/rfcs/rfc822.html">RFC-822</a> (refer to the BNF for &#8220;date-time&#8221;  in section 5) date time format.<span>  </span>Examples of this format are:</span><span style="font-family:Arial;"> </span></p>
<blockquote><p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Wed, 04 Feb 2008 08:00:00 EST</span></p></blockquote>
<blockquote><p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Wed, 04 Feb 2008 13:00:00 GMT</span></p></blockquote>
<blockquote><p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Wed, 04 Feb 2008 15:00:00 +0200</span><span style="font-family:Arial;"></span><span style="font-family:Arial;"> </span></p></blockquote>
<p><span style="font-family:Arial;"></span><span style="font-family:Arial;">Do not just execute a stringifying method on your date object before writing it to the RSS feed.<span>  </span>Set the date format to the above mentioned format first before writing it to the RSS feed.</span><span style="font-family:Arial;"> </span></p>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">To validate whether your date is correct, you can use <a href="http://feedvalidator.org/">http://feedvalidator.org</a></span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/02/04/correct-rss-date-format/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

