<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Life in the startup lane &#187; Web 2.0</title>
	<atom:link href="http://tomchikoore.com/category/web-20/feed/" rel="self" type="application/rss+xml" />
	<link>http://tomchikoore.com</link>
	<description></description>
	<lastBuildDate>Tue, 31 Jan 2012 23:58:37 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>What does your internal collaboration enterprise social graph look like?</title>
		<link>http://tomchikoore.com/2011/09/22/what-does-your-internal-collaboration-enterprise-social-graph-look-like/</link>
		<comments>http://tomchikoore.com/2011/09/22/what-does-your-internal-collaboration-enterprise-social-graph-look-like/#comments</comments>
		<pubDate>Thu, 22 Sep 2011 23:41:55 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Enterprise 2.0]]></category>
		<category><![CDATA[Enterprise software]]></category>
		<category><![CDATA[social graph]]></category>
		<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=402</guid>
		<description><![CDATA[Whats does your internal collaboration enterprise social graph look like? Let&#8217;s cut to the chase, this is how it should NOT look like: Fig 1: Internal Collaboration Enterprise Social Graph of the US Senate &#8211; 2009  (Source: Slate.com) The graph you are seeing above is a visual representation of internal collaboration enterprise social graph of [...]]]></description>
			<content:encoded><![CDATA[<p>Whats does your internal collaboration enterprise social graph look like?</p>
<p>Let&#8217;s cut to the chase, this is how it should NOT look like:</p>
<p style="text-align: center;"><a href="http://tomchikoore.com/wp-content/uploads/2011/09/Screen-shot-2011-08-11-at-8.13.48-AM.png"><img class="size-full wp-image-403 aligncenter" title="Internal Collaboration Enterprise Social Graph of US Senate (2009)" src="http://tomchikoore.com/wp-content/uploads/2011/09/Screen-shot-2011-08-11-at-8.13.48-AM.png" alt="Internal Collaboration Enterprise Social Graph of US Senate in 2009 by Slate (Credit: Slate.com)" width="277" height="384" /></a></p>
<p><em><strong>Fig 1: Internal Collaboration Enterprise Social Graph of the US Senate &#8211; 2009  (Source: Slate.com)</strong></em></p>
<p>The graph you are seeing above is a visual representation of internal collaboration enterprise social graph of the US Senate [This force directed graph from Slate is based on votes in 2009 (I will be working on a 2011 graph). Each blue dot represents a Democratic senator, each red dot represents a Republican senator. A line connects two senators when they voted the same way on 65 percent of the votes]. The force directed graph clusters dots with the most connections to each other, pushing away dots with the least connections and as a result we can visually identify the people who collaborate with each the most and those who collaborate the least.</p>
<p>In my engagements with various companies, I have observed that many people draw an enterprise social graph as dots connected by lines, some even go as far as drawing pretty clusters of dots connected by lines as shown in the US Senate graph above. While such a graph is not incorrect, it is not what you want your internal collaboration enterprise social graph to look like if you are to have effective internal collaboration. An effective internal collaboration enterprise social graph looks like this:</p>
<p style="text-align: center;"><a href="http://tomchikoore.com/wp-content/uploads/2011/09/Screen-shot-2011-08-11-at-8.13.17-AM.png"><img class="size-full wp-image-406  aligncenter" title="Internal Collaboration Enterprise Social Graph" src="http://tomchikoore.com/wp-content/uploads/2011/09/Screen-shot-2011-08-11-at-8.13.17-AM.png" alt="Internal Collaboration Enterprise Social Graph" width="301" height="259" /></a></p>
<p><em><strong>Fig 2: Effective Internal Collaboration Enterprise Social Graph <em><strong>(Source: Tom Chikoore -<a title="http://tomchikoore.com" href="http://tomchikoore.com/">http://tomchikoore.com</a>)</strong></em><br />
</strong></em></p>
<p>It&#8217;s not pretty, it&#8217;s messy but it&#8217;s effective. Each dot represents an employee and each line represents two employees who follow each other on an internal social enterprise collaboration community. This means each line represents a &#8220;bi-directional&#8221; follow between two employees. A bidirectional follow represents an open communication channel for effective collaboration. The color of each dot represents department in the organization. The following is a legend for some of the organizational departments shown in the graph:</p>
<p>BLUE &#8211; Sales<br />
RED &#8211; Engineering<br />
SALMON &#8211; Product Marketing<br />
FLUORESCNET GREEEN &#8211; Product Management<br />
ORANGE &#8211; Marketing<br />
BLACK &#8211; Professional Services<br />
ACQUA &#8211; Support<br />
QUALITY ASSURANCE &#8211; Saddle Brown<br />
LIME GREEN &#8211; Vendor</p>
<p>From this graph, you can see that the departments that one would expect to collaborate with each other are collaborating with each other. This is a beautiful visualization: Notice how the Support team is positioned between Engineering and Professional Services; and Professional Service is positioned between Support and Sales. This is the type of relationship you would expect to see is the majority of product development organizations. QA is positioned right next to Engineering and Product Management is nestled between Engineering Product Marketing, and Sales. This is what you would expect in most organizations, this is what your internal collaboration enterprise social graph should look like.</p>
<p>You can think of graph depicted above as a depiction of the &#8220;internal collaboration DNA&#8221; of your organization. Using this graph and the right algorithms, an assessment of the health of your community is possible. The right tools can help you identify collaboration problems and suggest solutions to fix the problems to enable effective internal collaboration.</p>
<p>For more information on the social graph and community health assessment tools send me an email at tom AT tomchikoore.com</p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2011/09/22/what-does-your-internal-collaboration-enterprise-social-graph-look-like/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>INFLUENCE &#8211; Why every company should care.</title>
		<link>http://tomchikoore.com/2009/07/01/influence-why-every-company-should-care/</link>
		<comments>http://tomchikoore.com/2009/07/01/influence-why-every-company-should-care/#comments</comments>
		<pubDate>Wed, 01 Jul 2009 05:04:05 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[filtrbox]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=115</guid>
		<description><![CDATA[Several weeks ago, at Filtrbox, we shared some of our internal functionality with the public via the Filtrbox Twitter Influence scoring page.  The Filtrbox Twitter Influence scoring page, which has turned out to be a hit among many, allows anyone to check the Twitter influence of any Twitter user. Some of our users have had [...]]]></description>
			<content:encoded><![CDATA[<p>Several weeks ago, at <a title="Filtrbox Inc " href="http://www.filtrbox.com/" target="_blank">Filtrbox</a>, we shared some of our internal functionality with the public via the <a title="Filtrbox Twitter Influence Scoring Page" href="http://www.filtrbox.com/twitter.htm" target="_blank">Filtrbox Twitter Influence scoring page</a>.  The Filtrbox Twitter Influence scoring page, which has turned out to be a hit among many, allows anyone to check the Twitter influence of any Twitter user. Some of our users have had some good fun with it for the purposes of ego boosting or ego busting. While we appreciate the versatility of purpose of our technology, the purpose of the Filtrbox Twitter Influence scoring page goes beyond a bragging rights tool.  The Filtrbox Twitter Influence scoring page provides a means to gauge the &#8220;reach&#8221; of mentions on  Twitter by measuring the influence of the &#8220;mentioner&#8221;  (Twitter is only one of the many conversation venues whose participants&#8217; influence Filtrbox tracks). In this blog post, I would like to impress upon the reader that, going forward, the measurement of “influence&#8221; in social media conversation venues, such as Twitter,  should be integrated as part of all “message reach analysis” activity that a company performs.</p>
<p> </p>
<p>Given the fact that conversation venues, such as Twitter, democratize the notion of &#8220;reach&#8221;  by providing a venue where anyone can mention anything (including your brand) to an organic audience (original target audience+viral audience), it is imperative that  brand protecting companies,</p>
<p> </p>
<p>1) Track mentions of the company’s brand (s)</p>
<p>2) Analyze the influence (“reach analysis”) of the people who mention a company&#8217;s brand(s)</p>
<p> </p>
<p>As social media networks become entrenched conversation venues where participants discuss anything under the sun including company brands, &#8220;reach analysis&#8221; needs to be expanded beyond messages that originate from a company&#8217;s marketing department.  This is the first step in acknowledging that there are other messages that are emanating from places other than your marketing department.  Those messages you cannot control. However, you can manage the conversations that the messages produce. In order to manage messages that result in conversations about your brand, regardless of their origin, brand mentions need to be effectively monitored and the message reach effectively analyzed.</p>
<p> </p>
<p>Consider the following example: Every brand protecting company’s nightmare is seeing the following brand mention (message) on Twitter (or any conversation venue e.g. Facebook, Blog comment, Online newspaper comment)</p>
<p> </p>
<p><em><strong>&#8220;(put your brand here) sucks!!&#8221;</strong></em></p>
<p> </p>
<p>The next time people Google your brand; you do not want this to be the first brand mention they see. It well can be, if you do not properly manage the conversation that emanates from this mention. Therefore, before you react to the mention, it is important that you perform a &#8220;reach analysis&#8221; of the mention (measure the “influence” of the “mentioner”) in order to understand the authority of the person who made the brand mention, the nature of the venue in which it was made and the number of people who potentially saw the mention.  Performing such a reach analysis gives you the ability to assess an appropriate entry into the conversation and gives you a basis for formulating an approach on how to manage the conversation going forward. Products like Filtrbox simplify the “reach analysis” determination through Twitter Influence scoring and FiltrRank scoring.</p>
<p> </p>
<p>In closing, it is important that ALL companies pay attention to “influence” in social media conversation venues. Think of “influence” as good old “reach analysis”, except the message whose reach needs to be analyzed is not coming solely from your marketing department &#8211; its coming from anyone, its coming from everywhere and, in a real-time information environment, its coming fast.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2009/07/01/influence-why-every-company-should-care/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The death of the Rocky Mountain News, the rise of Filtrbox</title>
		<link>http://tomchikoore.com/2009/03/24/the-death-of-the-rocky-mountain-news-the-rise-of-filtrbox/</link>
		<comments>http://tomchikoore.com/2009/03/24/the-death-of-the-rocky-mountain-news-the-rise-of-filtrbox/#comments</comments>
		<pubDate>Tue, 24 Mar 2009 21:38:10 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[content]]></category>
		<category><![CDATA[media]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=92</guid>
		<description><![CDATA[Several weeks ago, while purchasing a commemorative copy of the Rocky Mountain News, I came to the realization that two distinct stories, symbolic of the shift in media landscape, were playing themselves out on both ends of US-36. In Denver, The Rocky Mountain News, a symbol of traditional mainstream media, was closing down after almost [...]]]></description>
			<content:encoded><![CDATA[<p>Several weeks ago, while purchasing a commemorative copy of the Rocky Mountain News, I came to the realization that two distinct stories, symbolic of the shift in media landscape, were playing themselves out on both ends of US-36. In Denver, The Rocky Mountain News, a symbol of traditional mainstream media, was closing down after almost 150 years of publishing. In Boulder, at Filtrbox, a young new media company, we were celebrating the release of the latest version of our service, Filtrbox G2. While the people at the Rocky Mountain News were probably not aware of Filtrbox, I had a keen eye on the daily goings on at The Rocky and I looked at the whole situation at the Rocky as a symbolic passing of the media torch.</p>
<p> </p>
<p>As a long time resident of the state of Colorado, its was tough buying the last copy of the Rocky. As the CTO of Filtrbox, I lamented the loss of yet another mainstream content source. Contrary to what many may expect, in my opinion, the loss of content source like the Rocky is no cause for celebration at Filtrbox. The reason is that the death of a medium, such the newspaper, is a natural cycle; media have come and gone over the years. However, one thing that has remained constant is the content.  There is no substitute for good content. Whether Mike Littwin&#8217;s dispatches from the political stump or Dave Krieger&#8217;s Broncos inside scoop or Penny Parker&#8217;s celebrity sightings around town are delivered via pony express, the telegraph, the tabloid, the broadsheet, the web or a Filtrbox Daily Briefing, its all all great compelling content that I want to read on a regular basis. Thus, the death of the Rocky was by no means of verdict on content, it is a verdict on the medium in which the content is delivered. </p>
<p> </p>
<p>Filtrbox is providing new ways for discovering and delivering content using new media. Instead of a newspaper being delivered to your porch every morning, Filtrbox delivers a daily briefing to your inbox every morning.  In addition, Filtrbox provides various other means of consuming the content. But at the end of the day Filtrbox has to deliver content, quality content. The death of the Rocky results in one less source of content for Filtrbox users.  Content diversity is paramount if our users  are to be be well informed. Many have said, mainstream content will be replaced by blogs. However, that assumption is not reflected in the information consumption patterns that we see on a daily basis. At Filtrbox we interface with a variety of consumers of content and observe that information consumers like diversity.  Just as much as they want the thought stream in the blogosphere, they also want to know what is being said in mainstream media and on micro blogs and other sources. People simply want good content that keeps them well informed.</p>
<p> </p>
<p>So, to the journalists who were at the the old media companies like the Rocky Mountain News and the Seattle Post Intelligencer, I say, there is still demand for your content; newspaper as a medium to deliver your content may be dying but other means to deliver your content are on the rise. Keep writing great content, the content industry is not dead.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2009/03/24/the-death-of-the-rocky-mountain-news-the-rise-of-filtrbox/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>All I want for the new year is HTML 5</title>
		<link>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/</link>
		<comments>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/#comments</comments>
		<pubDate>Sat, 03 Jan 2009 04:27:57 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=74</guid>
		<description><![CDATA[While there have been many wishes and predictions for 2009, mine is simple, it’s HTML 5.  The adoption of HTML 5 specs by browsers, rendering engines and content publishers in this coming year will make 2009 a good year for me. As I have written in the past (here and here), content extraction is an [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">While there have been many wishes and predictions for 2009, mine is simple, it’s <a title="HTML 5" href="http://dev.w3.org/html5/spec/Overview.html" target="_blank">HTML 5</a>.  The adoption of HTML 5 specs by browsers, rendering engines and content publishers in this coming year will make 2009 a good year for me. As I have written in the past (<a title="A case for standardizing blog templates" href="http://tomchikoore.com/?p=21" target="_blank">here </a>and <a title="Yahoo! BOSS - the answer for web semantic analysis-based applications??" href="http://tomchikoore.com/?p=58" target="_blank">here</a>), content extraction is an often overlooked challenge that gets in the way of deriving web content semantics. This is an issue that often gets overlooked but for those of us who are passionate about extracting web content semantics, we understand how much it gets in the way of making much of the good work being done now even better. As we have seen recently, this is not an issue that is challenging only the small players, some of the major applications that rely on content extraction such a Google Alerts are seeing a degradation in content quality as they provide articles that have keyword hits in the navigations bars, ads and other non-content related text on web pages.</span></p>
<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">HTML 5 had taken steps in specifying how web content (e.g. news story, blog entry) should be represented in a page. The specification has attempted to structure a web page by separating different parts of a web page such as headers, footers, navigation, content etc.  The elements of HTML 5 that will help with content extraction are &lt;section&gt; and &lt;article&gt;.</span></p>
<p><span style="font-size: 10pt; color: #000000; font-family: Verdana;">The &lt;section&gt; element is described in the HTML 5 specification as follows,</span></p>
<blockquote><p><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">&#8220;The </span><code><span style="font-size: 10pt; color: #000000;">section</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element represents a generic document or application section. A section, in this context, is a thematic grouping of content, typically with a header, possibly with a footer.</span></em></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"><em>Examples of sections would be chapters, the various tabbed pages in a tabbed dialog box, or the numbered sections of a thesis. A Web site&#8217;s home page could be split into sections for an introduction, news items, contact information.&#8221;</em></span></p>
</blockquote>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;">Having an HTML element that groups content is very welcome.  The &lt;section&gt; element can be used to contain content such as a news article.  HTML 5 has gone one step further to make this possible by introducing the &lt;article&gt; element which the specification described as follows,</span></p>
<blockquote>
<p class="example" style="margin: auto 0in;"><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">&#8220;The </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element represents a section of a page that consists of a composition that forms an independent part of a document, page, or site. This could be a forum post, a magazine or newspaper article, a Web log entry, a user-submitted comment, or any other independent item of content.</span></em></p>
<p class="note" style="margin: auto 0in;"><em><span style="font-size: 10pt; color: #000000; font-family: Verdana;">An </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element is &#8220;independent&#8221; in that its contents could stand alone, for example in syndication. However, the element is still associated with its ancestors; for instance, contact information that applies to a parent </span><code><span style="font-size: 10pt; color: #000000;">body</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> element still covers the </span><code><span style="font-size: 10pt; color: #000000;">article</span></code><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> as well.&#8221;</span></em></p>
</blockquote>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"> A structured implementation of the &lt;section&gt; and &lt;article&gt; elements by content publishers will go a long way in making content extraction simpler thereby providing for a small step in making web content semantic analysis easier.</span></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;">The HTML 5 specification has been out there for some time, its time for rendering engines to start implementing some of the new semantic oriented elements in the specification (some rendering engines have already started implementing parts of the specification). 2009 sounds like a good year for rendering engines, content publishers and content generation software to come together and help chart the course for web semantic analysis-based applications.</span></p>
<p class="example" style="margin: auto 0in;"><span style="font-size: 10pt; color: #000000; font-family: Verdana;"><strong>NOTE:</strong> HTML 5 contains other descriptive elements that help with the expression of semantics of textual data. I will get to those in future posts. </span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2009/01/03/all-i-want-for-the-new-year-is-html-5/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Yahoo! BOSS &#8211; the answer for web semantic analysis-based applications??</title>
		<link>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/</link>
		<comments>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/#comments</comments>
		<pubDate>Fri, 26 Sep 2008 08:57:24 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Blogroll]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Social Networking]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=58</guid>
		<description><![CDATA[A couple of weeks ago, I attended the Yahoo Open Hack Day at the Yahoo Campus in Sunnyvale, CA.  At Open Hack Day, Yahoo opened up all their technologies for a few chosen hackers to play with and evaluate for a weekend.  The technology that I was most interested in was BOSS (Build your Own [...]]]></description>
			<content:encoded><![CDATA[<p>A couple of weeks ago, I attended the Yahoo Open Hack Day at the Yahoo Campus in Sunnyvale, CA.  At Open Hack Day, Yahoo opened up all their technologies for a few chosen hackers to play with and evaluate for a weekend.  The technology that I was most interested in was BOSS (Build your Own Search Service). BOSS is &#8220;Yahoo!&#8217;s open search web services platform&#8221;.  Simply put, this means Yahoo has opened up its web index for anyone to use using the BOSS API.  This is unprecedented and opens up a ton of opportunities to advance some of the topics that I have discussed on this blog, primarily <a title="NLP: Unstructured thinking for unstructured data" href="http://tomchikoore.com/?p=22">NLP: Unstructured thinking for unstructured data</a> and <a title="2008 Web Search is still in 1979" href="http://tomchikoore.com/?p=27">2008 Web Search is still in 1979</a>.</p>
<p>As I have said in the past, the goal of the semantic web is still a long ways to be realized. However, rather than wait for every website owner to build semantic web conforming website (or retrofit their past content to be semantic web compliant), we should seek to derive web semantics at the application level using a whole new set of applications, web semantic analysis-based applications. Yahoo&#8217;s BOSS can be one of the missing components that pushes the ball forward towards this goal.</p>
<p>Surprisingly, one of the challenges of deriving web semantics is as simple as programmatically identifying and extracting the content from a web page (I have talked about this in a previous post: <a title="A case for standardizing blog templates" href="http://tomchikoore.com/?p=21">A case for standardizing blog templates</a>).  Before semantic analysis can be performed on a web page, the proper content must be extracted fom the web page first. As humans, when we look at a web page, we can readily distinguish the &#8220;main content&#8221; of a web page from  navigation bar, header, links or ads.  This is not so easy for computer programs to accomplish.  At <a title="Filtrbox - Know what you dont know" href="http://www.filtrbox.com">Filtrbox</a>, we have developed algorithms to accomplish this with a very high success rate only because we have devoted time and resources into the algorithms because they are core to our business.  Other application developers wishing to leverage web content semantics may not have the time and resources to build such algorithms because that is not core to their business. This is where Yahoo BOSS comes into the picture. We know that Yahoo has built its massive Web index by indexing the &#8220;main content&#8221; extracted from web pages.  Yahoo has  invested time and resources to solve the content extraction problem. In addition, they have built a massive infrastructure to index and store web content.  Therefore, instead of re-inventing the wheel, developers of applications that leverage web semantics can take advantage of Yahoo&#8217;s content extraction through the Yahoo BOSS API. However, Yahoo needs to open up a little more for this to be possible.</p>
<p>Here is where Yahoo needs to open up: Although Yahoo currently performs content extraction and content indexing, unfortunately the Yahoo BOSS API is not geared towards applications that analyze web data semantics.  The Yahoo BOSS API in its current form is geared towards web searches.  It is keyword query-based and returns at least TITLE, URL and ABSTRACT/EXCERPT.  Unfortunately, to move towards web semantic analysis-based applications, the ABSTRACT/EXCERPT alone is not enough.  Instead, the Yahoo BOSS API should return the WHOLE &#8220;main content&#8221; (not links,ads and navigation etc) of a web page.  Returning the whole content enables applications to perform semantic analysis on the data from millions of web pages that is stored in Yahoo&#8217;s web index, thereby adding value to the data and moving the ball forward towards unlocking the hidden value in web data using web semantic analysis-based applications.</p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/09/26/yahoo-boss-the-answer-to-web-semantic-analysis-based-applications/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Filtrbox vs. RSS readers/aggregators</title>
		<link>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/</link>
		<comments>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/#comments</comments>
		<pubDate>Tue, 23 Sep 2008 19:28:55 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[REST]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=39</guid>
		<description><![CDATA[(NOTE: This article was originally published at my old blog) One of the questions that I am often asked is how Filtrbox is different from traditional RSS readers and aggregators.  The following are the major differences:   Closed Search Domain vs. Open Search Domain When using traditional RSS aggregators, the user supplies the list of [...]]]></description>
			<content:encoded><![CDATA[<p><span style="font-family: Arial;"><span style="font-size: small;">(NOTE: This article was originally published at my old blog)</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">One of the questions that I am often asked is how Filtrbox is different from traditional RSS readers and aggregators.<span>  </span>The following are the major differences:</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Closed Search Domain vs. Open Search Domain</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">When using traditional RSS aggregators, the user supplies the list of RSS feeds. This means that the domain of information gathered by a traditional RSS reader/aggregator is limited to the RSS feeds that are known to the user.<span>  </span>I call this a closed search domain. However, in an environment such the one we have today where thousands of new content sources are being created on a daily basis and anyone can potentially become a publisher, it is unrealistic to put the burden on the user to keep up with the thousands of new content sources that are sprouting up each day.<span>  </span>Filtrbox takes this burdensome responsibility away from the user and discovers the new content sources for the user because Filtrbox’s search domain covers all the new content sources. I call this an open search domain. The user can also add RSS feeds to the search domain, thereby guaranteeing that their RSS feeds of interest are searched.<span> </span>This approach leads to the user discovering new content sources.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Publisher centric vs. Content centric</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">Traditional RSS readers/aggregators present to the user all the content that is published by a specific publisher regardless of whether the user is interested in the content or not. Thus, the traditional RSS readers/aggregators implement a publisher centric information consumption model. On the other hand, Filtrbox implements a content centric information consumption model.<span>  </span>Rather than deliver to the user all the content published by a specific publisher, whether its relevant or not, Filtrbox allows the user to filter for the content that they are interested in from ANY publisher by providing contextual keywords. The content centric model implemented by Filtrbox greatly reduces information overload because each piece of content is examined and filtered for contextual relevance before it is delivered to the user.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>No filtering vs. Contextual relevance filtering</strong></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">As indicated above, traditional RSS aggregators do not filter the content.<span>  </span>All content published by a publisher in the user’s closed search domain is delivered to the user regardless of whether it is relevant or not.<span>  </span>Filtrbox applies algorithms that filter content from an open search domain of publishers for contextual relevance.<span>  </span>Filtrbox uses multiple factors to determine the contextual relevance of content and assigns a score called FiltrRank.<span>  </span>The most important feature of the algorithm is that the contextual relevance algorithm learns from a Filtrbox user’s implicit interests and applies the implicit interest to future contextual relevance filtering. This means that the content delivered to the user is content that that specific user is interested in and not content other people are interested in.<span>  </span>Contextual relevance filtering plays a large part in the reduction of information overload.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"><strong>Beyond RSS</strong></span></span></p>
<div class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">Unlike traditional RSS readers/aggregators, Filtrbox consumes content delivery formats beyond RSS. Filtrbox is capable of consuming both standard and proprietary content delivery formats.</span></span></div>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/09/23/filtrbox-vs-rss-readersaggregators/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Filtrbox is hiring</title>
		<link>http://tomchikoore.com/2008/07/30/filtrbox-is-hiring-3/</link>
		<comments>http://tomchikoore.com/2008/07/30/filtrbox-is-hiring-3/#comments</comments>
		<pubDate>Wed, 30 Jul 2008 20:20:46 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[jobs]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[filtrbox]]></category>

		<guid isPermaLink="false">http://tomchikoore.com/?p=31</guid>
		<description><![CDATA[Java + LAMP Developer Join a dynamic, growing software company in Boulder, Colorado Basic requirements are: * Solid experience with Java and LAMP * System administration skills (a plus) * Working knowledge of Information Retrieval and/or NLP (a plus) * Must be energetic, motivated and creative Please send your resume to TOM AT FILTRBOX DOT [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Java + LAMP Developer</strong></p>
<p>Join a dynamic, growing software company in Boulder, Colorado<br />
Basic requirements are:</p>
<p>* Solid experience with Java and LAMP<br />
* System administration skills (a plus)<br />
* Working knowledge of Information Retrieval and/or NLP (a plus)<br />
* Must be energetic, motivated and creative</p>
<p>Please send your resume to TOM AT FILTRBOX DOT COM</p>
<p><em><strong>NOTE: Prima Donna, high maintenance Rockstar developers, please do NOT bother sending your resumes!!!!</strong></em></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/07/30/filtrbox-is-hiring-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Boulder city services Radiohead-style</title>
		<link>http://tomchikoore.com/2008/06/10/boudler-city-services-radiohead-style/</link>
		<comments>http://tomchikoore.com/2008/06/10/boudler-city-services-radiohead-style/#comments</comments>
		<pubDate>Tue, 10 Jun 2008 19:55:31 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.wordpress.com/?p=32</guid>
		<description><![CDATA[This is a pic of the new &#8220;Hop 2 Chautauqua&#8221; route map that I took this morning at the bus stop on Pearl and 23rd.]]></description>
			<content:encoded><![CDATA[<p>This is a pic of the new &#8220;Hop 2 Chautauqua&#8221; route map that I took this morning at the bus stop on Pearl and 23rd.</p>
<p><a href="http://tomchikoore.files.wordpress.com/2008/06/boulderhop.jpg"><img class="alignnone size-full wp-image-33" src="http://tomchikoore.files.wordpress.com/2008/06/boulderhop.jpg" alt="Radiohead-style bus fare" width="455" height="364" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/06/10/boudler-city-services-radiohead-style/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>2008 Web Search is still in 1979</title>
		<link>http://tomchikoore.com/2008/04/28/2008-web-search-is-still-in-1979/</link>
		<comments>http://tomchikoore.com/2008/04/28/2008-web-search-is-still-in-1979/#comments</comments>
		<pubDate>Mon, 28 Apr 2008 00:51:49 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Enterprise software]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[Software Engineering]]></category>
		<category><![CDATA[Startups]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.wordpress.com/?p=31</guid>
		<description><![CDATA[  On Thursday (04/24/2008 ) last week, I had the privilege of talking to Dr. Jim Martin’s Natural Language Processing (NLP) graduate class, at the University of Colorado at Boulder, about the work that we are  doing at Filtrbox and the role that current NLP students will play in the future of information technology.  This [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">On Thursday (04/24/2008 ) last week, I had the privilege of talking to Dr. Jim Martin’s Natural Language Processing (NLP) graduate class, at the University of Colorado at Boulder, about the work that we are<span>  </span>doing at Filtrbox and the role that current NLP students will play in the future of information technology.<span>  </span>This blog post is the basis of my message to the class.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">As I have written <a title="Unstructured thoughts for unstructured data" href="http://tomchikoore.wordpress.com/2008/02/29/nlp-unstructured-thoughts-for-unstructured-data/" target="_blank">before</a>, the problem that we face today is how to harness the data that is available on the web so that we can apply meaningful interpretation to it using applications.<span>  </span>This problem is rooted in the assumption that the data that is stored on the web is “unstructured”.<span>  </span>Unlike the majority of the data processed by applications today which is stored in some form of a structure e.g. a relational database, the data on the web is not so, as its is perceived as discrete pieces of data scattered all over the web.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">I told the class that part of what I am doing at Filtrbox is an attempt to prove that the data on the web is not as “unstructured” as we may think today.<span>  </span>Within that data, there is a lot of structure, relationship and general interconnectedness no matter how “discrete” we may think it is.<span>  </span>With effective mining of the data and good applications, we can apply interpretation to the data and produce meaningful information.<span>  </span>However, we are still far from applications that can apply effective interpretive meaning on this data.<span>  </span>The reason for this is that we have to address the problem of information retrieval (IR) first before we can get to the writing of applications.<span>  </span></span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;"></span></span> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">To recognize where we are today on the continuum of web data information retreival and applications; a look at the evolution of enterprise applications gives us a great analogy:</span></span></p>
<p class="MsoNormal" style="margin:0;"><span style="font-size: small;"><span style="font-family: Arial;">Enterprise</span><span style="font-family: Arial;"> applications are where they are today primarily because they have a structured data storage model (Relational Database or RDB) and a standard access model (Structured Query Language or SQL).<span>  </span>Before there were enterprise applications that we know today, there were only RDBs and SQL. <span> </span>While RDB work dates back to the 1960s, the RDBs that the majority is familiar with today had their beginnings in the 1970s. <span> </span>The first (or widely believed to be) commercially available implementation of RDB+SQL was Oracle, then known as Relational Software, in 1979. This provided the ability to query an RDB for data using SQL but no applications as we know them today.<span>  </span>Analogizing this with the web, this is where we are today. We can go on Google or our favorite RSS readers (RDB analogy) and query for web data using a weak REST API or search form (SQL analogy) but we have no applications comparative to what is in enterprise today to interpret that data.<span>  </span>So simply put, today we are where enterprise applications were in 1979.</span></span></p>
<p class="MsoNormal" style="margin:0;"> </p>
<p class="MsoNormal" style="margin:0;"><span style="font-family: Arial;"><span style="font-size: small;">My message to the class was that applications like Filtrbox are starting to barely scratch the surface with respect to the implementing of applications on top of web data.<span>  </span>That is because, although its 2008, we are still in 1979.<span>  </span>The stumbling block is the perception of the “unstructured” nature of web data. Today’s NLP students will play a large role tomorrow in identifying and establishing structure in the “unstructured” web data in order to move us beyond 1979.</span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/04/28/2008-web-search-is-still-in-1979/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>TechStars notes in the raw #2</title>
		<link>http://tomchikoore.com/2008/03/05/techstars-notes-in-the-raw-2/</link>
		<comments>http://tomchikoore.com/2008/03/05/techstars-notes-in-the-raw-2/#comments</comments>
		<pubDate>Wed, 05 Mar 2008 08:52:21 +0000</pubDate>
		<dc:creator>tomchikoore</dc:creator>
				<category><![CDATA[Startups]]></category>
		<category><![CDATA[Techstars]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://tomchikoore.wordpress.com/?p=24</guid>
		<description><![CDATA[(I took copious notes during TechStars 2007. I am opening up my notebook and sharing them with aspiring entrepreneurs.   I am going to serialize my notes on this blog.  These are my RAW notes, so sometimes people spoke too fast or were inaudible but I tried to get the gist of what they were saying. [...]]]></description>
			<content:encoded><![CDATA[<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"><em>(I took copious notes during <a target="_blank" href="http://www.techstars.org/" title="TechStars"><font color="#6c8c37">TechStars </font></a>2007. I am opening up my notebook and sharing them with aspiring entrepreneurs.<span>   </span>I am going to serialize my notes on this blog.<span>  </span>These are my RAW notes, so sometimes people spoke too fast or were inaudible<span> </span>but I tried to get the gist of what they were saying. There is very little editing to these notes.)</em></span></p>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"><em></em></span></p>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">The following questions were addressed during one of the early <a target="_blank" href="http://www.techstars.org/" title="TechStars"><font color="#6c8c37">TechStars </font></a>panels:</span></p>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"></span></p>
<p><span style="font-family:Arial;"><span style="font-family:Arial;"><strong>1) What kills most startups?</strong></span></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></span><span style="font-family:Arial;"></p>
<ul>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Surprise!! Surprise!! Not making money is not usually the big issue that causes failures unless you don’t have a vision</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Team dynamic issues – startup failures are mostly caused by founding team friction</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Companies fail due to execution failures. Execution failures are still team issues that can be categorized as follows:</span> </div>
</li>
</ul>
<ol>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"><span> </span>Team dysfunction issues</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"><span> </span>Team poor performance issues</span></div>
</li>
</ol>
<blockquote><p><span style="font-family:Arial;">1 and 2. are a “chicken and egg” situation</span></p></blockquote>
<ul>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Do not be afraid to address team issues head on, solve them and remove the problem</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Once you have a team issue problem that threatens your startup, re-adjust what you are doing or join another team <em>(None of the original TechStars team members changed teams)</em></span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Beware of meandering, where after several weeks you are not getting anywhere. Address and re-adjust immediately because you risk team members losing passion because you are not getting anywhere</span></div>
</li>
</ul>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"></span></p>
<p><span style="font-family:Arial;"> <strong>2) </strong></span><span style="font-family:Arial;"><strong> “Getting acquired” as a business model</strong> </span></p>
<ul>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Getting acquired is not a business model.<span>  </span>It’s a WISH!!!</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Concentrate on building a business that has compelling value</span></div>
</li>
</ul>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"></span></p>
<p><strong><span style="font-family:Arial;"> 3) T</span><span style="font-family:Arial;">he “style” of a startup</span></strong></p>
<ul>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">You have the permission to create your own identity</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Have an attitude</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Have a style</span></div>
</li>
<li>
<div style="margin:0;" class="MsoNormal"><span style="font-family:Arial;">Develop a style for your startup and yourself and work it <em>(America’s Next Top Startup, anyone???)</em> all the way through </span></div>
</li>
</ul>
<p style="margin:0;" class="MsoNormal"><span style="font-family:Arial;"> <span style="font-family:Arial;">MORE TO COME …..</span></span></p>
<p></span></p>
]]></content:encoded>
			<wfw:commentRss>http://tomchikoore.com/2008/03/05/techstars-notes-in-the-raw-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

