<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Commercial Intelligence &#187; Ontology</title>
	<atom:link href="http://haleyai.com/wordpress/category/ontology/feed/" rel="self" type="application/rss+xml" />
	<link>http://haleyai.com/wordpress</link>
	<description>systems that know and understand and think and learn</description>
	<lastBuildDate>Mon, 30 Jan 2012 19:51:53 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Event-centric BPM and goal-driven processing</title>
		<link>http://haleyai.com/wordpress/2011/11/07/event-centric-bpm-and-goal-driven-processing/</link>
		<comments>http://haleyai.com/wordpress/2011/11/07/event-centric-bpm-and-goal-driven-processing/#comments</comments>
		<pubDate>Mon, 07 Nov 2011 16:14:42 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Business Process Management]]></category>
		<category><![CDATA[Complex Event Processing]]></category>
		<category><![CDATA[Compliant Governance]]></category>
		<category><![CDATA[Formal Logic]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Be Informed]]></category>
		<category><![CDATA[BPM]]></category>
		<category><![CDATA[BPMN]]></category>
		<category><![CDATA[BRF]]></category>
		<category><![CDATA[business rules forum]]></category>
		<category><![CDATA[CEP]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[event ontology]]></category>
		<category><![CDATA[event processing]]></category>
		<category><![CDATA[event semantics]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[goal-driven processing]]></category>
		<category><![CDATA[governance]]></category>
		<category><![CDATA[GRC]]></category>
		<category><![CDATA[Jan Verbeek]]></category>
		<category><![CDATA[legal reasoning]]></category>
		<category><![CDATA[logic programming]]></category>
		<category><![CDATA[Mills Davis]]></category>
		<category><![CDATA[OWL]]></category>
		<category><![CDATA[SBVR]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[taxation]]></category>
		<category><![CDATA[theorem proviing]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=353</guid>
		<description><![CDATA[The slides for my Business Rules Forum presentation on event semantics and focusing on events in order to simplify process definition and to facilitate more robust governance and compliance are at Event-centric BPM.
After the talk I spoke with Jan Verbeek and Gartjan Grijzen of Be Informed and reviewed their software, which is excellent.  They have [...]]]></description>
			<content:encoded><![CDATA[<p>The slides for my Business Rules Forum presentation on event semantics and focusing on events in order to simplify process definition and to facilitate more robust governance and compliance are at <a title="Event-centric BPM" href="http://www.haleyai.com/documents/Event-centric BPM - BRF 2011.pptx">Event-centric BPM</a>.</p>
<p>After the talk I spoke with <a title="Jan Verbeek, Be Informed" href="http://www.beinformed.com/BeInformed/website/en/EN/WhoWeAre?init=true" target="_blank">Jan Verbeek</a> and Gartjan Grijzen of <a title="Be Informed" href="http://www.beinformed.com/BeInformed/website/en/EN/BusinessProcessPlatform?init=true" target="_blank">Be Informed</a> and reviewed their software, which is excellent.  They have been quite successful with various government agencies in applying  the event-centric methodology to produce goal-driven processing.  Their approach is elegant and effective.  It clearly demonstrates the merits of an event-centric approach and the power that emerges from understanding event-dependencies.  Also, it is very semantic, ontological, and logic-programming oriented in its approach (e.g., they use OWL and a backward-chaining inference engine).</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="640" height="360" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://www.youtube.com/v/JNeT0ob7K2c&amp;hl=en_US&amp;feature=player_embedded&amp;version=3" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="640" height="360" src="http://www.youtube.com/v/JNeT0ob7K2c&amp;hl=en_US&amp;feature=player_embedded&amp;version=3" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p>They do not have the top-down knowledge management approach that I advocate nor do they provide the logical verification of governing policies and compliance (i.e., using theorem provers) that I mention in the talk (see <a title="Guido Governatori" href="http://www.governatori.net/research/" target="_blank">Guido Governatori</a>&#8217;s 2010 publications and <a title="Travis Breaux" href="http://www.cs.cmu.edu/~breaux/" target="_blank">Travis Breaux</a>&#8217;s research at CMU, for example) but theirs is the best commercially deployed work in separating business process description from procedural implementation that comes to mind. (Note that <a title="Ed Barkmeyer" href="http://www.nist.gov/el/msid/infotest/ebarkmeyer.cfm" target="_blank">Ed Barkmeyer</a> of NIST reports some use of SBVR descriptions of manufacturing processes with theorem provers.  Some in automotive and aerospace industries have been interested in this approach for quality purposes, too.)</p>
<p>BeInformed is now expanding into the United States with the assistance of <a title="Mills Davis" href="http://project10x.com/mills.php" target="_blank">Mills Davis</a> and others.  Their software is definitely worth consideration and, in my opinion, is more elegant and effective than the generic BPMN approach.</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2011/11/07/event-centric-bpm-and-goal-driven-processing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Simple problems with the semantic web</title>
		<link>http://haleyai.com/wordpress/2011/10/10/simple-problems-with-the-semantic-web/</link>
		<comments>http://haleyai.com/wordpress/2011/10/10/simple-problems-with-the-semantic-web/#comments</comments>
		<pubDate>Mon, 10 Oct 2011 21:30:40 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Formal Logic]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[defeasibility]]></category>
		<category><![CDATA[description logic]]></category>
		<category><![CDATA[DL]]></category>
		<category><![CDATA[expert systems]]></category>
		<category><![CDATA[first order logic]]></category>
		<category><![CDATA[FOL]]></category>
		<category><![CDATA[frame-based systems]]></category>
		<category><![CDATA[OWL]]></category>
		<category><![CDATA[Protege]]></category>
		<category><![CDATA[reasoner]]></category>
		<category><![CDATA[SILK]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=340</guid>
		<description><![CDATA[The standard for defining ontologies these days is OWL and Protege.  Unfortunately, OWL lacks any notion of exceptions in inheritance or any other notion of defeasibility.
So, although you may want to say that birds fly, you&#8217;re ontology will be broken (or become much more complicated) when you realize there are birds that can&#8217;t fly, such [...]]]></description>
			<content:encoded><![CDATA[<p>The standard for defining ontologies these days is OWL and Protege.  Unfortunately, OWL lacks any notion of exceptions in inheritance or any other notion of defeasibility.</p>
<p>So, although you may want to say that birds fly, you&#8217;re ontology will be broken (or become much more complicated) when you realize there are birds that can&#8217;t fly, such as penguins or ostriches, or even sick or injured birds.</p>
<p>Practically speaking, you need something like courteous logic or the defeasibility in SILK to handle this (or any 1980s expert system shell or even earlier frame system).  OWL is very hard on mortal man (e.g., mainstream IT) in this regard.</p>
<p>How can I tell OWL that a pronoun is a noun but that pronouns are a closed class of words, unlike nouns, verbs, adjectives, and adverbs (in general).  Well, I&#8217;ll have to tell it about open-class nouns versus closed class nouns.  What a pain!</p>
<p>This is why we use Protege primarily as a drafting tool and, for example, SILK, to do reasoning.   Non-defeasible description logic and first-order reasoners are difficult to get along with, in practice (and make sustainable knowledge repositories too difficult &#8211; which inhibits adoption, obviously).</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2011/10/10/simple-problems-with-the-semantic-web/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tendencies and purpose matter</title>
		<link>http://haleyai.com/wordpress/2011/02/07/tendencies-and-purpose-matter/</link>
		<comments>http://haleyai.com/wordpress/2011/02/07/tendencies-and-purpose-matter/#comments</comments>
		<pubDate>Mon, 07 Feb 2011 20:16:28 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Business Process Management]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Predictive Analytics]]></category>
		<category><![CDATA[basic formal ontology]]></category>
		<category><![CDATA[BPM]]></category>
		<category><![CDATA[business motivation model]]></category>
		<category><![CDATA[CEP]]></category>
		<category><![CDATA[disposition]]></category>
		<category><![CDATA[function]]></category>
		<category><![CDATA[OMG BMM]]></category>
		<category><![CDATA[optimization]]></category>
		<category><![CDATA[performance management]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=318</guid>
		<description><![CDATA[The basic formal ontology (BFO) offers a simple, elegant process model.   It adds alethic and teleological semantics to the more procedural models, among which I would include NIST&#8217;s process specification language (PSL) along with BPMN.
Although alethic typically refers to necessary vs. possible, it clearly subsumes the probable or expected (albeit excluding deontics0).  For example, consider [...]]]></description>
			<content:encoded><![CDATA[<p>The basic formal ontology (<a title="Basic Formal Ontology" href="http://www.ifomis.org/bfo" target="_blank">BFO</a>) offers a simple, elegant process model.   It adds <a title="alethic logic" href="http://en.wikipedia.org/wiki/Alethic_logic#Alethic_logic" target="_self">alethic </a>and <a title="teleology" href="http://en.wikipedia.org/wiki/Teleology" target="_self">teleological </a>semantics to the more procedural models, among which I would include NIST&#8217;s process specification language (<a title="Process Specification Language" href="http://www.mel.nist.gov/psl/" target="_blank">PSL</a>) along with BPMN.</p>
<p>Although alethic typically refers to necessary vs. possible, it clearly subsumes the probable or expected (albeit excluding <a title="deontic logic" href="http://en.wikipedia.org/wiki/Deontic_logic" target="_self">deontics</a><sup>0</sup>).  For example, consider the notion of &#8216;disposition&#8217; (shown below as rendered in <a title="Protege ontology editor" href="http://protege.stanford.edu/" target="_blank">Protege</a>):</p>
<p style="text-align: center;">
<p style="text-align: center;"><a href="http://haleyai.com/wordpress/wp-content/uploads/2011/02/BFOdisposition.jpg"><img class="aligncenter size-full wp-image-319" title="Disposition in the Basic Formal Ontology" src="http://haleyai.com/wordpress/wp-content/uploads/2011/02/BFOdisposition.jpg" alt="BFO's concept of 'disposition'" width="778" height="262" /></a></p>
<p>For example, cells might be disposed to undergo the cell cycle, which consists of interphase, mitosis, and cytokinesis.  Iron is disposed to rust.  Certain customers might be disposed to comment, complain, or inquire.</p>
<p>Disposition is nice because it reflects things that have an unexpected high probability of occurring<sup>1</sup> but that may not be a necessary part of a process.   It seems, however, that disposition is lacking from most business process models.  It is prevalent in the soft and hard sciences, though.  And it is important in medicine.</p>
<p>Disposition is distinct from what should occur or be attempted next in a process.  Just because something is disposed to happen does not mean that it should or will.  Although disposition is clearly related to business events and processes, it seems surprisingly lacking from business models (and CEP/BPM tooling).<sup>2</sup></p>
<p>A teleological aspect of BFO is the notion of purpose or intended &#8216;function&#8217;, as shown below:</p>
<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2011/02/BFOfunction.jpg"><img class="aligncenter size-full wp-image-320" title="Function according to the Basic Formal Ontology" src="http://haleyai.com/wordpress/wp-content/uploads/2011/02/BFOfunction.jpg" alt="Function according to the Basic Formal Ontology" width="778" height="263" /></a>Function is about what something is expected to do or what it is for.  For example, what is the function of an actuary?  Representing such functionality of individuals or departments within enterprises may be atypical today, but is clearly relevant to skills-based routing, human resource optimization and business modeling in general.</p>
<p>Understanding disposition and function is clearly relevant to business modeling (including organizational structure), planning and performance optimization.    Without an understanding of disposition, anticipation and foresight will be lacking.  Without an understanding of function, measurement, reporting, and performance improvement will be lacking.</p>
<hr /><sup>0</sup> SBVR does a nice job with alethic and deontic augmentation of first order logic (i.e., positive and negative necessity, possibility, permission, and preference).</p>
<p><sup>1</sup> Thanks to BG for &#8220;politicians are disposed to corruption&#8221; which indicates a population that is more likely than a larger population to be involved in certain situations.</p>
<p><sup>2</sup> Cyc&#8217;s notion of &#8216;disposition&#8217; or &#8216;tendency&#8217; is focused on properties rather than probabilities, as in the following citation from <a title="OpenCyc" href="http://www.cyc.com/cyc/opencyc" target="_blank">OpenCyc</a>.  Such a notion is similarly lacking from most business models, probably because its utility requires more significant reasoning and business intelligence than is common within enterprises. <a title="OpenCyc" href="http://www.cyc.com/cyc/opencyc" target="_blank"></a></p>
<p style="padding-left: 30px;">The collection of all the different quantities of dispositional properties; e.g. a particular degree of thermal conductivity. The various specializations of this collection are the collections of all the degrees of a particular dispositional property. For example, ThermalConductivity is a specialization of this collection and its instances are usually denoted with the generic value functions as in (HighAmountFn ThermalConductivity).</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2011/02/07/tendencies-and-purpose-matter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is Freebase worth much?</title>
		<link>http://haleyai.com/wordpress/2010/02/02/is-freebase-worth-much/</link>
		<comments>http://haleyai.com/wordpress/2010/02/02/is-freebase-worth-much/#comments</comments>
		<pubDate>Tue, 02 Feb 2010 14:23:04 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Business planning]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social Networks]]></category>
		<category><![CDATA[advertising network]]></category>
		<category><![CDATA[API]]></category>
		<category><![CDATA[attribution]]></category>
		<category><![CDATA[Bing]]></category>
		<category><![CDATA[business model]]></category>
		<category><![CDATA[business plan]]></category>
		<category><![CDATA[capitalism]]></category>
		<category><![CDATA[cookies]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[DART]]></category>
		<category><![CDATA[DoubleClick]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[Freebase]]></category>
		<category><![CDATA[Goldman Sachs]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[Metaweb]]></category>
		<category><![CDATA[Powerset]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[question answering]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[social network]]></category>
		<category><![CDATA[True Knowledge]]></category>
		<category><![CDATA[venture capital]]></category>
		<category><![CDATA[Vulcan]]></category>
		<category><![CDATA[Wikipedia]]></category>
		<category><![CDATA[Wolfram]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=201</guid>
		<description><![CDATA[There has been some speculation that Freebase is a vehicle for Metaweb to prosper from its semantic web infrastructure when used for commercial purposes.  As I recall, Metaweb raised over $40 million in Series B around the time they started building Freebase. The investment was led by Goldman Sachs.  Metaweb’s seasoned investors were unlikely to [...]]]></description>
			<content:encoded><![CDATA[<p>There has been some <a href="http://jhingran.typepad.com/anant_jhingrans_musings/2007/09/freebase.html" target="_blank">speculation</a> that Freebase is a vehicle for Metaweb to prosper from its semantic web infrastructure when used for commercial purposes.  As I recall, Metaweb raised over $40 million in Series B around the time they started building Freebase. The investment was led by Goldman Sachs.  Metaweb’s seasoned investors were unlikely to invest so much in a business that cannot project a return on that investment.  Almost certainly, Metaweb has firm plans for realizing over $100 million in revenues.  Most likely, for these investors and the amount of capital, target revenues by 2014, five years after the second round, would be in the vicinity of $1 billion.  Obviously, there is a lot of work to get there from around zero today.</p>
<p>Some of the bubble in raising those funds has burst.  The economy would crimp the valuation and investment if made today.  And the semantic web has yet to produce a winner, so with less enthusiasm, the investment would again be less favorable today.  All this is modulo the business plan.  If the business plan withstands scrutiny and the rate of return from credibly achievable projections justifies investment, they could get the money again, even now.  But no one that I have heard or read over the past few years can explain the business plan adequately – that is, concretely.  I would appreciate any insights or opinions on the topic.  I believe these are smart people, in the company and among its investors, so I am sure it is there.  I just don’t believe in the “we’ll figure out how to make money eventually” business plan in this case. </p>
<p>Some Freebase terms that are worth knowing but are commercially reasonable for any site that provides a free service include:</p>
<ol>
<li>The terms of service are subject to change (upon posting).</li>
<li>The service may be changed or discontinued at any time and without notice.</li>
<li>Limits concerning access to or use of the services may be established.</li>
<li>Any disputes shall be heard in San Francisco and governed by California law.</li>
</ol>
<p><span id="more-201"></span>Such terms would stop me from basing a business on Freebase without a written agreement with Metaweb.  Other Freebase terms are interesting, and potentially controversial:</p>
<ol>
<li>Advertising is explicitly permitted.</li>
<li>Content contributed may not be stored or displayed and may be deleted.</li>
<li>Users and content providers indemnify Metaweb against any claims arising out of use.</li>
</ol>
<p>I have not noticed any advertising on Freebase, but it is permitted.  I really don’t care if there is advertising.  My concern would be more with the utility of the site and service, especially the API.  Others certainly feel otherwise, however.  Of course, advertising interacts with privacy, which concerns me, but perhaps not enough to preclude my doing business with Metaweb.</p>
<p>Indemnifying Metaweb rather than simply holding them harmless is a over the top.  Indemnification might make more sense for content contributors, but this policy applies even to users who do not contribute content.</p>
<p>Failing to store content that is in conformance with all terms of the agreements is not a necessary component of a free service.  This effectively precludes Freebase from providing a free RDF store, as Google and so many others do for limited amounts of data, whether files or email, for example.  I don’t fault this policy but it seems somewhat arbitrary and perhaps misguided (versus limitations on the volume of conforming content that is not displayed that will be stored).  Such a limitation probably interferes with some applications that would otherwise be developed using Freebase.</p>
<p>The following terms, aside from the possibility of advertising, may give some visibility to monetization:</p>
<ol>
<li>Registration, including an email address and “limited information about yourself” may be required.</li>
<li>Volume is currently limited to 100,000 API calls per day.
<ol>
<li>Is the API the primary monetization vehicle?</li>
<li>The API may be used only “as outlined in the documentation… on the site”.</li>
</ol>
</li>
</ol>
<p>As I read the privacy policy, Metaweb claims that IP addresses are “recorded for security and monitoring purposes” but does not preclude the sharing of personally identifiable information with third parties.  In addition, Metaweb uses cookies and links to third party web sites which may also introduce cookies, as in the Google/DoubleClick advertising network, which uses the DART cookie, for example.  Generally, I am uncomfortable with the Metaweb privacy policy in that it does not say what they will not do to my satisfaction.</p>
<p>Starting from the data, and with its social aspects, Freebase is headed for the intersection of Facebook and Wikipedia.  Commercial incentive and social gravity will propel Freebase content far beyond its foundation in Wikipedia attribution.  And the attraction of social content and data will concretely and economically motivate the Freebase community to grow beyond “meaningless” Facebook.  Will every student need a Freebase account?</p>
<p>I believe Freebase can get there and that the advertising network revenues and downstream user access channels will be phenomenal.  Imagine the questions that will be answer-able using Freebase data.  Metaweb is probably rubbing hands together as Wolfram, IBM, Vulcan and others work on question answering.  Or maybe they are working on it, too, with some of that $42.5 million….</p>
<p>If Freebase introduces a True Knowledge or a better Bing/Powerset, Google will not want to link to Freebase unless (for example) Freebase uses Google’s DoubleClick advertising network.  There is either an acquisition, deep collaboration, or a war looming as Freebase builds momentum and moves into search and question answering, if not simply query (once the user authored content is several times Wikipedia).</p>
<p>Google could be in trouble… It’s probably too late (or too embarrassing?) to follow Metaweb’s lead.  And if so, Freebase is worth a great deal.</p>
<p>If this was indeed the business plan, I am tremendously impressed.  Having the insight to set out on such a path &#8211; realizing that answers lie more explicitly within data than text and that social networking and attribution with commercial resources would overtake Wikipedia and build an extremely large community rooted deeply in data, discussion, interests, research, query, and more – with such long term vision and communicating it clearly and compelling enough to attract long term investors who grasped the then abstract but executable plan would be a tremendous achievement of capitalism.</p>
<p><em>This post was motivated by work on a retail e-commerce opportunity that may commoditize product information across search and comparison shopping engines, for example.  The challenge I am considering is how to realize value from semantic web content that needs to be licensed for broad use but the investment in which needs to be protected both in terms of integrity (i.e., as a standard) and from commoditization by competitors or lack of copyright enforcability.  Freebase licensing of content and APIs are of particular interest but not worth pursuing further here given the title of the article.  </em></p>
<ul>
<li>However, if you have thoughts on Freebase&#8217;s business model or on how to provide value to the semantic web without losing the invested capital, please share by email (<a href="mailto:paul@haleyai.com">mailto:paul@haleyai.com</a>) or comment here. </li>
<li>For those interested, Scott Brinker&#8217;s <a href="http://www.chiefmartec.com/" target="_blank">blog </a>has interesting discussion but lacks examples or specifity with regard to commercial business models.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2010/02/02/is-freebase-worth-much/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google follows Microsoft&#8217;s lead towards intelligence</title>
		<link>http://haleyai.com/wordpress/2010/01/26/google-follows-microsofts-lead-towards-intelligence/</link>
		<comments>http://haleyai.com/wordpress/2010/01/26/google-follows-microsofts-lead-towards-intelligence/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 13:59:00 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Natural Language]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[AI Magazine]]></category>
		<category><![CDATA[Bing]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Jeopardy]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[natural language generation]]></category>
		<category><![CDATA[Nova Spivak]]></category>
		<category><![CDATA[Powerset]]></category>
		<category><![CDATA[Project Halo]]></category>
		<category><![CDATA[question answering]]></category>
		<category><![CDATA[Rich Snippets]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[the singularity]]></category>
		<category><![CDATA[True Knowledge]]></category>
		<category><![CDATA[Vulcan]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=168</guid>
		<description><![CDATA[Being a fan of increased intelligence on the web, including Bing&#8217;s use of Powerset and True Knowledge, I enjoyed cnet&#8217;s report, &#8220;Google search gets answer highlights and events.&#8221;
Google now shows the following &#8220;The Empire State Building rises to 1250 ft (381 m) at the 102nd floor&#8221; in response to the classic semantic web test question.
Also, Google [...]]]></description>
			<content:encoded><![CDATA[<p>Being a fan of increased intelligence on the web, including Bing&#8217;s use of Powerset and True Knowledge, I enjoyed <a title="Google search gets answer highlights and events" href="http://news.cnet.com/8301-27076_3-10439712-248.html" target="_blank">cnet&#8217;s report</a>, &#8220;Google search gets answer highlights and events.&#8221;</p>
<p>Google now shows the following &#8220;<strong>The Empire State Building rises to 1250 ft (381 m) at the 102nd floor</strong>&#8221; in response to the classic semantic web test q<a href="http://haleyai.com/wordpress/wp-content/uploads/2010/01/Google-Empire-State-Building.jpg"></a>uestion.</p>
<p>Also, Google leverages more of the content of text or structure of linked data in its Rich Snippet answers:</p>
<p><img class="alignnone" title="Rich Snippet shows Google &quot;understands&quot; events" src="http://i.i.com.com/cnwk.1d/i/bto/20100122/concerts-google-dates-newstuff.png" alt="Rich Snippet shows Google &quot;understands&quot; events" width="571" height="141" /></p>
<p>As search engines increase their understanding of concepts and how to extract them from content or linked data and present them as Google does here or above in a sentence, the web will begin to feel a lot smarter. </p>
<p>As these simple enhancements indicate, the intelligent web is taking off and that feeling of intelligence will come sooner than expected.  Of course, there is a long way to go.   For more on that, I here there is an upcoming issue of AI Magazine that will survey the state of the art in question answering, including coverage of Vulcan&#8217;s Project Halo and IBM&#8217;s Jeopardy effort, among others.  Also, if you are interested in what bright minds are looking forward to in this regard, see Nova Spivak&#8217;s recent blogging and <a title="will the web become conscious" href="http://www.novaspivack.com/uncategorized/will-the-web-become-conscious" target="_blank">his post </a>on &#8220;will the web become conscious?&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2010/01/26/google-follows-microsofts-lead-towards-intelligence/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Extended Enterprise Ontology</title>
		<link>http://haleyai.com/wordpress/2010/01/19/extended-enterprise-ontology/</link>
		<comments>http://haleyai.com/wordpress/2010/01/19/extended-enterprise-ontology/#comments</comments>
		<pubDate>Tue, 19 Jan 2010 19:04:36 +0000</pubDate>
		<dc:creator>pvhaley</dc:creator>
				<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social Networks]]></category>
		<category><![CDATA[administrator]]></category>
		<category><![CDATA[bookmark]]></category>
		<category><![CDATA[bureaucrat]]></category>
		<category><![CDATA[change management]]></category>
		<category><![CDATA[citizen]]></category>
		<category><![CDATA[collaboration]]></category>
		<category><![CDATA[CRM]]></category>
		<category><![CDATA[democracy]]></category>
		<category><![CDATA[Drupal]]></category>
		<category><![CDATA[entity analytics]]></category>
		<category><![CDATA[Facebook]]></category>
		<category><![CDATA[FOAF]]></category>
		<category><![CDATA[inheritance]]></category>
		<category><![CDATA[Knoodl]]></category>
		<category><![CDATA[Linked Open Data]]></category>
		<category><![CDATA[LOD]]></category>
		<category><![CDATA[MediaWiki]]></category>
		<category><![CDATA[NeOn]]></category>
		<category><![CDATA[OpenCyc]]></category>
		<category><![CDATA[OWL]]></category>
		<category><![CDATA[permissions]]></category>
		<category><![CDATA[privilege]]></category>
		<category><![CDATA[Protege]]></category>
		<category><![CDATA[rate]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[Revelytix]]></category>
		<category><![CDATA[Semantic MediaWiki]]></category>
		<category><![CDATA[SMW]]></category>
		<category><![CDATA[social network]]></category>
		<category><![CDATA[TopBraid]]></category>
		<category><![CDATA[Twine]]></category>
		<category><![CDATA[UMBEL]]></category>
		<category><![CDATA[vote]]></category>
		<category><![CDATA[Wikipedia]]></category>
		<category><![CDATA[XBRL]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/?p=149</guid>
		<description><![CDATA[In a recent post I mentioned comments by Sir Tim Berners-Lee concerning the overlap between enterprise information models and semantic web ontology supporting the concept of linked data.  Sir Berners-Lee argued that overlap is already sufficient to have a transformative effect on mainstream IT.  I think he is right, but also that we are not [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">In a recent post I mentioned comments by Sir Tim Berners-Lee concerning the overlap between enterprise information models and semantic web ontology supporting the concept of linked data.  Sir Berners-Lee argued that overlap is already sufficient to have a transformative effect on mainstream IT.  I think he is right, but also that we are not there yet.  There are many obstacles to adoption, not the least of which is the inertia of enterprise IT.  Disruptive approaches to software development typically require ten years or so to cross the chasm from visionary and early adopters to the mainstream.  We are only a few years into this and the technology is not ready.</p>
<p>First, let’s establish that there is plenty of semantics available for reuse now.  There are existing models, some of which are well-designed, mature, and widely used.  Unfortunately, most of what exists has little apparent relevance to enterprises.  There is little on this diagram that would draw the attention of an enterprise architect, for example.</p>
<p style="text-align: center;"> <img class="aligncenter" title="Linked open data sets as of Sept 18, 2008" src="http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2008-09-18.png" alt="" width="432" height="334" /></p>
<p><a href="http://www4.wiwiss.fu-berlin.de/bizer/pub/lod-datasets_2008-09-18.html"></a></p>
<p><span id="more-149"></span>Now, if there was an ontology for customer relationship management (CRM) or eXtensible Business Reporting Language (<a href="http://www.xbrl.org/XBRLandBusiness/" target="_blank">XBRL</a>) in this diagram, it would get more attention.  I suggest that semantic web technology will be approaching the enterprise when one of these shows up.</p>
<p>Still, some mature and widely-used ontologies may offer enterprise benefits.  For example, the “friend-of-a-friend” ontology provides the following concepts:</p>
<p>Every enterprise has some of FOAF’s basic elements, like surnames, organizations, and projects.  By linking their enterprise models to FOAF ontology, directly or indirectly, people across the enterprise and its value chain gain visibility to and flexibility in using information that would otherwise be expensive to provide and use.  There is a lot to this that I will not go into here, including its relationship to business intelligence and enterprise performance management (but see <a href="http://www.ibm.com/software/data/entity-analytic-solutions/" target="_blank">entity analytics at IBM</a>, for example).  Suffice it to say that there is meat on this bone.</p>
<p style="text-align: center;"><img class="size-full wp-image-153 aligncenter" title="FOAF spec concepts" src="http://haleyai.com/wordpress/wp-content/uploads/2010/01/FOAF-spec-concepts1.jpg" alt="FOAF spec concepts" width="745" height="242" /></p>
<p>As ontologies go, FOAF is small, at least in terms of its model, which consists of classes and properties.  Generally speaking, the more mature and widely-used ontologies today are small models.  Unfortunately, they are also isolated from one another.  They have been developed by small communities that are substantially independent of one another.  This reality is a symptom of a hurdle that needs to be crossed before semantics will cross the chasm to the mainstream.</p>
<p>Without collaboration, ontologies will remain small and independent.  There will be isolated examples of large ontologies, such as <a href="http://www.cyc.com/opencyc/overview" target="_self">OpenCyc</a> and interesting combinations of multiple ontologies, such as <a href="http://www.umbel.org/" target="_blank">UMBEL</a>, but they will have no uptake until they are managed by a large community that includes enterprise members.</p>
<p>Let’s look at the problem of enterprise modeling head on.  Consider, for example, defining the model underlying Oracle CRM and Salesforce.COM.  Or consider defining an application that produces the XML format now required of public companies by the Securities and Exchange Commission: XBRL.  These ontologies would be huge compared to any that has been widely adopted on the semantic web.  And yet, if such an ontology existed for SEC filings, it would have tremendous corporate relevance, especially to CFOs and in financial analysis at commercial banks and by the capital markets.  As for CRM, you don’t have to go far beyond FOAF to get the benefits of linked data and entity analytics.  So much more functionality would emerge than any vendor can provide or maintain, such as in mash-ups using open technology and linked data, if CRM was ontological.  (Mash-ups demonstrate that creativity is very powerful when unleashed.)</p>
<p>So, unless the vendors agree on an ontology for CRM or unless XBRL shifts from data modeling (e.g., XML schema definitions) to semantics, users are either doomed to redundant investments or trans-enterprise collaborative development of ontology is needed.  (I would not hold my breath on the former!)  I am ignoring a third alternative for the moment, which is that a technology startup or a professional organization gains significant traction or market leadership in providing such an ontology as a product or service offering.  On the other hand, given the size of commercial ontologies such as these, even such a disruptive innovator will need to overcome the collaboration bottleneck with respect to ontology, which is my focus here.</p>
<p>The most promising approach for models that are common across enterprises may be to develop them using a wiki, as in Wikipedia.  Certainly, some collaborative platform that exists “outside the firewall” is necessary.  But we need this platform to avoid the problems of spam and nuisance editors that plague Wikipedia and other completely open platforms.  We want to embrace the web’s lack of authority but we don’t want our platform to be completely egalitarian.  We want the community to direct itself to a collective objective, which is ontology worthy of broad enterprise adoption.</p>
<h3><strong>Ontologies as standards</strong></h3>
<p>If we are successful, completely open standards – the ontologies themselves &#8211; will be defined without committees.  There are challenges to be sure, but it seems possible that such ontological standards are within reach, driven by those who will derive the most benefit: enterprise consumers.  There are things we need from ontology in order for it to be an enterprise worth standard, however.  For one thing, we need version control.  An enterprise cannot use an ontology that changes at the whim of a Wikipedia editor.</p>
<p>Take <a href="http://semantic-mediawiki.org/" target="_blank">Semantic MediaWiki</a>, for example.  If we use SMW as a platform for collaborative ontology and we adopt that ontology in resources that our CRM or financial systems make available as linked data, what happens when someone deletes or renames a concept that we use?</p>
<p>The solution is not as simple as making a copy of the ontology at a moment in time.  Don’t forget that people from around the world are editing at every hour.  Asking the world to stop for a snapshot won’t work.  Nor will asking the world to unanimously agree that every concept in an ontology is right and final, at least for this version.  No, more sophisticated change management is needed.</p>
<p>All this should sound familiar to enterprise architects. We hold our models increasingly tightly as we approach a release.  And then they are fixed in stone until the next release.  We avoid the challenges of collaboration above.  We share at first and gradually take control.  It works.</p>
<p>But this approach won’t work across enterprises and is doomed to redundant investment within the enterprise.  It may be tough to sell across enterprises but there is definite value and ROI if the collaboration can overcome the issues of change management.</p>
<p>Unfortunately, ontology standards and platforms simply don’t address this problem yet.  Chalk this up as another bridge for semantic technology to cross in order to cross the chasm.  Some ideas for crossing that bridge follow.</p>
<h3><strong>Social semantics</strong></h3>
<p>We need the collaborative platform to allow convergence towards a stable ontology.  At the very least, we need to converge on some parts of the ontology.  If the community never “finally” agrees on even a single concept or property a standard will certainly not be born.  Fortunately, social media and change management may combine to address this problem.</p>
<p>Consider, for example, that social media ubiquitously support voting and rating functionality, whether five stars, thumbs up or down, dig it or favorite.  We can use this kind of functionality within the wiki to gauge the quality of content.  We can also take collective opinions on the need for elaboration.  There are other possibilities and details concerning voting and rating.  Among them are important details about aggregate opinions versus individual opinions. </p>
<h3><strong>Wiki communities</strong></h3>
<p>In the context of a wiki supporting enterprises, there might be a taxonomy of organizations, groups, and projects, each of which define a community for which voting or rating might be tracked.  A community is a collection of individuals where those collections are organized taxonomically.  Most wikis have only a single community that anyone can join, including anonymously.  Within this community there might be individuals with different roles who can perform certain operations that typical users may not.  A more general notion might define communities as requiring permission for an individual or community to join.  The privilege to grant membership might be given to those who play an administrative role for the community.  And the level of privilege or permission granted to an individual could use the voting and rating functionality described above. </p>
<h3><strong>Wiki democracy</strong></h3>
<p>As mentioned above, authority is not a natural phenomenon in open societies, such as the web.  Respect may be earned, but authority must be given, not taken.  The obvious technique for giving authority is by voting (possibly including a veto mechanism).  For example, a community may adopt the policy of extending the role of administrator along with its privileges to the top quartile of rated members.  Or, the community might follow a normal nomination and election process.  Or the community might elect a president or board of directors to whom it delegates the assignment of roles to community members.</p>
<h3><strong>Wiki citizens</strong></h3>
<p>Unfortunately, there are people who will be disruptive or cheat in any system.  Unchecked, they will post commercials or vote more than once for or against matters for various reasons.  So, it is important to know who is eligible to vote and ensure that there is a most one vote per citizen and that enough citizens vote on specific matters of governance.</p>
<p>It’s nice to have the template of Democracy so well established in the world.  Let’s use it.</p>
<h3><strong>Freeze-free releases</strong></h3>
<p>Now, with wiki democracy and communities, we have the basis for controlling release cycles.  Communities can configure their democracy, especially with regard to membership and the roles assigned to members.  More specifically, they decide the governance process by which versions are gradually stabilized.  They do this by assigning privileges or permissions to individuals, roles, or sub-communities.  These privileges or permissions allow them to “freeze” parts of the ontology.  But we cannot really allow the wiki (or its ontology) to be frozen.  Rather, we mark particular versions of parts of the ontology as being frozen with respect to a target release for our community.  Then, when we have frozen all parts of the ontology that our community needs in its ontology, we extract those versions into a release.  Other communities are unaffected by our release cycle.</p>
<p>The wiki might even show a community view of its content.  That is, it might show the versions of the ontology (and related wiki content) that is consistent with a community view.  Two different individuals might see two distinct views for the same pages.  Each might see the version of content that was frozen for their next release or the current content for parts that are not frozen for their community.</p>
<p>The careful IT reader might at this point be wondering about forks and other change management functionality.  I do not want to try to address such issues here, but I would welcome your comments and suggestions, especially if you think merges are important.  I would argue that forks are to be avoided.  They might be frozen for the practical reasons described here, but we need individual communities to converge with the entire community over time.</p>
<h3><strong>Bureaucracy</strong></h3>
<p>As the enterprise –worthy ontology underlying the collaborative platform grows and matures the electorate will inevitably demand more stability in what will become the core or upper ontology common to most enterprises.  Perhaps an amendment to the constitution will be offered and adopted by super-majority.  And a new community will form at the top of the taxonomy with permission to govern aspects and versions of parts of the ontology delegated to it by the electorate.  When this happens, the chasm will have been crossed.  Or, perhaps the wiki would start with such a board of directors acting as a senate or superior court, if you will.</p>
<p>(See <a href="http://en.wikipedia.org/wiki/Wikipedia:Bureaucrats" target="_blank">bureaucrats</a> and <a href="http://en.wikipedia.org/wiki/Wikipedia:Arbitration_Committee_Elections_December_2009" target="_blank">recent elections</a> at Wikipedia for further insight on practices and possibilities.)</p>
<h3><strong>Permissions</strong></h3>
<p>The bureaucracy will want to effectively freeze certain parts of the ontology for all communities other than itself.  Even while frozen however, the upper ontology might still receive votes, ratings, and annotations (which is an important and substantial topic in itself).  The mechanism for enforcing this is familiar to almost anyone who would use such a wiki: permissions.  The typical approach is to map types of users to things they can or cannot do with, to, or on a particular thing (and to support inheritance between things as appropriate).  For example, Revelytix supports some of what I have described for communities as well as role-based <a href="http://knoodl.com/ui/groups/knoodl/wiki/Help/entry/Communities#Permissions" target="_blank">permissions in Knoodl</a>, its semantic wiki.</p>
<h3><strong>Convergence</strong></h3>
<p>As the modifications to the content concerning a part of the ontology diminish, it is either becoming less relevant or more stable.  Voting or rating can determine which.  Combined with voting and rating information, version control helps identify parts of the ontology that are (close to being) ready for inclusion in a “frozen” version.</p>
<p>On the other hand, volatility in votes, ratings, and versions indicates potential controversy.  Wiki democracy can govern controversy.  For example, the electorate can dictate or delegate the definition of policies concerning limits and resolution of controversy. </p>
<p>More practically, if membership in the wiki is limited to people with a vested interest in the emergent and continuous use of the resulting ontology, such as by authentication and other policies, such as nomination and formal acceptance, many of the problems experienced by Wikipedia can be effectively eliminated.</p>
<p>Both of these approaches are becoming increasing common in popular wikis.</p>
<h3><strong>Authority</strong></h3>
<p>Social media demonstrate a variety of techniques for gauging authority, whether globally or per community, generally using voting and rating, but in some cases more objectively.  In addition, communities, particularly where they correspond to organizations, may explicitly delegate authority concerning their operations without “resorting” to democracy.  The wiki should support both kinds of authority, but it can leverage the former (within or across communities) in gauging the quality of content or to resolve controversy.</p>
<p>With appropriate views of the ontology, people can see what parts of the ontology have many positive or negative votes or ratings filtered by community and level of authority and ranked by volatility.  Additional criteria that may help assess the stability and adequacy of ontological knowledge (e.g., concepts, whether categories, collections, or individuals, and properties) are certainly possible and we solicit your ideas and suggestions.</p>
<h3><strong>Private Content</strong></h3>
<p>Our interest is in deriving enterprise benefits.  We believe that semantics offers significant benefits within enterprises.  These are most widely recognized in the areas of data governance and integration but will become increasingly relevant in business process management, enterprise knowledge management, and policy automation.  And we are looking forward to leveraging increased understanding of <a href="http://www.ibm.com/developerworks/rational/library/08/0401_amsden/" target="_blank">enterprise objectives</a>, in <a href="http://www.oracle.com/us/solutions/ent-performance-bi/index.htm" target="_blank">business intelligence and enterprise performance management</a> (e.g., leveraging XBRL for CFOs).</p>
<p>Enterprises who define their business processes, policies, and objectives semantically want their business process models (BPMN), their semantics of business vocabulary and rules (SBVR) and business motivation model (BMM) to be closely related to the ontology wiki.  In effect, they want a wiki within their firewall that is seamlessly integrated with the extended enterprise ontology wiki. They want control over their contributions to the community. </p>
<p>In the extreme, enterprises want to host their own wiki as inheriting from the version of the external wiki that their community is working with and towards.  As they edit their content they want careful control over what will be published to the broader community and what will be retained as proprietary.  If there is a firewall between their wiki and the global wiki, they want to ensure that none of their proprietary content penetrates the firewall.</p>
<p>This is an extended effect of permissions that interacts with an inheritance scheme between a taxonomy of two wikis.  Although this inheritance can be generalized to support a taxonomy of wikis rooted globally but extended internally, the typical use case is inheritance from one external wiki to one internal wiki.  This permissions and inheritance mechanism should support overrides and exclusions such that the enterprise can prefer its own definitions which improve or conflict with the global wiki without disclosing them to the broader community.</p>
<h3><strong>Collaborative Protege</strong></h3>
<p>Although it does not provide wiki functionality, Collaborative Protégé addresses many of the issues discussed above.  For example, the following shows Protégé’s support for an extensible set of annotations.<a title="Image:CollabProtege_ChaoToDo.png" href="http://protegewiki.stanford.edu/index.php/File:CollabProtege_ChaoToDo.png"> </a></p>
<p style="text-align: center;"><img class="aligncenter" title="Collaborative Protege annotations" src="http://protegewiki.stanford.edu/images/a/aa/CollabProtege_ChaoToDo.png " alt="" /> </p>
<p>Protégé uses annotations to encode version information, which is cumbersome but stays within the framework of using an RDF representation.  The following shows that even fine-grained constructs, such as annotations, are maintained with author and date/time information:</p>
<p style="text-align: center;"><img class="aligncenter" title="Collaborative Protege annotation details" src="http://protegewiki.stanford.edu/images/archive/0/09/20091216213844%21CollabProtege_search.png" alt="" /></p>
<p><a title="&quot;CollabProtege search.png&quot; " href="http://protegewiki.stanford.edu/index.php/File:CollabProtege_search.png"></a></p>
<p>Note that the changes tab would show the history for a specific object in the ontology.  Protégé also supports proposals and voting or rating within a discussion forum metaphor:</p>
<p style="text-align: center;"><img class="aligncenter" title="Collaborative Protege rating and voting" src="http://protegewiki.stanford.edu/images/7/72/CollabProtege_TodoType.png" alt="" /></p>
<p>Workflow support is planned for future versions of Collaborative Protégé.</p>
<p>Overall, Protégé is a good tool for the ontological aspects but insufficient when compared to the blog, wiki, content management, and discussion forum functionality common across the social, read-write web.  Unfortunately, as with other platforms discussed below, it does not yet go far enough with regard to communities, permissions, and inheritance (or other mappings) between federated ontologies.</p>
<h3><strong>Semantic MediaWiki</strong></h3>
<p>Semantic MediaWiki (SMW) has the virtue of facilitating evolutionary consensus on what concepts and relations between them mean by allowing authors to converge on a few paragraphs of text.  SMW pages may correspond to ontological objects, such as concepts or properties and links between SMW pages may correspond to ontological properties.</p>
<p>SMW is an extension of MediaWiki, the platform used for Wikipedia.  MediaWiki has adequate version control functionality but ignores conflicting edits; the last writer wins.  This may be fine, in practice but other wikis provide automatic locking with timeouts to avoid conflicting edits.</p>
<p>Conflicting edits can be particularly important when modifying ontological structure, especially where inheritance is involved, as in SMW.  Furthermore SMW content is more technical to edit than editing MediaWiki content, which is also technical.  Although SMW provides an adequate backbone for merging ontology with wiki content, its unstructured approach to editing and collaboration may prove inadequate for convergent ontological development.</p>
<p>MediaWiki supports limited group functionality (i.e., Bureaucrats, Administrators and Everyone) with correspondingly little support for permissions, consequently SMW falls short with regard to communities and convergence.</p>
<h3><strong>Content Management Systems</strong></h3>
<p>CMS, such as Drupal, can provide wiki-like functionality in addition to blog and other functionality which may prove attractive and useful to a community.  Drupal also provides discussion forum functionality.  Although MediaWiki provides discussion pages, the thread functionality provided by Drupal and other systems (including Protégé) is much more suitable for collaborative development.  Drupal also allows content to be edited in a more structured manner and to be more flexibly and dynamically composed.</p>
<h3><strong>Blogs</strong></h3>
<p>Wiki articles typically correspond to specific topics, as in an encyclopedia.  WordPress and TypePad articles are more free-form, while content management systems are best suited for on-line publications, including magazines and newspapers.</p>
<p>In a blog, the discussion is typically more in-line with the content than a wiki’s discussion or “talk” pages.  Many blogs support threaded commentary as in a more full-fledge discussion forum platform.   Blog-like commentary has been adopted in content management but is generally more limited in its presentation, such as when an article is viewed individually at</p>
<p>The wiki metaphor aligns better with the conceptual structure of an ontology.  This is demonstrated, for example, in the Semantic MediaWiki extension of MediaWiki (SMW).  The primary principle underlying SMW is that the ontology is documented by a page per concept or property.  Conceptually, the categories of MediaWiki become classes in an SMW ontology and the classes of the ontology correspond to MediaWiki categories.  Instances within the ontology also have their own pages.</p>
<h3><strong>Social Networks</strong></h3>
<p>Social networks, such as Facebook and Twine, for example, facilitate interpersonal communication, groups, interest tracking, and content or contact rankings and recommendations.  Clearly, such features should be considered extensions of a collaborative ontology platform.  Among them the ranking is most important, as discussed above.</p>
<h3><strong>Bookmarking</strong></h3>
<p>Services that originally provided bookmarking services, such as Digg, have evolved into social networks.  Nonetheless, their initial functionality was to provide tagging of bookmarks that would otherwise have remained unorganized within web browsers.  The tags assigned to bookmarks are typically less formal than ontological classification.  Some such services surreptitiously organize user tags and content ontologically, however.  Some argue that such ontological structure enables better recommendation, search, and browsing functionality.</p>
<p>In a platform for collaborative ontology development, it makes sense to support explicit tagging using ontological concepts.  Adding a tag would correspond to choosing (or defining) a concept in the ontology.  If in a wiki presentation, the pages for concepts could have a region or tab for bookmarks that use the category.  Moreover, semantic pivoting and drill-down could be supported, as is common in semantic web applications.  As with other content, bookmarks could be openly shared or limited to communities or individuals.</p>
<h3><strong>A Foundation for Semantic Collaboration</strong></h3>
<p>We are investigating platforms and formulating requirements for such a collaborative platform for developing and maintaining extended enterprise ontologies.  We will be writing further on this area in the future but welcome your thoughts and interests in the meantime.</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2010/01/19/extended-enterprise-ontology/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Time for the next generation of knowledge automation</title>
		<link>http://haleyai.com/wordpress/2009/11/01/time-for-the-next-generation-of-knowledge-acquisition-management-and-automation/</link>
		<comments>http://haleyai.com/wordpress/2009/11/01/time-for-the-next-generation-of-knowledge-acquisition-management-and-automation/#comments</comments>
		<pubDate>Sun, 01 Nov 2009 15:08:36 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[Business Process Management]]></category>
		<category><![CDATA[Business Rules Management]]></category>
		<category><![CDATA[Complex Event Processing]]></category>
		<category><![CDATA[Decision Management]]></category>
		<category><![CDATA[Formal Logic]]></category>
		<category><![CDATA[Knowledge Management]]></category>
		<category><![CDATA[Natural Language]]></category>
		<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[aggregation]]></category>
		<category><![CDATA[Authority]]></category>
		<category><![CDATA[BI]]></category>
		<category><![CDATA[BPM]]></category>
		<category><![CDATA[business rules forum]]></category>
		<category><![CDATA[CEP]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[event calculus]]></category>
		<category><![CDATA[event processing]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[Haley]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Ilog]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[polcy management]]></category>
		<category><![CDATA[problem solving]]></category>
		<category><![CDATA[processes]]></category>
		<category><![CDATA[question answering]]></category>
		<category><![CDATA[situation calculus]]></category>
		<category><![CDATA[SparQL]]></category>
		<category><![CDATA[tense]]></category>
		<category><![CDATA[time]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/2009/11/01/time-for-the-next-generation-of-knowledge-acquisition-management-and-automation/</guid>
		<description><![CDATA[In preparing for my workshop at the Business Rules Forum in Las Vegas on November 5th, I have focused on the following needs in reasoning about processes, about events, and about or over time:

Reasoning at a point within a [business] process
Reasoning about events that occur over time.
Reasoning about a [business] process (as in deciding what [...]]]></description>
			<content:encoded><![CDATA[<p>In preparing for my workshop at the Business Rules Forum in Las Vegas on November 5th, I have focused on the following needs in reasoning about processes, about events, and about or over time:</p>
<ol>
<li>Reasoning at a point within a [business] process</li>
<li>Reasoning about events that occur over time.</li>
<li>Reasoning about a [business] process (as in deciding what comes next)</li>
<li>Reasoning about and across different states (as in planning)</li>
</ol>
<p>Enterprise decision management (EDM) addresses the first.  Complex event processing (CEP) is concerned with the second.  In theory, EDM could address the third but it does not in practice.  This third item includes  the issue of governing and defining workflow or event-driven business processes rather than point decisions within such business processes. </p>
<p>Business applications of rules have not advanced to include the fourth item.  That is to say, business has yet to significantly leverage reasoning or problem solving techniques that are common in artificial intelligence.  For example, artificially intelligent question and answer systems, which are being developed for  the semantic web,  can do more than retrieve data &#8211; they perform inference.  Commercial database and business intelligence queries are typically much less intelligent, which presents a number of opportunities that I don&#8217;t want to go into here but would happy to discuss with interested parties.  The point here is that business does not use reasoning much at all, let alone to search across the potential ramifications of alternative decisions or courses of action before making or taking one.  Think of playing chess or a soccer-playing robot planning how to advance the ball on goal.  Why shouldn&#8217;t business strategies or tactical business decisions benefit from a little simulated look-ahead along with a lot of inference and evaluation?</p>
<p>Even though I have recently become more interested in the fourth of these areas, I expect the audience at the business rules forum to be most interested in the first two points above.  There will also be some who have enough experience with complex business processes, which are common in larger enterprises.  These folks will be interested in the third item.  Only the most advanced applications, such as in biochemical process planning, will be interested in the fourth.  I don&#8217;t expect many of them to attend!</p>
<p>The notion of enterprise decision management (EDM) is focused on point decision making within a business process.  For enterprises that are concerned with governing business processes, a model of the process itself must be available to the business rules that govern its operation.  I&#8217;ve written elsewhere about the need for an ontology of events and processes in order to effectively integrate business process management (BPM) with business rules.  Here, and in the workshop, I intend to get a little more specific about the requirements, what is lacking in current standards and offerings, and what we&#8217;re trying to do about it.<span id="more-109"></span></p>
<p>As I&#8217;ve written previously, the distinction between business process management (BPM) and CEP is not well principled but arises from somewhat arbitrary, historically distinct emphases on technology and market segmentation.  Any modern business process system must handle events and discussing events processing without considering processes is a limiting perspective.</p>
<p>Most people would intuitively agree that events trigger business processes.  For example, a business receiving a payment or a letter from a customer or vendor is an event that triggers the process of crediting and depositing that payment or considering and responding to that letter  Hopefully, we are moving beyond academic arguments about the distinction.  (I am ignoring here the algorithmic applications of streaming event processing as in trading in the capital markets.)</p>
<p>So, in current BPM, which should include CEP capabilities, we want decision management that is less ignorant about where it is in a business process and that is aware of events that trigger processing.  That is, we want policies that talk about the state of a business process and the occurrence of events.  <span style="text-decoration: underline;">The surprising thing is that current business rules management systems (BRMS) and related standards are of no help</span>.  Tools from the leading business rule management system (BRMS) vendors, including Oracle and IBM / Ilog, have no intrinsic understanding of processes, events, or time.  And, outside their integrated BRMS, tools from BPM vendors don&#8217;t let us &#8220;talk&#8221; about anything.  They help us structure flows and code, but they rely on integrated BRMS to manage rules.  The rules include the policies, in which the business &#8220;talks&#8221;.  The BRMS is where English sentences (or something that increasing looks like English) are managed as the enterprise repository of policy. </p>
<p>Note that there is nothing special about English.  It&#8217;s just easier to read than &#8220;natural language&#8221;.  Another limitation of current policy management systems is their lack of language independence, which requires automatic translation, which is much simpler if the sentences are unambiguously interpreted with logical rigor, but I digress from the point of this missive&#8230;</p>
<p>Some examples will help here.  Suppose we check the credit of an applicant at various points within the collection of processes that constitute how our enterprise conducts business.  We might have policies that are concerned with how we consider or act based on credit information in originating a loan (or policy) versus in renewing  or re-pricing one.  In effect, our policies want to talk not just about evaluating credit or pricing risk, but to do so in the context of a larger business process.  To be more specific, business policies that sound like, &#8220;if evaluating credit in the course of pricing a renewal&#8230;&#8221; or &#8220;if evaluating credit while considering a new policy&#8230;&#8221; are quite natural.  These statements define or govern the business process.  They also talk about where the decision is being made within a business process.  </p>
<ul>
<li>BRMS need to understand the context of the business process in order to make context-sensitive decisions.</li>
<li>BPM needs to tell the BRMS what it is going on from the top-down for the BRMS to understand the context of a decision.</li>
</ul>
<p>So, we need the BRMS to be told things like:</p>
<ul>
<li>I am considering a new applicant.</li>
<li>I am considering the renewal of a contract.</li>
</ul>
<p>In these statements, the pronoun &#8220;I&#8221; is the overall enterprise system contemplating its own actions.  If you find that awkward, just substitute &#8220;while&#8221;  to obtain what you might &#8220;say&#8221; in a business policy.  Ideally, the language your policy management system would not be overly stilted but would understand any of:</p>
<ul>
<li>while considering&#8230;</li>
<li>during consideration of &#8230;</li>
<li>if considering&#8230;</li>
<li>if an &#8230; is being considered for&#8230;</li>
</ul>
<p>and so on.  Otherwise, users will find authoring such statements cumbersome.  Reading and understanding English, even if it is a bit stilted, is easy for people.  We&#8217;re built to communicate, after all.</p>
<p>Now consider what you would want to say if you were writing policies that involved events.  In this case, the event has already occurred, such as &#8220;we received a letter from a customer&#8230;&#8221;.  You do not want to say, &#8220;if I am receiving a letter from a customer&#8230;&#8221; (which could only be true for an instant that passes quickly unless it was stated as &#8220;&#8230; I will be receiving&#8230;&#8221;).  But if we can only refer to events in past tense, how do we talk about a current event that needs to be handled versus another event that we have already handled?</p>
<p>Many business to consumer (B2C) applications, such as pharmacy benefits have this problem, for example.  To a pharmacy benefit manager (PBM), like Medco or Express Scripts, the swiping of an insurance card at a retail pharmacy is an event to be processed.  Any individual beneficiary has a history of such requests.  We can try to model the current one as a request and the prior ones as transactions, but this becomes awkward for less formal or technical people who want to talk about how many requests someone has submitted over a period of time, for example.  The truth is that there is a history of requests per beneficiary and technical limitations should not obscure this fact.  We should be able to distinguish the current request from prior requests, as in the following:</p>
<ul>
<li>a request that has not been processed is current or pending</li>
<li>if processing a request&#8230;</li>
<li>if a request for &#8230; is being considered&#8230;</li>
</ul>
<p>Note that &#8220;request&#8221; is a deverbal noun, which is to say that the root form is the verb (in this case &#8220;to request&#8221;).  A request is a reference to an act of requesting that may be in any tense.  The sentences above reflect this in the use of an additional verb that carries the tense.  Of course, this is all completely natural since every sentence has a verb.</p>
<p>The most dangerous expression might be:</p>
<ul>
<li>if &#8230; requests &#8230;</li>
</ul>
<p>and yet this is the form that almost all BRMS would handle today!  This is dangerous because it is too ambiguous about when the request occurred.  It would be better to say:</p>
<ul>
<li>when &#8230; requests &#8230;</li>
</ul>
<p>provided that the system understands that, unlike &#8220;if&#8221;, &#8220;when&#8221; involves time, but even &#8220;when&#8221; is less than ideal since an event has always occurred in the past by the time it is processed.  On the other hand, we might define when a request occurs as in:</p>
<ul>
<li>a request occurs from the time it is received until it receives a final response.</li>
</ul>
<p>This assumes that &#8220;when&#8221; combined with a verb in present perfect tense means during the period of time in which the process referenced by the verb continues.  And this is an important point:</p>
<ul>
<li>An event can be an occurrence of a process.</li>
<li>An event may have a duration.</li>
<li>&#8220;When&#8221; may refer to an interval of time.</li>
</ul>
<p>Events are not necessarily processes, but may refer to instantaneous points in time, such as in the following:</p>
<ul>
<li>When a request is received&#8230;</li>
<li>When the processing of a request begins&#8230;</li>
</ul>
<p>but these uses of &#8220;when&#8221; refer to a point in time before any action can be taken in response to the event, therefore the sentences should only conclude with statements of implied, necessary, or modal logic and <span style="text-decoration: underline;">should not include any statement of action</span>. Of course, a competent BPM/CEP/BRMS would understand all this and advise the author of a policy that suggests taking action in the past.</p>
<p>As we proceed through these examples our intuition should be building the understanding of the first three points made above.  Processes and events and reasoning about or over time are completely intertwined in nature and separating them between BPM and CEP and BRMS systems is completely artificial and hopelessly limiting.</p>
<p>So what is the solution?  I suggest it is a knowledge management  system that understands the following:</p>
<ul>
<li>1. Policies that use tense.</li>
<li>2. Policies that refer to events using deverbal nouns.</li>
<li>3. Policies that refer to occurrences of processes as events.</li>
<li>4. Policies that refer to potential action using future tense, possibly by way of modals.</li>
<li>5. Policies that refer to occurrences of processes using verbs such as &#8220;begin&#8221;, &#8220;end&#8221;, &#8220;start&#8221;, etc.</li>
<li>6. Policies that refer to occurrences of process using words like &#8220;during&#8221;, &#8220;while&#8221; and &#8220;when&#8221;</li>
<li>7. Policies that refer to events using prepositions like &#8220;by&#8221;, &#8220;before&#8221;, &#8220;after&#8221;, and &#8220;when&#8221;</li>
</ul>
<p>The natural language technology to parse such sentences is widely available using many approaches.  I am happy to discuss that with interested parties.  The second step that needs to be addressed is transforming the logical interpretation of such sentences derived from the natural language system into the underlying execution architecture, which includes a process engine and a rules engine that must be appropriately integrated.  That integration involves the informing of the BRMS about the state of the business process and the actions that may be taken which may be expressed as processes in the BPMS.  I&#8217;ve written elsewhere about this in more detail and am also happy to discuss it in more detail with BPM or CEP practitioners, product managers and architects.</p>
<p>Understanding events and occurrences of processes as events adds a great deal of power to policy management.  It allows statements of policy to reference and consider the context of business processes.  It allows statements of policy to reference and consider how to handle events in the context of business processes.  And, if it is done with adequate natural language understanding, it accomplishes this integration of BPM and CEP within a single policy management system.</p>
<p>Although I had hoped to cover the fourth point made first above here, I now prefer to conclude with a brief discussion about reasoning over time.  I will strive to cover reasoning about potential states of a process another day.  It is interesting but rigorous material that requires (in my opinion) architectural support that is lacking from current rules engines, whether production rule or logically based, even if the situational or event calculi are good formalisms.</p>
<p>Reasoning over time is pervasive in CEP.  In the pharmacy benefits management domain, for example, coverage is commonly limited based on the history of transactions.  For example, a policy might limit the amount of refills over a period of time.  This involves aggregation over a number of events, each of which is the result of handling a prior request.</p>
<p>Very few knowledge or policy management systems understand that transactions are processes, occurrences of which can be viewed as events.  For example, is &#8220;order&#8221; an noun or a verb in your enterprise applications?  Our technology has biased us to thinking about objects, which drives our modeling towards nouns and away from verbs.  Our technology biases us against modeling events and processes well!  And it shows up, insidiously sapping productivity and accessibility.</p>
<p>The lack of ontology of process and event in current BRMS not only precludes the kind of integrated BPM and CEP I am discussing here, it also limits the ability of current BRMS to automate policies that consider what has happened in making decisions in the present.  For example, a statement like:</p>
<ul>
<li>if a medicine has a maximum therapeutic dosage over a period that is less than the total dosage of that medicine requested by a member over the same period then&#8230;</li>
</ul>
<p>is beyond the capabilities of current offerings.  Authority understood some grammar about time but did not understand that events, such as a request, <em>occurred</em> in any deep sense.  So it could automate a sentence like:</p>
<ul>
<li>if the total dosage of a medicine requested by a member on a date within the last 90 days exceeds the maximum quarterly therapeutic quarterly dosage for the medicine then&#8230;</li>
</ul>
<p>but understanding why it understands one sentence and not the other is too much for many authors to tolerate, let  alone understand.  The essential reason is that we sold the company before extending Authority&#8217;s ontology to include events and revising its parser to understand that both verbs and their deverbal nouns referred to events (including occurences of processes).</p>
<p>The bottom line here is that a quantum leap in natural language processing of business rules is needed.  Fortunately, this is not a quantum leap for natural language processing itself.  It is well-established that sentences are parsed into representations of events in which noun phrases play semantic roles, such as the following:</p>
<ul>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsAgentAsASemanticRole.htm">agent </a>or <a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsACounteragentAsASemantic.htm">counteragent </a></li>
<li>donor or <a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsABeneficiaryAsASemanticR.htm">beneficiary </a></li>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsPatientAsASemanticRole.htm">patient </a>or &#8220;<a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsExperiencerAsASemanticRo.htm">experiencer&#8221;</a></li>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsCauserAsASemanticRole.htm">causer </a></li>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsInstrumentAsASemanticRol.htm">instrument </a></li>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsLocativeAsASemanticRole.htm">locative </a>or <a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsTimeAsASemanticRole.htm">time </a></li>
<li><a href="http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsSourceAsASemanticRole.htm">source </a>or destination</li>
</ul>
<p>where quite a few prepositions relate to more refined aspects of time and location, such as at, on, during, by, before, after, in and so on.   The critical thing for processes and events is that they occur in time.</p>
<p>Realizing this quantum leap in policy management and knowledge automation is really pretty simple.  Take an approach such as Authority and extend its core, upper ontology with the semantic roles and the concepts of events and processes.  Then extend its relation-centric parsing with even-centric parsing (both are needed).  A few more steps, notably handling metonymy, and the next generation of knowledge management and automation that provides the integrated understanding of time, events, and processes discussed here becomes a reality.</p>
<p>That&#8217;s what we&#8217;re patiently working towards.  And we&#8217;re doing it in as engine-independent a manner as practical so that we can leverage standards like RIF and SBVR.  It&#8217;s all about the knowledge.</p>
<p>Finally, we are looking for collaborators who would like to learn more or help, and, perhaps, get involved in leveraging the solution or its underlying technology.</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2009/11/01/time-for-the-next-generation-of-knowledge-acquisition-management-and-automation/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Sir Tim Berners-Lee on Ontology</title>
		<link>http://haleyai.com/wordpress/2009/10/29/sir-tim-berners-lee-on-ontology/</link>
		<comments>http://haleyai.com/wordpress/2009/10/29/sir-tim-berners-lee-on-ontology/#comments</comments>
		<pubDate>Thu, 29 Oct 2009 11:32:36 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[ISWC]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[RDF]]></category>
		<category><![CDATA[Sir Tim Berners-Lee]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/2009/10/29/sir-tim-berners-lee-on-ontology/</guid>
		<description><![CDATA[A panel on whether or not ontology is needed to achieve a collective vision for the semantic web was held on Tuesday at the International Semantic Web Conference (ISWC 2009) near Washington, DC.  For most of the panelists the question was rhetorical.  But there were a few interesting points made, including that machine learning of [...]]]></description>
			<content:encoded><![CDATA[<p>A panel on whether or not ontology is needed to achieve a collective vision for the semantic web was held on Tuesday at the International Semantic Web Conference (ISWC 2009) near Washington, DC.  For most of the panelists the question was rhetorical.  But there were a few interesting points made, including that machine learning of ontology is one extreme of a spectrum that extends to human authoring of ontology (however authoritative or coordinated).  Nobody on the panel or in the audience felt that the extreme of human authored ontology was viable for the long-term vision of a comprehensively semantic and intelligent web.  It was clear that the panelists believed that machine learning of ontology will substantially enrich and automate ontology construction, although the timeframe was not discussed.  Nonetheless, the subjective opinion that substantial ontology will be acquired automatically within the next decade or so was clear.  There was much discussion about the knowledge being in the data and so on.  The discussion had a bit of the statistics versus logic debate to it.  Generally, the attitude was &#8220;get over it&#8221; and even Pat Hayes, who gave a well-received talk on <a href="http://www.slideshare.net/PatHayes/blogic-iswc-2009-invited-talk">Blogic</a> and whom one would expect to take the strict logic side of the argument, pointed out seminal work on combining machine learning and logic in natural language understanding of text.</p>
<p><a href="http://people.csail.mit.edu/karger/">David Karger of MIT&#8217;s AI lab</a> challenged the panel from the audience by asserting that the data people posted on the web is much more important than any ontology that might define what that data means.  This set off a bit of a firestorm.  There was consensus that data itself is critically important, if not central.  For the most part, panelists were aghast at the notion that spreadsheets of data would be useless to computers unless the meaning of its headings, for example, were related to concepts defined by reference to ontology those computers understood. </p>
<p>With respectful deference, the panel and audience yielded.  Sir Tim Berners-Lee took the floor.<span id="more-108"></span></p>
<p>The issue of semantics briefly faded from the discussion.  Utility seemed to be the crux of the matter.  Sir Tim illustrated how even the smallest bit of semantics (i.e., meaning by reference) added to a spreadsheet allowed others to quickly and incrementally, almost continuously add value to published data.  He did this by way of example, discussing mash-ups of bicycle accident, traffic, and map data by different people over the course of a day or so after someone first published the bicycle accident data.  Most interesting to me, however, was his concluding point: simply identifying what something is by reference to an existing semantic web concept makes data much more immediately consumable, useful and valuable.  For example, simply identifying that a column is a <a href="http://www.sameas.org/html?q=longitude">longitude using RDF</a> adds a lot of value.   Yes, I realize mash-ups are old, and linked data is well known, but his point was delivered in such a straightforward and compelling manner that the argument simply passed.  He put  a lot of experts eyes back on the ball.</p>
<p>Sir Tim reiterated this point an hour or so later at a meet-up of meet-ups.  His consistent, critical point was that lightweight use of ontology realizes a great deal of value.  Simply using RDF to anchor the semantics of linked data, without substantial ontology development, makes data much more useful to humans.  And with regard to enterprises, he discussed how eleven (yes, he said 11) different concepts are enough to address most of the semantic needs of various enterprises (i.e., their relational database models).  Of course, they will receive more value from reusing existing ontology, extended according to their needs.  His point was that the small step of simply linking data with open web semantics using only the most widely adopted RDF identifiers is a huge step forward for the semantic web and the benefits its technologies can bring to individuals and enterprises.  Full-fledged ontology is important for deeper functionality and long-term visions, but simply using concepts from existing ontology can be a huge step forward.</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2009/10/29/sir-tim-berners-lee-on-ontology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Zigtag for social semantic tagging</title>
		<link>http://haleyai.com/wordpress/2008/06/25/if-you-tag-like-me-zigtag-is-it/</link>
		<comments>http://haleyai.com/wordpress/2008/06/25/if-you-tag-like-me-zigtag-is-it/#comments</comments>
		<pubDate>Wed, 25 Jun 2008 19:24:20 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[Social Networks]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/2008/06/25/if-you-tag-like-me-zigtag-is-it/</guid>
		<description><![CDATA[
I started to use Radar Networks’ Twine at the invitation of CEO Nova Spivak after writing this earlier this year (also see this). I enjoyed it for a while, especially because a lot of technology folks were hooking up with each other, especially the semantic web community, on Twine. But I found it  tedious to [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/06/image.png"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/06/image-thumb.png" style="border: 0px none ; margin: 0px 0px 0px 15px" alt="image" align="right" border="0" height="772" width="270" /></a></p>
<p>I started to use Radar Networks’ Twine at the invitation of CEO Nova Spivak after writing <a href="http://haleyai.com/wordpress/2008/03/11/over-100m-in-12-months-backs-natural-language-for-the-semantic-web/">this</a> earlier this year (also see <a href="http://haleyai.com/wordpress/2008/04/16/the-semantic-arms-race-facebook-vs-google/">this</a>). I enjoyed it for a while, especially because a lot of technology folks were hooking up with each other, especially the semantic web community, on Twine. But I found it  tedious to work through beta issues and to be bothered with recommendations or news about who was saying or bookmarking things about what. (I should have turned off the emails sooner!)</p>
<p>I was disappointed that Twine was taking an apparently folksonomic approach to tagging. It was as if Radar Networks was riding semantic web buzz without really embracing it openly or sharing the momentum that the invite-only community was investing in.  That may not sound fair &#8211; I believe that there are semantics in the back room, but that’s how it felt and it&#8217;s still the way it looks.  But probably the worst part is the process that you have to go through to add a bookmark &#8211; which is the whole point, of course!  (I ultimately sacrificed popup blockers, but the process still seems laborious compared to other alternatives.)</p>
<p>I stumbled across Zigtag almost accidentally while working for a VC firm with a portfolio of semantic startups. What I like most about Zigtag is that they make it obvious that they are building an ontology of tags and encourage users to select semantic tags (i.e., concepts) rather than folksonomic “words”.  They also provide tools for managing tags that allow you to move smoothly and incrementally from a folksonomic to a more semantic approach.</p>
<p><span id="more-106"></span></p>
<p>The key to the semantic approach for Zigtag is that shared tags are just that &#8211; they are more precise than strings.  They are not only words &#8211; they have definitions.</p>
<blockquote><p>Unfortunately, like Twine, Zigtag&#8217;s ontological model remains hidden.</p></blockquote>
<p>My initial experience with Zigtag resulted in immediate jubilation.  The Firefox plug-in works for me.  It lets me type in tags with nice completion and recommendations from the tags that others have defined.  Within 15 minutes I was writing to compliment Zigtag on a practical, elegant approach to the semantic bookmarking problem.  I liked it much better than Twine right off the bat &#8211; and despite its book-market, I like Twine a lot!  Within a few minutes I had an email from their founder, Reg Cheramy.  An hour later we were talking.  We talked about his early meeting with <a href="http://www.techcrunch.com/2006/03/04/zigtags-personal-knowledge-library/">Michael Arrington</a>, how his work compares to bulletin board or discussion forum emphasis in Twine , how he facilitates semantic tagging given a very large ontology and vocabulary, and so on.</p>
<p>Whether Reg took my advice to emphasize groups more or was already headed in that direction is unclear, but Zigtag now has group functionality that seems as good as (and in some ways better) than Twine&#8217;s.  If you go to Zigtag <a href="http://www.zigtag.com" target="_blank">the web site</a>, you can find groups to join, but unlike Twine&#8217;s web site, Zigtag does not recommend groups for you based on your interests.  I&#8217;m not sure this is a problem, though.  Recommendations can be distracting.  Nonetheless, if people want recommendations for more than content, it would be a simple step for Zigtag given the fact that they already recommend content that others have bookmarked.</p>
<p>I&#8217;m not too concerned with recommendations, even of content, so I cannot comment on Zigtag versus Twine on that front.  Generally, there is plenty of RSS and recommendation noise to go around.  I prefer the linked approach to finding information rather than searching and I don&#8217;t expect recommendations to become excellent in the near term.  For more on this, you might want to check out the recent news  about Vulcan&#8217;s EVRI investment at <a href="http://www.webware.com/8301-1_109-9953394-2.html" target="_blank">Webware</a> or <a href="http://www.readwriteweb.com/archives/evri_beta_launches_search_less.php" target="_blank">ReadWriteWeb</a>.</p>
<p>I like to use Zigtag from the sidebar in Firefox.  Actually, I owe Reg additional thanks for, in effect, causing me to abandon Internet Explorer for Firefox.     I use it primarily to organize my bookmarks semantically and across machines.  For those that want to do the same, you might also be interested in Mitch Kapor&#8217;s <a href="http://www.foxmarks.com/" target="_blank">Foxmarks</a>.</p>
<p>I&#8217;m fine with finding groups on my own and I like seeing people and what they tend to tag, too.  Now that I know they are available on the web site, though, I want them in the sidebar.   The fact that they are indirect on the web site, not presented in the sidebar, and not proactively recommended probably explains why there are relatively few (especially compared to Twine).  It would be nice, for example,  to see groups and people organized along with bookmarks according to how heavily they use tags as I pivot through various facets.</p>
<p>So, on a feature basis, I like Zigtag more than Twine for two primary reasons:</p>
<ol>
<li>Zigtag&#8217;s Firefox plug-in is a great user interface while Twine&#8217;s book-market is awkward in every sense that matters to me.</li>
<li>Zigtag emphasizes and leverages shared tagging of tags that have clearly documented interpretations  Twine is too folksonomic.</li>
</ol>
<p>The picture shown in this post shows that Zigtag already &#8220;knows&#8221; a lot about semantics.  Part of the reason is that they must have a roomful of people watching for  tags that people enter that are not defined.  Quite a few of the tags I&#8217;ve added have become defined within hours (sometimes minutes) of when I enter them.  We&#8217;ll see how this scales up, but I like it &#8211; a lot.</p>
<p>The key question for both these sites is:</p>
<blockquote><p>Are you going to share your ontology?  If not, why not?  If so, when or why not now?</p></blockquote>
<p>Note that I am not suggesting they should. But if they have a reason not to, it would be nice to understand that.</p>
<p>It also would be nice to know whether the effort I expend on either site will be lost if they are acquired or I want to switch.  That&#8217;s how it looks at Twine today.</p>
<p>Zigtag exports my bookmarks.  I can get them from or over to Delicious, no problem.  But I want their semantics, too.  I would really appreciate preservation of the text, preferably the semantics of my tags.  Perhaps if my bookmarks were simply output as an OWL referencing their ontology?  At least then I could move without losing the effort that I have put into them, whether folksonomic or semantic.  I also want to know if their ontology is are any good and, if so, I&#8217;d appreciate export to OWL so that I could use bookmarks for other purposes that interest me.</p>
<blockquote><p>The background issue of data portability, for bookmarks, social networks, and other personal profile data is huge.</p></blockquote>
<p>If I had OWL export and an open ontology, I would be less worried about my investment in Zigtag or Twine.  Consider Techcrunch&#8217;s <a href="http://www.techcrunch.com/2008/05/05/tagging-goes-semantic-with-zigtag/" target="_blank">recent comments</a>:</p>
<blockquote><p>Zigtag’s biggest obstacle is the slew of other social bookmarking sites already available (<a href="http://del.icio.us">&#8230;</a>). The semantic tagging feature is fairly unique, but its appeal is still untested, especially against automated semantic taggers like Twine. Frankly, a lot of people are just going to stick with the simple but effective Delicious interface.</p></blockquote>
<p>It&#8217;s hard to argue with the first sentence, but the second seems harsh.  Twine is getting credit that it may not deserve.  Also, Zigtag recommends tags, too.  But the third sentence is a problem for Zigtag as well as Twine, although the latter benefits from superior PR.</p>
<p>Another question, of course, is how Zigtag and Twine will fare once they try to make money.  Radar Networks has stated that Twine will start running ads by the end of the year.  Zigtag has made no public announcements.  Delicious selectively advertises (e.g., on search pages), perhaps to feed intelligence to Yahoo&#8217;s advertising network.  The advertisements are so selective that the value of other book-marking sites may be limited to the intelligence that they provide to established advertising networks.  If so, this will hold down valuations and slow innovation.  We&#8217;ll see, but obviously, I hope not..</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2008/06/25/if-you-tag-like-me-zigtag-is-it/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A Common Upper Ontology for Advanced Placement tests</title>
		<link>http://haleyai.com/wordpress/2008/04/18/a-common-upper-ontology-for-advanced-placement-tests/</link>
		<comments>http://haleyai.com/wordpress/2008/04/18/a-common-upper-ontology-for-advanced-placement-tests/#comments</comments>
		<pubDate>Fri, 18 Apr 2008 13:53:35 +0000</pubDate>
		<dc:creator>paul@haleyAI.com</dc:creator>
				<category><![CDATA[Ontology]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[advanced placement]]></category>
		<category><![CDATA[arithmetic]]></category>
		<category><![CDATA[fraction]]></category>
		<category><![CDATA[geometry]]></category>
		<category><![CDATA[Halo Project]]></category>
		<category><![CDATA[irrational number]]></category>
		<category><![CDATA[mathematics]]></category>
		<category><![CDATA[physics AP]]></category>
		<category><![CDATA[Protege]]></category>
		<category><![CDATA[ratio]]></category>
		<category><![CDATA[rational number]]></category>
		<category><![CDATA[SI units]]></category>
		<category><![CDATA[Vulcan]]></category>

		<guid isPermaLink="false">http://haleyai.com/wordpress/2008/04/18/a-common-upper-ontology-for-advanced-placement-tests/</guid>
		<description><![CDATA[I have previously written about the lack of a common upper ontology in the semantic web and commercial software markets (e.g., business rules).  For example, the lack of understanding of time limits the intelligence and ease of use of software in business process management (BPM) and complex event processing (CEP).  The lack of understanding of [...]]]></description>
			<content:encoded><![CDATA[<p>I have previously written about the lack of a common upper ontology in the semantic web and commercial software markets (e.g., business rules).  For example, the lack of understanding of time limits the intelligence and ease of use of software in business process management (BPM) and complex event processing (CEP).  The lack of understanding of money limits the intelligence and utility of business rules management systems (BRMS) in financial services and the capital markets.   And, more fundamentally, understanding time and money (among other things, such as location, which includes distance) requires a core understanding of amounts.  </p>
<ul>
<li><a rel="bookmark" href="http://haleyai.com/wordpress/2008/02/19/understanding-events-and-processes-takes-time/" title="Permanent Link to Understanding events and processes takes time">Understanding events and processes takes time</a></li>
<li><a rel="bookmark" href="http://haleyai.com/wordpress/2008/03/03/oracle-should-teach-siebel-crm-about-location-and-money/" title="Permanent Link to Oracle should teach Siebel CRM about location and money">Oracle should teach Siebel CRM about location and money</a></li>
<li><a rel="bookmark" href="http://haleyai.com/wordpress/2008/03/20/ontology-of-time-in-progress-amounts-needed/" title="Permanent Link to Ontology of time in progress - amounts needed">Ontology of time in progress &#8211; amounts needed</a>  </li>
</ul>
<p>The core principle here is that software needs to have a common core of understanding that makes sense to most people and across almost every application.  These are the concepts of Pareto&#8217;s 80/20 Principle.  A concept like building could easily be out, but concepts like money and time (and whatever it takes to really understand money and time) are in.  Location, including distance, is in.  Luminousity could be out, but probably not if color is in.  Charge and current could be out, but not if electricity or magnetism is in.  The cutoff is less scientific than practical, but what is in has to be deeply consistent and completely rational (i.e., logically rigorous).<sup>[2]<span id="more-93"></span></sup></p>
<h2>Angle is in</h2>
<p>It took working with a team that is getting software to pass AP tests in physics, chemistry, and other areas (e.g., biology and economics) for me to realize that the physicists and chemists who defined the the International System of Units overlooked something fundamental not only to math and physics, but to our integration of language with visual perception. </p>
<p>It turns out that understanding angles can be just as important &#8211; and is just as primitive &#8211; as understanding distance.  Consider longitude and latitude, for example.  And if you&#8217;re trying to get a computer to <a target="_blank" href="http://www.cs.utexas.edu/~kbarker/papers/kr04-shaken.pdf">get college credit by passing the advanced placement test for Physics</a> , then you need to understand angular momentum. </p>
<p>Surprisingly, distance is among the <a target="_blank" href="http://physics.nist.gov/cuu/Units/units.html">SI units</a>, but angle is not!<sup>[1]</sup></p>
<p>Getting computers to understand angles involves circles and fractions, which bring a few other concepts into our upper ontology, such as diameter, circumference, arcs, and denominators and numerators.  Curiously, angles are primitive and yet they are intrinsically fractions of circles.</p>
<p>A piece of a pie could be an eighth or a sixth of the whole pie.  The angle of either such piece would be 30 or 45 degrees, since there are 360 degrees in a circle (assuming the pie is round).</p>
<h2><strong>What is an angle?</strong></h2>
<p>Here&#8217;s what Wikipedia has to say on the matter:</p>
<blockquote><p>In geometry and trigonometry, an angle (in full, plane angle) is the figure formed by two rays sharing a common endpoint, called the vertex of the angle. The magnitude of the angle is the &#8220;amount of rotation&#8221; that separates the two rays, and can be measured by considering the length of circular arc swept out when one ray is rotated about the vertex to coincide with the other (see &#8220;Measuring angles&#8221;, below). Where there is no possibility of confusion, the term &#8220;angle&#8221; is used interchangeably for both the geometric configuration itself and for its angular magnitude (which is simply a numerical quantity).</p></blockquote>
<p>And another definition says:</p>
<blockquote><p>A shape formed by two rays sharing a common endpoint or two lines that intersect. An angle has one vertex and two sides.</p></blockquote>
<p>So before we are done with all this our ontology will have to include fundamental geometry, including rays, endpoints, vertices, length (which we&#8217;ve previously covered), lines, and intersection.  Here we are going to deal with nothing more than how angles are measured and pi.</p>
<p>Angles are measured in degrees or radians.  Radians may seem strange at first but they are harmless.  The ratio of a circle&#8217;s circumference to its diameter is the same, regardless of its radius (which is half its diameter, of course).  This is really no big surprise, since the ratio of a square&#8217;s circumference to the length of its sides is constant, too.  Unlike a square, where the ratio is the number of sides (i.e., 4), the ratio of the circumference of a circle to its diameter is an irrational number, called pi. </p>
<h2><strong>Is pi a fraction?</strong></h2>
<p>Most of us know that pi is approximately 3.1415.  It is also true that the digits after the decimal point continue forever without a repeating pattern.  Numbers that cannot be expressed with a mantissa that is either finite or repeating are called irrational numbers.  If a real number is not irrational then it is rational.  There are also complex numbers that mix real and imaginary numbers (involving the square root of negative one), but we&#8217;ll skip those for now, OK?</p>
<h2>Rational numbers are not irrational</h2>
<p>Rational numbers are normal fractions, one integer over another.  Any rational number that is not equal to an integer has either a limited mantissa or a repeating pattern.  For example, the mantissa of ½ is a single digit, as in 0.5.  The mantissa of ¼ is two digits, as in 0.25.  The mantissa of 1/3 is an infinite number of digits, all of which are 3, as in 0.33333&#8230;.  The mantissa of 1/7 is a little more interesting.  It is a repeating pattern of 6 digits, as in 0. 142857142857142857142857142857142857&#8230;</p>
<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/rational-number.jpg" title="An ontology of numbers including rational and irrational numbers and fractions using Protege"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/04/rational-number.jpg" alt="An ontology of numbers including rational and irrational numbers and fractions using Protege" /></a><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/fraction.jpg" title="Ontology of fractions and rational numbers using Protege"></a></p>
<p>So our ontology of numbers includes both rational numbers (i.e., fractions) and irrational numbers and pi is an irrational number.  But recall that rational numbers and irrational numbers are disjoint.  That is, a fraction cannot be an irrational number.  So <strong>pi cannot be a fraction</strong>!</p>
<h2><strong>Ratios are not fractions</strong></h2>
<p>Fractions are the ratio of one integer to another.  Not all ratios are between integers, however.  In fact, some ratios are between amounts, such as an amount of distance per an amount of time (e.g., miles or kilometers per hour).</p>
<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/fraction.jpg" title="Ontology of fractions and rational numbers using Protege"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/04/fraction.jpg" alt="Ontology of fractions and rational numbers using Protege" /></a></p>
<p>Pi is not a fraction because the circumference of a circle is not an integer.  But more generally, a fraction is a ratio but not all ratios are fractions.</p>
<h2><strong>Angular units</strong></h2>
<p>An angle of one radian covers an arc of circle that is equal to the circle&#8217;s radius in length.  Since the diameter is twice the radius in length and the circumference of a circle is pi times its diameter, there are 2 times pi radians in a circle.</p>
<p>A degree is also an angle.  More precisely, there are 360 degrees in a circle.  And each degree equals 60 seconds of an arc.  And each second of arc has 60 minutes.  Thus, a second of arc is 1/3600<sup>th</sup> of a circle.</p>
<h2>Acute and other angles</h2>
<p>So, a radian is a little more than 57° and both a radian a degree are acute angles.  All of these units of angle are therefore acute, which means that they are less than a right angle (which is 90°).  An angle of 180° is called a straight angle and an angle between a right angle and a straight angle is called an obtuse angle.  And an angle that is more than a straight angle is called a reflex angle (they are commonly used to express a heading or bearing).</p>
<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/degree.jpg" title="Ontology of angle including a degree"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/04/degree.jpg" alt="Ontology of angle including a degree" /></a>  <a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/radian.jpg" title="Ontology of angle including a radian"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/04/radian.jpg" alt="Ontology of angle including a radian" /></a></p>
<h2><strong>Pi at last</strong></h2>
<p>So, pi is a positive, irrational number and the ratio of any circle&#8217;s circumference to its diameter, as shown below.</p>
<p><a href="http://haleyai.com/wordpress/wp-content/uploads/2008/04/pi.jpg" title="Ontology of numbers including the irrational number pi which is the ratio between the circumference of a circle and its diameter"><img src="http://haleyai.com/wordpress/wp-content/uploads/2008/04/pi.jpg" alt="Ontology of numbers including the irrational number pi which is the ratio between the circumference of a circle and its diameter" /></a></p>
<p>Note that <em>e</em> is Napier&#8217;s constant &#8211; the base of the natural logarithm (i.e., ln(<em>x</em>) = log<em><sub>e</sub></em>(<em>x</em>)).  It is another positive, irrational number that is distinct from pi.  These are, practically speaking, the only two irrational numbers most people will ever come across in math or physics.  In fact, most people know pi but never use it.  Relatively few know or use <em>e</em>, but you need to know the natural logarithm in mathematics class or if you hope to get <a target="_blank" href="http://www.qrg.northwestern.edu/papers/files/qrg_dist_files/qrg_2007/klenk-forbus-aaai07-webpage.pdf">college credit for physics</a> with high marks on the AP test.</p>
<p><br clear="all" /></p>
<hr SIZE="1" width="33%" align="left" /><sup>[1]</sup> As noted previously, I&#8217;m also suspect on &#8220;their&#8221; choices concerning electricity and chemistry.<br />
<sup>[2]</sup>If a topic evokes discussions like <a href="http://iandavis.com/blog/2004/12/sumo">this one</a>, it&#8217;s out! (As is anything that includes the word &#8220;corpuscular&#8221;, OK?) The total size of this thing cannot be thousands of concepts.  As many relations (which includes properties, predicates, etc.) as it takes, but hopefully closer to the order of a hundred concepts &#8211; maybe less.  Yes, Viriginia, that means <strong>chartreuse is out</strong>!</p>
]]></content:encoded>
			<wfw:commentRss>http://haleyai.com/wordpress/2008/04/18/a-common-upper-ontology-for-advanced-placement-tests/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

