RDF-ization: Is That What I’ve Been Up To?
Recently, on his blog, Kingsley Idehen wrote:
RDF-ization is a term used by the Semantic Web community to describe the process of generating RDF from non RDF Data Sources such as (X)HTML, Weblogs, Shared Bookmark Collections, Photo Galleries, Calendars, Contact Managers, Feed Subscriptions, Wikis, and other information resource collections.
Although Idehen identifies a number of data sources, he does not explicitly identify two data sources I’ve been spending a fair amount of time with over the past few years:
- One source of data is that generated by scientific instruments. With various colleagues, the semantic framework I’ve built around this data source allows for RDF-ization of scientific data from semi-structured ASCII to XML (specifically ESML) to RDF via GRDDL. (Please see the illustration.) In principle, it should be possible to further transform the RDF representation into OWL thus resulting in what I’ve referred to elsewhere as an informal ontology. (According to Morville as well as Shadbolt et al., the RDF-ization of the data sources Idehen identifies result in folksonomies, rather than informal ontologies.) Again with various colleagues, I’ve also made use of RDF to annotate features inherent in the scientific data via XML Pointer Language (XPointer).

- Even more recently, with members of my Network Operations team at York University, I’ve been working with a relational database as a source of data on the topology of IP networks. (Please see the illustration.)

Of course, whether the motivation is personal/social-networking or scientific/IT related, the attention to RDF-ization is win-win for all stakeholders. Why? Anything that accelerates the RDF-ization of non-RDF data sources brings us that much closer to realizing the true value of the Semantic Web.
Introducing Jott for BlackBerry
I just learned about Jott for BlackBerry:
We have a lot of happy Blackberry customers at Jott, and Jott for BlackBerry is the ultimate BlackBerry download. It is a simple, but very powerful tool that will let you reply to emails on your BlackBerry just using your voice – either speaking directly into your BlackBerry, or while wearing a Bluetooth headset. It is seamlessly integrated into the email application you already use, and is a huge leap forward for BlackBerry lovers in three ways: first, it is 3-5 times faster than ‘thumbing’ text; two, you won’t be known for sending just terse replies because you don’t want to thumb type out a normal email message; and three, you will be safer because you won’t have to take your eyes off the road.*
(*Jott does NOT encourage messaging while driving).Jott for Blackberry makes an already awesome device even better.
The following is the body of a reply I just created:
Thanks for sharing this interesting service with me. It's definitely something that I'm interested in investigating and it's my intention to follow up very very soon. Thanks very much. Bye for now. Sent with my voice via Jott for Blackberry ~ http://jott.com/bb To listen: http://www.jott.com/show.aspx?id=e4eb3151-9007-448c-bd73-7de70ecc4766
In this example, the transcription quality was excellent. Note that the recipient is advised that the response was Jott’ed, and has the option of listening to the original audio recording. Nice!
Although I’m only at the testing stage, I expect to make extensive use of Jott for BlackBerry!
Note to Jott and Google: Please enable Jott for BlackBerry in the GMail for BlackBerry application.
Annotation Modeling: To Appear in Comp & Geosci
What a difference a day makes!
Yesterday I learned that my paper on semantic platforms was rejected.
Today, however, the news was better as a manuscript on annotation modeling was
accepted for publication.
It’s been a long road for this paper:
- Its conception dates back to a presentation I gave at the 2006 Fall Meeting of the AGU.
- The paper was submitted as a contribution for Computers
& Geosciences Special Issue on Geoscience Knowledge Representation in
Cyberinfrastructure. - The initial reviews called for major revisions. With tremendous support from my co-authors, the paper was significantly revised, and re-submitted.
- After some additional interactions, I just learned that the paper was finally accepted for publication.
The abstract of the paper is as follows:
Annotation Modeling with Formal Ontologies:
Implications for Informal Ontologies
L. I. Lumb[1], J. R. Freemantle[2], J. I. Lederman[2] & K. D.
Aldridge[2]
[1] Computing and Network Services, York University, 4700 Keele Street,
Toronto, Ontario, M3J 1P3, Canada
[2] Earth & Space Science and Engineering, York University, 4700 Keele
Street, Toronto, Ontario, M3J 1P3, Canada
Knowledge representation is increasingly recognized as an important component of any cyberinfrastructure (CI). In order to expediently address scientific needs, geoscientists continue to leverage the standards and implementations emerging from the World Wide Web Consortium’s (W3C) Semantic Web effort. In an ongoing investigation, previous efforts have been aimed towards the development of a semantic framework for the Global Geodynamics Project (GGP). In contrast to other efforts, the approach taken has emphasized the development of informal ontologies, i.e., ontologies that are derived from the successive extraction of Resource Description Format (RDF) representations from eXtensible Markup Language (XML), and then Web Ontology Language (OWL) from RDF. To better understand the challenges and opportunities for incorporating annotations into the emerging semantic framework, the present effort focuses on knowledge-representation modeling involving formal ontologies. Although OWL’s internal mechanism for annotation is constrained to ensure computational completeness and decidability, externally originating annotations based on the XML Pointer Language (XPointer) can easily violate these constraints. Thus the effort of modeling with formal ontologies allows for recommendations applicable to the case of incorporating annotations into informal ontologies.
I expect the whole paper will be made available in the not-too-distant future …
Evolving Semantic Frameworks into Platforms: Unpublished ms.
I learned yesterday that the manuscript I submitted to HPCS 2008 was not accepted ![]()
It may take my co-authors and I some time before this manuscript is revised and re-submitted.
This anticipated re-submission latency, along with the fact that we believe the content needs to be shared in a timely fashion, provides the motivation for sharing the manuscript online.
To whet your appetite, the abstract is as follows:
Evolving a Semantic Framework into a Network-Enabled Semantic Platform
A data-oriented semantic framework has been developed previously for a project involving a network of globally distributed scientific instruments. Through the use of this framework, the semantic expressivity and richness of the project’s ASCII data is systematically enhanced as it is successively represented in XML (eXtensible Markup Language), RDF (Resource Description Formal) and finally as an informal ontology in OWL (Web Ontology Language). In addition to this representational transformation, there is a corresponding transformation from data into information into knowledge. Because this framework is broadly applicable to ASCII and binary data of any origin, it is appropriate to develop a network-enabled semantic platform that identifies the enabling semantic components and interfaces that already exist, as well as the key gaps that need to be addressed to completely implement the platform. After briefly reviewing the semantic framework, a J2EE (Java 2 Enterprise Edition) based implementation for a network-enabled semantic platform is provided. And although the platform is in principle usable, ongoing adoption suggests that strategies aimed at processing XML via parallel I/O techniques are likely an increasingly pressing requirement.
Browser Wars Revisited: Safari vs. Firefox?
Seemingly not to be out-done by all the buzz surrounding Firefox 3, Apple today (March 18, 2008) released version 3.1 of its Safari Web browser.
Apparently, we’ll love Safari because:
- It’s fast – Up to 3x Firefox 2 on page loads and 4.5x on JavaScript execution. And although that’s impressive, performance is definitely coming across as one of Firefox 3′s core competencies. It’d be interesting to run the same tests with Safari 3.1 and even Firefox 3 Beta 4.
- The UI – Of course. However, this is another area where Firefox 3 has made significant headway. Even on a Mac, Firefox 3′s UI is also elegant and clean. (For an amusing take on the Apple UI paradigm, have a look at this Eric Burke cartoon. I’m not sure how Burke would represent the Mozilla UI … However, one thing’s for sure, it’s become a lot more elegant and cleaner over the years.)
- Find – The Firefox 3 implementation looks remarkably like Safari’s.
- Resizable text areas – Excellent. Not sure if Firefox 3 has this.
Safari 3.1 also presents a twofold irony with respect to Web standards:
- You need to do a little digging (page 8 of the Safari Product Overview) to determine what is meant by Web standards support. And once you do, you’ll learn that it relates to CSS, HTML 5 and SVG. Of these, “HTML 5 offline storage support” has the potential to be most interesting, as Google is analogously demonstrating with Google Gears. So, it’s ironic you need to dig for something that has such value.
- In it’s support of HTML 5, we have a commercial entity (Apple) leading the way in terms of implementing standards. This is refreshing in general, and in particular in Apple’s case, as traditional expectations would have the Open Source implementations (e.g., Firefox) ahead in this regard. To quote Alanis Morissette: “Isn’t it ironic… don’t you think?”
When you factor in support for Windows, and apparently frequent releases, it’s no wonder that Safari is gaining momentum at the expense of Microsoft Internet Explorer (IE) and Firefox.
And not that I’ve been following developments with IE, but one has to wonder, if we are to re-visit the IE vs. Netscape browser wars of yesteryear, might the combatants this time be Apple Safari 3.x and Mozilla Firefox 3.x?
One can only hope!
Parsing XML: Commercial Interest
Over the past few months, a topic I’ve become quite interested in is parsing XML. And more specifically, parsing XML in parallel.
Although I won’t take this opportunity to expound in any detail on what I’ve been up to, I did want to state that this topic is receiving interest from significant industry players. For example, here are two data points:
Parsing of XML documents has been recognized as a performance bottleneck when processing XML. One cost-effective way to improve parsing performance is to use parallel algorithms and leverage the use of multi-core processors. Parallel parsing for XML Document Object Model (DOM) has been proposed, but the existing schemes do not scale up well with the number of processors. Further, there is little discussion of parallel parsing methods for other parsing models. The question is: how can we improve parallel parsing for DOM and other XML parsing models, when multi-core processors are available?
Intel Corp. released a new software product suite that is designed to enhance the performance of XML in service-oriented architecture (SOA) environments, or other environments where XML handling needs optimization. Intel XML Software Suite 1.0, which was announced earlier this month, provides libraries to help accelerate XSLT, XPath, XML schemas and XML parsing. XML performance was found to be twice that of open source solutions when Intel tested its product …
As someone with a vested interest in XML, I regard data points such as these as very positive overall.

