SIOC goes OWL-DL

I just sent this to sioc-dev, but I guess it is worth a larger announcement:

We just made some changes to the SIOC Core ontology and to the related modules:

- Added OWL-DL compliance statements for SIOC Core and the Types / Access / Services modules
- Edited owl:disjointWith statements for some classes of SIOC Core
- Removed domain of sioc:note
- Removed domain of sioc:has_owner and range of sioc:owner_of
- Defined sioc:account_of as inverse property of foaf:holdsAccount
- Defined sioc:avatar as a subproperty of foaf:depiction

So, SIOC is now OWL-DL!
This change was motivated by the current SWANSIOC integration project that will be introduced during the upcoming ISWC tutorial on Semantic Web for Health Care and Life Sciences.

The SIOC Core Ontology Specification has been updated according to the changes.

The other good news regarding SIOC is that Yahoo! SearchMonkey now supports (and recommends!) it in its developer documentation. Moreover, in case you have not read it yet, John published the Tales from the SIOC-o-sphere #8 about two weeks ago.

More generally, if you want to join the SIOC community by developing new applications or APIs, or if you need some help implementing SIOC in your existing tools, feel free to join #sioc on irc.freenode.net or ask on the sioc-dev mailing list.

Where are all the Semantic Web presentations?

A follow-up to my previous LODr introduction post and, as you might guess from the title, one more way to show the value of RDF-based applications in general, or more precisely, of open-RDF-based and LOD-compliant ones.

What I want to stress in this post is how such applications can become components of a general infrastructure (the Semantic Web itself) that will provide new services to end-users. In particular, LODr lets users interlink popular Web 2.0 content with Semantic Web resources, and such links can then be used for data discovery. For instance, the following query retrieves all Slideshare presentations related to the Semantic Web, i.e. linked to a resource that is itself linked to the Semantic Web category in DBpedia. This query involves several vocabularies: SIOC (to retrieve the item), FOAF (its author), the Tag Ontology (its tags), MOAT (tag meanings), plus a DBpedia URI as an entry point to find related topics.

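# Namespace prefixes for the vocabularies used below
PREFIX sioc: <http://rdfs.org/sioc/ns#>
PREFIX dct:  <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX tags: <http://www.holygoat.co.uk/owl/redwood/0.1/tags/>
PREFIX moat: <http://moat-project.org/ns#>

SELECT DISTINCT ?item ?author ?date ?tag ?meaning
WHERE {
  # The item itself: a Slideshare item with a creation date and an author
  ?item a sioc:Item ;
    dct:created ?date ;
    sioc:has_space <http://slideshare.net> ;
    foaf:maker ?author .
  # The tagging that links the item to a tag and to the meaning of that tag
  [] a tags:RestrictedTagging ;
    tags:taggedResource ?item ;
    tags:taggedWithTag [
      tags:name ?tag
    ] ;
    moat:tagMeaning ?meaning .
  # Keep only meanings linked (by any property) to the DBpedia category
  ?meaning ?p <http://dbpedia.org/resource/Category:Semantic_Web> .
}
ORDER BY DESC(?date)
LIMIT 5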

You can browse the answer here, formatted in HTML.

Of course, the URIs that you can use in LODr and with MOAT in general are not restricted to DBpedia ones. You can use URIs identifying some of your friends, conferences you attended, etc. Consequently, those URIs can be used in query patterns, together with other interlinked URIs. For instance, the following query retrieves all Flickr pictures linked to an event that happened in Tenerife; in that case it will match the ESWC2008 URI, going through some Geonames data:

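# Namespace prefixes for the vocabularies used below
PREFIX sioc: <http://rdfs.org/sioc/ns#>
PREFIX dct:  <http://purl.org/dc/terms/>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX tags: <http://www.holygoat.co.uk/owl/redwood/0.1/tags/>
PREFIX moat: <http://moat-project.org/ns#>

SELECT DISTINCT ?item ?author ?date ?tag ?meaning
WHERE {
  # The item itself: a Flickr item with a creation date and an author
  ?item a sioc:Item ;
    dct:created ?date ;
    sioc:has_space <http://flickr.com> ;
    foaf:maker ?author .
  # The tagging that links the item to a tag and to the meaning of that tag
  [] a tags:RestrictedTagging ;
    tags:taggedResource ?item ;
    tags:taggedWithTag [
      tags:name ?tag
    ] ;
    moat:tagMeaning ?meaning .
  # Keep only meanings located near the given Geonames resource
  ?meaning foaf:based_near <http://sws.geonames.org/2522437/> .
}
ORDER BY DESC(?date)
LIMIT 5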

Answer

Finally, while all those queries involve the lodr.info endpoint, each LODr instance comes with its own triplestore (and related endpoint), so that one can add some more RDF to it for advanced mash-ups. And as it also provides RDFa and semantic sitemap support, Semantic Web crawlers and indexes such as SWSE or Sindice can also consume it and then deliver it when you look for a particular URI.
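
As a rough illustration of how a client could send one of the queries above to such an endpoint, here is a minimal Python sketch that only builds the HTTP request, following the usual SPARQL Protocol convention of passing the query in a `query` parameter. The endpoint URL is a placeholder and the helper name `sparql_get_request` is hypothetical; nothing here is taken from the LODr code base.

```python
from urllib.parse import urlencode
from urllib.request import Request


def sparql_get_request(endpoint, query):
    """Build (but do not send) a GET request carrying a SPARQL query.

    `endpoint` is assumed to be the URL of a SPARQL endpoint; the value
    used in the example below is a placeholder, not a real LODr instance.
    """
    # The query text goes in the URL-encoded "query" parameter
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for SPARQL results in JSON; the server may offer other formats
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = sparql_get_request(
    "http://example.org/sparql",  # placeholder endpoint
    "SELECT * WHERE { ?s ?p ?o } LIMIT 1",
)
print(request.full_url)
```

Passing the request to `urllib.request.urlopen` would actually send it; whether a given endpoint honours the `Accept` header is something to check against that endpoint.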

Say hello to lodr.info

In one of my recent posts, I mentioned LODr, a semantic-tagging application based on MOAT. While I started it a few months ago, it’s finally online now. I put the code in svn last Friday and tweeted about it, but did not make any official announcement yet, so here it is. I certainly should have released it earlier, but as the source code involves lots of classes, I wanted to be sure of the architecture.

So, what is it about?

LODr aims to apply the MOAT principles (in a few words: link your tags to concept URIs such as people URIs, Musicbrainz artists or DBpedia resources, share those relationships in a community, and then tag content with those URIs) to existing Web 2.0 content. So you can “re-tag” your existing Flickr pics, Slideshare presentations, etc., using those principles and make your social data enter the LOD cloud. I think the word “existing” is important here, as LODr lets you keep your Web 2.0 habits by using your favourite tools, but provides a separate service to semantically enrich that content. I don’t want to go into too many details here, but in brief, some interesting points regarding the application are:

So, you can simply download the code from the website and install it. For those who just want to have a look, you can check my LODr instance (while you won’t be able to edit it, you can check the display interfaces). As there might be some bugs and I’m still adding features, please consider using the SVN version instead of the tgz. And then, enjoy the power of Linked Data for your Web 2.0 content ;-)

Lightweight subPropertyOf / subClassOf inference with ARC2

As a regular user of the ARC2 framework, I really enjoy the way it eases the development of Semantic Web applications. In particular, its SPARQL capabilities offer an intuitive way to write / get / update graphs and triples in the backend triple-store.

Unfortunately, while ARC2 provides resource consolidation based on IFPs or using some pre-defined properties, it does not feature lightweight RDFS entailment based on subPropertyOf and subClassOf subsumption. As Benjamin pointed out on IRC, such inference can be done using a combination of ARC2 triggers and the SPARQL INSERT / CONSTRUCT clauses. I just created two triggers that do the job, providing lightweight inferencing for subproperties and subclasses in ARC2, using the SPARQL query that follows (in that case, regarding the properties):

INSERT INTO <$graph> CONSTRUCT {
  ?s ?top ?o .
} WHERE {
  GRAPH <$graph> {
    ?s ?prop ?o .
  }
  ?prop rdfs:subPropertyOf ?top .
}

The triggers are, in my case, launched after each LOAD action, but can also be used in combination with the INSERT clause, by simply editing the store parameters:

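// Extend the base ARC2 configuration with the store triggers
$config = $arc_config + array(
  'store_triggers' => array(
    'insert' => array('graphTimestamp'),
    // Run the subproperty inference after each LOAD
    'load' => array('subPropertyInference'),
  ),
);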

As you can see, the query is limited to a particular $graph (both for selecting and inserting). As this $graph variable corresponds to the URI of the graph that has just been loaded in the store, it avoids recomputing the triples on the whole store each time a new graph is added. Moreover, the new statements also belong to the original graph. You might want to change this according to your inference policy, but I think that for such lightweight inference patterns (that do not involve other graphs), it makes sense to store the additional statements in the original graph.
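
To make the pattern concrete, here is a minimal Python sketch of what the query above computes, run against an in-memory dictionary instead of the ARC2 store. The function name `infer_superproperties` and the `store` layout (graph URI mapped to a set of triples) are assumptions made for illustration, not part of ARC2 or LODr. Like the query, it does a single pass and does not compute a transitive closure.

```python
RDFS_SUBPROPERTY_OF = "http://www.w3.org/2000/01/rdf-schema#subPropertyOf"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
FOAF_NAME = "http://xmlns.com/foaf/0.1/name"


def infer_superproperties(store, graph):
    """Add (s, top, o) to `graph` for every (s, prop, o) it contains
    where some graph in the store states prop rdfs:subPropertyOf top.

    `store` is a dict mapping graph URIs to sets of (s, p, o) tuples
    (a hypothetical layout). Returns the set of newly added triples.
    """
    # subPropertyOf statements are looked up across all graphs, since
    # that part of the query sits outside the GRAPH clause
    super_of = {}
    for triples in store.values():
        for s, p, o in triples:
            if p == RDFS_SUBPROPERTY_OF:
                super_of.setdefault(s, set()).add(o)

    # Only the triples of the freshly loaded graph are expanded
    inferred = {
        (s, top, o)
        for (s, p, o) in store[graph]
        for top in super_of.get(p, ())
    }
    new_triples = inferred - store[graph]
    # The inferred statements are stored in the original graph
    store[graph] |= new_triples
    return new_triples


store = {
    "urn:graph:foaf": {(FOAF_NAME, RDFS_SUBPROPERTY_OF, RDFS_LABEL)},
    "urn:graph:data": {("urn:person:1", FOAF_NAME, "Alice")},
}
added = infer_superproperties(store, "urn:graph:data")
print(added)
```

Running it a second time on the same graph adds nothing, which mirrors the fact that re-inserting existing triples does not change the store.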

Regarding the inference pattern itself, instead of manually defining the properties that must be taken into account, this query retrieves all the properties that have been defined as subproperties of any others to automatically infer the ‘top property’ relationship. While this is certainly better than manually adding some property / subproperty lists, especially for maintenance purposes, it requires that the underlying models (e.g. FOAF if you want to deal with rdfs:label / foaf:name subsumption) be loaded in the triple store, which you can do when setting it up, e.g.:

$default_vocabs = array(
  'http://xmlns.com/foaf/spec/index.rdf',
  'http://www.geonames.org/ontology/ontology_v2.0_Full.rdf',
);
// Setup the store
$this->store->setUp();
// Load ontologies so that we can infer subproperties later
foreach($default_vocabs as $vocab) {
  $graph = LODrTools::get_datagraph($vocab);
  $this->store->query("LOAD <$vocab> INTO <$graph>");
}

Then, you can benefit from that lightweight inference engine when querying data from your store, as for instance a query related to “?s rdfs:label ?o” will retrieve “?s foaf:name ?o” statements.

Finally, one important trick to consider when LOAD-ing data in ARC2 is that when using LOAD <URI> on dereferenceable URIs, the graph name will be the URI itself, which is confusing, especially if you want to define statements about the graph (i.e. provenance, creation date, as in this trigger). A simple solution is to define an arbitrary GRAPH URI based on the resource URI itself, and then run LOAD <URI> INTO <GRAPH> as done in the previous snippet of code, which solves the problem and lets you assign statements to the graph, and not to the URI itself.
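
Here is a minimal Python sketch of that idea. The actual LODrTools::get_datagraph implementation is not shown in this post, so the naming scheme below (a hypothetical base URI followed by the percent-encoded resource URI) is only one possible choice, and the helper names are made up for the example.

```python
from urllib.parse import quote


def datagraph_uri(resource_uri, base="http://example.org/graph/"):
    """Derive a graph URI that is distinct from the resource URI.

    `base` is a placeholder; any URI prefix you control would do.
    """
    # Percent-encode everything so the resource URI becomes a single
    # path segment under the base URI
    return base + quote(resource_uri, safe="")


def load_query(resource_uri, base="http://example.org/graph/"):
    """Build the LOAD <URI> INTO <GRAPH> query string for a resource."""
    graph = datagraph_uri(resource_uri, base)
    return "LOAD <%s> INTO <%s>" % (resource_uri, graph)


vocab = "http://xmlns.com/foaf/spec/index.rdf"
print(load_query(vocab))
```

Because the mapping is deterministic, the same resource always ends up in the same graph, so reloading it later targets the graph that already carries its provenance statements.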

Links to the triggers:

Sunny Side of the Street

It has been a long time since I last posted here, but I must admit I spent the last weeks (maybe the last month) away from this blog.

Indeed, I moved from Paris to Galway at the end of August, so I was packing / traveling and then started my new job. As you can guess when reading Galway in a semweb-related blog, I’m now at DERI, where I will keep working - mainly - on the Social Semantic Web, i.e. federating distributed online communities, augmented tagging, linked data, etc.

Some interesting (well, I hope) stuff will come soon, especially the first public release of LODr, unfortunately a bit later than I expected because of this move and because I wanted to clean up some parts of the code before publishing it. You may have already heard about it either on my ESWC poster about MOAT or by reading the SDoW program, but in a few words, this tool uses the MOAT principles to let you re-tag your existing Web 2.0 content with URIs, i.e. loading all your social data into the Semantic Web and interlinking it with the (freshly-updated) LOD cloud. Thus, queries like “Please give me all the bibsonomy items linked to a topic related to dbpedia:Semantic_Web” or “Can I see Flickr pictures linked to one of my foaf:knows connections” will have an answer!

Regarding SDoW, Yves Raimond (you know, the BBC guy ;) ) and I will present a paper about music recommendations and the Semantic Web, extending some ideas of my previous talk about the topic (Yves will also be a guest speaker at the Web of Data Practitioners Days and was recently interviewed about those relationships between music and the SW). There is a total of 12 accepted papers (long, short and demos) from 24 submissions for this full-day workshop. We were really happy to get that many submissions, and we expect a really interesting event given the different topics covered by the papers. Harry Halpin will give a keynote there, introducing the upcoming Social Web Incubator Group, and the second speaker is to be confirmed. BTW, if you are considering attending, the ISWC early registration deadline is tonight.

Finally, after spending last week at RWSS, I’ll be in Oxford next week for the first VoCamp, which should be a really productive event according to the schedule and topic ideas. Let’s hope that new vocabularies will emerge, but I have no doubt about this.

BBC, LOD and the Music Ontology

Relaying this great news from Yves’ blog (especially for planetrdf.com / planete.websemantique.org readers):

Just a couple of minutes ago on the Music Ontology mailing list, Nicholas Humfrey from the BBC announced the availability of linked data on BBC Music.

Integrating laconi.ca to SMOB

Gautier (aka lespetitescases) just hacked his SMOB client to make it post on identi.ca as well as on twitter. He’s using the new laconica API combined with the same method that SMOB uses at the moment to post on twitter (CURL + HTTP-Auth). As this feature was on the TODO list and some people were asking for it (a thread was also started by John recently on both the SIOC and identi.ca mailing lists), that was really great news!

I just added those changes to the SMOB client (get it or update from svn), so that it can now be used to post on identi.ca! Moreover, I added the ability to post not only on identi.ca but on any laconi.ca-based microblogging platform. You’ll see a new parameter in the config_dist.php file that lets you set up a list of servers with dedicated user / pass, as follows. Then, when publishing an update, you’ll be able to choose those servers as well as the previous ones (including twitter), and enjoy multiple-posting:

multismob.png

Then it’s browsable everywhere, but you still own it locally as RDF data so that you can re-use it later, etc.

image-4.png

Once again, merci Gautier!

Social Music meets the Semantic Web

Not much time to blog at the moment, as I’m mainly focused on writing my PhD thesis (and so I wish best of luck - and motivation - to those in the same situation!)

Yet, I gave a talk at a Center For Digital Music seminar last week, invited there by Yves Raimond. The goal was to showcase how the usual suspects of the Social Semantic Web (FOAF / SIOC / MOAT / LOD) can be used in the context of music-related services and can enable new approaches to music recommendation. If you’re interested in music-related computing (not only from a SW point of view), you may also want to browse the lab homepage and the various projects they host (such as the Giant Instrument, 3D-sound modeling, automatic mixing or human-synth beat-box).

SDoW2008 deadline extended

The SDoW deadline has been extended to the 4th of August, so that you have two additional weeks to submit your paper, demo or poster.

The 1st Social Data on the Web workshop (SDoW2008), co-located with the 7th International Semantic Web Conference (ISWC2008), aims to bring together researchers, developers and practitioners involved in semantically enhancing social media websites, as well as academics researching more formal aspects of these interactions between the Semantic Web and Social Media.

Complete details about the workshop can be found on its website. Also note that the poster and demo submissions can now be up to 3 pages in LNCS format, while the limit was only 2 pages at the beginning.

ESWC2008 slides

I finally uploaded the slides of the various talks I gave at last ES(W)C in Tenerife:

As you can see, it was a busy but - once again - really valuable week. Lots of quality papers, especially, from my point of view, the ones about querying (esp. distributed approaches like DARQ - more details on Orri Erling’s blog), LOD-related (Semantic Sitemaps) and the OntoGame one. Readers of this blog should also be interested in xOperator, an approach combining LOD, social networking and instant messaging.

Interesting things were also presented during the workshops, such as SWOOKI (a Semantic P2P wiki), ACEWiki (a wiki using controlled natural language to model both ontology and instances from the wiki, with reasoning capabilities using Pellet) and the use of semantic wikis for mathematics, which was particularly interesting from the use-case point of view, as for Flyspeck. There were also nice (and sometimes fun) papers and demos in the SFSW workshop (congrats to Benjamin!) and in the demo session, such as RDBToOnto, which mines ontologies (and instances) from relational databases.

Another interesting development for LOD-ers is the first steps of voiD, a vocabulary to describe datasets (which could be efficiently combined with the previously mentioned work on distributed SPARQL: send your query on the Web, let the system find datasources, query and merge).

Finally, as in Beijing, it was a great opportunity to meet people I knew only online. Hope to see you in Karlsruhe!
