Friday, May 11, 2012

Schema.org markup for external lists


The world is too rich, complex and interesting for a single schema to describe fully on its own. With schema.org we aim to find a balance, by providing a core schema that covers lots of situations, alongside extension mechanisms for extra detail. There are many situations where the use of existing controlled vocabularies, standards and datasets would improve schema.org markup. This is the role of the schema.org "external enumerations" mechanism.

We introduce "external enumerations" with a simple example - countries - and encourage implementors to join the schema.org community in W3C's 'Web Schemas' group where the full details are being discussed.

Each schema.org type (such as Person, PostalAddress) is associated with a set of properties, such as
"nationality", "addressCountry". In turn, each property has one or more expected types; in this case, both the "nationality" of a Person, and the "addressCountry" of a PostalAddress expect to have a Country value. Rather than adding large lists of specific countries to schema.org, instead we encourage the use of external lists.  We will publish a set of well-known authority lists, linked to the types and properties they are used with. To get started, we take simple Wikipedia links as an example of such an authority. Other more specialist examples (such as IPTC codes) will follow.

Taking our existing Movie example in Microdata, let's add nationality details for one of the actors. To do this, we simply add a link:


<div itemscope itemtype="http://schema.org/Movie">
 <h1 itemprop="name">Pirates of the Carribean: On Stranger Tides (2011)</h1>
 <span itemprop="description">Jack Sparrow and Barbossa embark on[...]</span>
 <div itemprop="actor" itemscope itemtype="http://schema.org/Person">
  <span itemprop="name">Johnny Depp</span>
  <link itemprop="nationality" href="http://en.wikipedia.org/wiki/United_States"/>
 </div>
</div> 
 
Here we use  'http://en.wikipedia.org/wiki/United_States' to stand for the specific country. Other authorities also publish useful structured data about countries and have stable URLs that could be used. For example, we could use the UN FAO's GeoPolitical Ontology, and their URL for the USA. From a schema.org perspective, we do not take account of any types and properties defined by these external sites, since it is important to support a variety of quite different authority lists, who often have different ways of modeling things. Each external authority essentially supplies a set of URI/URL item identifiers that can be dropped into schema.org markup.

We've shown here the use of Wikipedia links for identifying members of the Country type. Take a look at the detailed document for discussion on how to use this with Microdata's 'itemid' attribute, if you want to describe the Country (or other object) in further detail. The W3C wiki also gives other examples, and shows how the markup would look in RDFa Lite

While there are more details to work out as we start to apply this idea across schema.org, we wanted to share this initial example.  The basic idea is very simple: everywhere in schema.org where external lists will help, we will need to have a specific schema.org type (like Country), for which the external authority supplies identifiers. In some cases, we will have to add new types to support this. Beyond the basics presented here, there are various technical details of syntax, discussion of exactly which authorities and URI identifiers to use, and so on. We welcome suggestions (here or via the Web Schemas group) for existing enumerations that would be useful additions, and feedback on the general approach.

Monday, December 12, 2011

Building a Web of Objects at Yahoo!

We'd like to share a link to a recent set of presentation slides by Peter Mika, Ralph Rabbat, Philip Bohannon of Yahoo! The talk by Ralph Rabbat was part of the Industry Track during October's International Semantic Web Conference in Bonn. Slides (in .pptx format) are linked from the conference site and describe some work they're doing that relates to schema.org. This presentation covers some aspect of Yahoo's work in semantic web both from engineering and research aspect. Please, feel free to connect directly with the authors if you have any comments/questions. The other Industry Track presentations, and materials from the wider conference; are worth checking out too.

Friday, November 11, 2011

Using RDFa 1.1 Lite with Schema.org

As a result of our continued discussions and collaborations with publishers, implementers and standards-makers, we're pleased to give advance notice of a new way of adopting schema.org's structured data vocabulary. W3C's RDF Web Applications group are right now putting the finishing touches to the latest version of the RDFa standard. This work opens up new possibilities also for developers who intend to work with schema.org data using RDF-based tools and Linked Data, and defines a simplified publisher-friendly 'Lite' view of RDFa.

Early adopters can follow the in-progress drafts (rdfa-core, rdfa-lite) while the W3C group work through the remaining details. We hope that our support for 'RDFa Lite', alongside Microdata, will allow publishers to focus more on what they want to say with their data, rather than on the details of its specific encoding as markup. We also want to take a moment to thank the members of the RDFa community for taking on board our feedback; making standards is hard work, and we believe this latest version of RDFa is a major contribution to the Web of structured data.

Monday, November 7, 2011

Schema.org support for job postings

We’re happy to announce that schema.org, working together with the United States Office of Science and Technology Policy, has added support for marking up job postings on the web.

Leveraging this markup, the US Veterans administration has created a search widget that is accessible across a growing number of federal websites including nrd.gov and whitehouse.gov, to find job listings from veteran-committed employers.

We feel privileged to have played a role in enabling this. More details can be found in the US CTO, Aneesh Chopra’s post on the White House Blog.

R.V. Guha
Google

Friday, November 4, 2011

Yandex now supports schema.org markup

One of the primary goals in creating schema.org was to simplify structured data markup requirements for content creators across search engines, which we hope will drive greater adoption across the Web. In that vein, we're very happy that Yandex has announced its support for schema.org. In addition to being a major consumer of schema.org markup, Yandex will be increasingly contributing to discussions about the evolution of the schema on the W3C-hosted Web Schemas group, and they are also investigating translation of the schema.org website to local languages. It's great to have growing support for the schema.org markup around the world!

Wednesday, October 5, 2011

W3C "Web Schemas" group is our new public feedback forum

Our Discussion Group will be moving to the W3C forum, which will use the public-vocabs@w3.org mailing list.
The Schema.org Google Group will not be completely shut down, however we do encourage all discussion of vocabulary, schema and deployment practicalities to move to public-vocabs@w3.org mailing list. These are also linked from the Documents page.

This comes out of a plan for closer collaboration between Schema.org and other related efforts. The Web standards organization W3C has created two related task forces of its Semantic Web Interest Group that are relevant for Schema.org. One is devoted to the technicalities of syntax and other one, "Web Schemas" is for vocabulary discussions. We have decided to adopt this forum as a new home for Schema.org feedback, since it provides a natural connection point to related efforts from other groups and communities.

The W3C group is open to all, and will have a Wiki and issue tracker to help organize feedback; not only for Schema.org but for other Web Schemas who are interested to collaborate. We hope this will give rise to collaborations, mappings and shared modeling styles.

Looking forward to greater opportunities for collaboration.

Wednesday, September 28, 2011

Schema.org Workshop Wrap-Up

On September 21st, we held the first ever schema.org workshop in Mountain View, California. There were 75 attendees from web markup & standards groups (including W3C, Microformats, RDFa, Creative Commons), as well as other search engines (Ask, Yandex, Baidu), and top content publishers and tools vendors (including NYTimes, Disney, Foursquare, Shopping.com, OpenTable, Drupal, Sharepoint). The objective of this workshop was to evolve the schema.org specification, solicit feedback from the standards community, and build momentum for web publishers.

We felt the event was a success -- there was lots of enthusiasm around creating extensions to the vocabulary as well as great feedback on how to evolve the syntax. We will be working through this feedback in our working group over the coming months, so stay tuned for more developments, and thanks in advance for all your feedback.

At the workshop, we also announced a couple new schema.org developments. First, we announced that we have adopted IPTCs rNews specification into schema.org (see earlier blog post). In addition to being a fantastic vocabulary extension for news articles, we believe this a great template for future industry-specific vocabulary extensions, and there are a couple ongoing discussions for new extensions in education (working with LRMI) and for job postings (working with Whitehouse CTO). We expect continued interest in industry specific extensions, and to facilitate this discussion we also announced a new W3C-hosted venue for further discussion and engagement with the community: http://www.w3.org/QA/2011/09/proposing_two_new_sw_interest.html.

There have been a few inquiries about the next workshop. We’re planning to do another workshop in the future, but don’t have any concrete plans at this point. When we have something to announce, we’ll do it here on this blog.

Slides from the event can be found here:

http://schemaorg.cloudapp.net/2011Workshop/

Additional Coverage:

http://semanticweb.com/schema-org-workshop-a-path-forward_b23387

http://twitter.com/#!/search/schemaorg

http://www.w3.org/2010/02/rdfa/meetings/2011-09-22#schema__2e_org_workshop


Mike Van Snellenberg

Microsoft