Semantic XBRL
Semantic XBRL is a repository that contains semantic data generated from the data submitted by participants in the XBRL Voluntary Program EDGAR promoted by the U.S Securities and Exchange Commission (SEC).
Semantic data is generated by translating XBRL XML to RDF following the XML Semantics Reuse Methodology, which is implemented by the ReDeFer tools. First, there is the XSD2OWL mapping, which generates XBRL OWL ontologies from the XBRL XML Schemas. This mapping is complemented with the XML2RDF one. This one maps XML to RDF step, based on previous mappings from the XML Schemas the XML is based on to OWL ontologies.
Status
- January 2009: 1,03 million triples have been generated from 489 XBRL filings.
- April 2009: 1,34 million triples have been generated from 612 XBRL filings.
- March 2010: 2,98 million triples have been generated from 1201 XBRL filings (dump).
The resulting RDF partially mimics the original XBRL structure from which it is automatically generated. Some small changes are introduced, mainly a new FactType class for facts. Fig. 1 shows a diagram of the core concepts of the resulting SemanticXBRL model. The resulting data is semantically enriched so it is easier to integrate different filings and to cross query filings for different dates, companies, accounting principles,...
Some example queries.

