I’ve got a Federated Bridge to Sell You (A Defense of the Warehouse) - semanticweb.com
This is not at all to say that semantically linking datasets isn’t valuable. On the contrary! I believe that giving old, weather-beaten databases a coat of semantic paint is awesomely valuable. It makes creating ETL pipelines that bring together data from all kinds of locations a breeze compared to traditional, relationally-oriented ETL pipelines. It’s hardly even fair to compare the two approaches, except insofar as the maturity of the traditional technologies is concerned, and I’ll try to pick up on specific reasons for this belief in future posts. In fact, I see semantics as enabling on-demand datamarts in ways that traditional data integration technologies simply have failed to do.
Extracting Enterprise Vocabularies Using Linked Open Data
A common vocabulary is vital to smooth business operation, yet codifying and maintaining an enterprise vocabulary is an arduous, manual task. We describe a process to automatically extract a domain-specific vocabulary (terms and types) from unstructured data in the enterprise, guided by term definitions in Linked Open Data (LOD). We validate our techniques by applying them to the IT (Information Technology) domain, taking 58 Gartner analyst reports and using two specific LOD sources: DBpedia and Freebase. We show initial findings that address the generalizability of these techniques for vocabulary extraction in new domains, such as the energy industry.
IBM Watson Research Center