Material Detail

A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles

A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles

This video was recorded at 11th Extended Semantic Web Conference (ESWC), Crete 2014. The increasing adoption of Linked Data principles has led to an abundance of datasets on the Web. However, take-up and reuse is hindered by the lack of descriptive information about the nature of the data, such as their topic coverage, dynamics or evolution. To address this issue, we propose an approach for creating linked dataset proles. A prole consists of structured dataset metadata describing topics and their relevance. Proles are generated through the conguration of techniques for resource sampling from datasets, topic extraction from reference datasets and their ranking based on graphical models. To enable a good trade-o between scalability and accuracy of generated proles, appropriate parameters are determined experimentally. Our evaluation considers topic proles for all accessible datasets from the Linked Open Data cloud. The results show that our approach generates accurate proles even with comparably small sample sizes (10%) and outperforms established topic modelling approaches

Quality

  • User Rating
  • Comments
  • Learning Exercises
  • Bookmark Collections
  • Course ePortfolios
  • Accessibility Info

More about this material

Comments

Log in to participate in the discussions or sign up if you are not already a MERLOT member.