StructScraper

A tool for web developers


Allows to on-the-fly include into the web page content semantic data of external web resources.

About the project

The semantic markups of the World Wide Web have accumulated a large amount of data, and their number continues to grow. These data are widely used by search engines to present information about found pages in the form of rich snippets. We offer a tool that allows web developers to incorporate these data into their web pages in the same way as search engines do in their SERPs.

This tool allows a web developer to include in the content of the web page data of external web resources embedded in the popular semantic markups microdata and JSON‑LD, as well as metadata from the <meta> tags of HTML documents and the properties of Word and PDF documents.

The tool does not need coding, as well as installation. When working with StructScraper, it is enough for the page author to add some markup to the HTML page and include some start scripts by inserting a piece of ready-made code, and the rest of the work is performed automatically during page loading.

The StructScraper tool allows creating web pages with up-to-date data collected from various external sources, whether or not they enable cross-origin requests. This tool can be useful to bloggers, authors of pages with recipes, scientists to create personal pages and lists of publications; it can be used to compare product prices, site ratings, and in other cases. Some examples with real data from pages of the World Wide Web are presented here.

The tool is open source. Its code is available on GitHub: RimmaSkorn/struct-scraper.

© 2019, KIAM RAS
e-mail: structscraper@gmail.com

References