Towards End-user Web Scraping For Customization
Websites are malleable: users can run code in the browser to customize them. However, this malleability is typically only accessible to programmers with knowledge of HTML and Javascript. Previously, we developed a tool called Wildcard which empowers end-users to customize websites through a spreadsheet-like table interface without doing traditional programming. However, there is a limit to end-user agency with Wildcard, because programmers need to first create site-specific adapters mapping website data to the table interface. This means that end-users can only customize a website if a programmer has written an adapter for it, and cannot extend or repair existing adapters.
In this paper, we extend Wildcard with a new system for end-user web scraping for customization. It enables end-users to create, extend and repair adapters, by performing concrete demonstrations of how the website user interface maps to a data table. We describe three design principles that guided our system’s development and are applicable to other end-user web scraping and customization systems: (a) users should be able to scrape data and use it in a single, unified environment, (b) users should be able to extend and repair the programs that scrape data via demonstration and (c) users should receive live feedback during their demonstrations.
We have successfully used our system to create, extend and repair adapters by demonstration on a variety of websites and we provide example usage scenarios that showcase each of our design principles. Our ultimate goal is to empower end-users to customize websites in an intuitive and flexible way, making the web more malleable for all of its users.
(We have provided a PDF but it is best viewed in the online version at https://kapaya.github.io/px21/, which includes videos demonstrating the system.)
Tue 23 MarDisplayed time zone: Belfast change
| 17:00 - 19:00 | |||
| 17:0030m Talk | Type Engineering: A design language for unified Software Engineering PX/21 Anton Dmukhovskiy Art Deco Code Ltd | ||
| 17:3030m Talk | Studying Programmer Behaviour at Scale: A Case Study Using Amazon Mechanical Turk PX/21 | ||
| 18:0030m Talk | Towards End-user Web Scraping For Customization PX/21 Kapaya Katongo Massachusetts Institute of Technology (MIT), Geoffrey Litt MIT, Daniel Jackson Massachusetts Institute of Technology (MIT) | ||
| 18:3030m Talk | Towards exploratory understanding of software using test suites PX/21 Dominik Meier Hasso-Plattner-Institute, Toni Mattis Hasso Plattner Institute, University of Potsdam, Robert Hirschfeld Hasso Plattner Institute (HPI), University of Potsdam, Germany | ||


