Scoopi

Web Scraper

Apache Ivy Logo

Introduction

Scoopi web scraper extracts and transform data from HTML pages. It uses a set of definition files to scrape data and no coding skill is required

Installation and Quick Start

Scoopi web scraper can be installed in multiple ways. Easiest way to use Scoopi is to run it from docker image which comes with pre-configured JRE.

Definition File

Scoopi uses set of YML definition files to extract data from web pages. Set of example definition files is provided to learn the yml elements used by Scoopi

DataDef

Scoopi web scraper uses datadef to define data. Datadef contains axis, query, script and members which collectively defines the data