Basics

Apache Ivy Logo

Installation and Quick Start

Scoopi web scraper can be installed in multiple ways. Easiest way to use Scoopi is to run it from docker image which comes with pre-configured JRE.

Definition File

Scoopi uses set of YML definition files to extract data from web pages. Set of example definition files is provided to learn the yml elements used by Scoopi

DataDef

Scoopi web scraper uses datadef to define data. Datadef contains axis, query, script and members which collectively defines the data

Dimensions

Scoopi DataDef uses dimensions to add extra attributes to the scraped data such as date. Any number of attributes may be added to a data item.