Capture

Capture is a process of extracting some data from the web page and processing it via data processors.

To extract data the agent uses Data Patterns. Data patterns are created with point and click wizards. Once created data pattern will be used by Agent or Extractor to extract data on web pages with similar structure. Creating of data pattern is described in separate chapter.

Also there is special type of capture that extracts page meta information like: title, url, source, load date. These captures are described in Special Captures.

Refine Data

Captured data may be refined by data processors. The sequence of data processors can be attached to the capture statement and do some data modification like: clearing text data, extract substring from the text, load images and files from URL string, etc. You can read more about data processors here.