项目作者: minyk
项目描述 :
Headless browser processor for Apache Nifi
高级语言: Java
项目地址: git://github.com/minyk/nifi-headlessbrowser-processor.git
Nifi Headless Browser Processor
Currently, URL Provided
configuration is only tested.
- Returns the page source in its current state to FlowFile, including any DOM updates that occurred after page load.
- Use JBrowserDriver.
Prerequisite
Configurations
Most configuration is used to make JBrowserDriver.
configurations
- Host: Hostname for the browser. hostname or ip address.
- Url Provided: if true, the processor read target from
Page URL
configuration. if false, the input flowfile must contain URL. - Page URL: URL for processing. Used only
Url Provided
is true
. - Timezone: Timezone for browser. Select from dropdown list.
- Port Range: port range for JBrowserDriverServer. This range should be multiple of three.
Javascript: Script after page loading. Currently, EL is not supported.
- Remove for now due to timing issue.
Relationship
- success: success relationship of this processor. Flowfile contains page source of input URL.
- failure: failure relationship of this processor.
TODOs