项目作者: Ak-wa

项目描述 :
A Website Spider/Crawler, Python 3.x
高级语言: Python
项目地址: git://github.com/Ak-wa/Kankra.git
创建时间: 2020-02-06T09:00:40Z
项目社区:https://github.com/Ak-wa/Kankra

开源协议:

下载


Kankra

A Website Spider/Crawler, Python 3.x

python
size
lastcommit
follow

  • Crawls a website for hrefs, js & img files
  • Detects links that use a full URL and those without
    -> e.g <a href="https://www.ssllabs.com/index.html" VS <a href="/projects/index.html
  • Adjusts the results for a useful output
  • Removes duplicates
  • Automatic out of Scope checking
  • Configurable:
    1. --target <str> | Target to scan
    2. --depth <int> | Depth to crawl through
    3. --output <str> | Output file for raw links
    4. --full | Show output with full website links
    5. --silent | Do not show progress, only results

    Usage & examples

  1. Basic Crawling with depth 10

python3 kankra.py --target https://example.com --depth 10

  1. Basic Crawling with depth 15, output file, silent mode & full URL output

python3 kankra.py --target https://example.com --depth 15 --silent --full --output output.txt

Installation

  1. git clone https://github.com/Ak-wa/Kankra.git
  2. cd Kankra
  3. pip3 install -r requirements.txt
  4. python3 kankra.py -h