hakrawler is a Go web crawler designed for easy, quick discovery of endpoints and assets within a web application. It can be used to discover:
Forms
Endpoints
Subdomains
Related domains
JavaScript files
The goal is for the tool to chain easily with other tools, such as subdomain enumeration tools and vulnerability scanners, for example:
amass | hakrawler | some-xss-scanner
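For instance, a minimal pipeline might look like the following sketch, where example.com is just a placeholder target and the trailing sort is only there to deduplicate the plain output:
echo example.com | hakrawler -plain -depth 2 | sort -u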
Features
Unlimited, fast web crawling for endpoint discovery
Fuzzy matching for domain discovery
robots.txt parsing
sitemap.xml parsing
Plain output for easy parsing into other tools
Accept domains from stdin for easier tool chaining
SQLMap-friendly output format (see the example after this list)
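One plausible workflow for the SQLMap-friendly output, assuming the raw HTTP requests saved by -outdir are usable with sqlmap's -r option (example.com and the request file name are placeholders):
# crawl example.com and save the raw HTTP requests it discovers
hakrawler -domain example.com -outdir requests
# test one of the saved requests with sqlmap (<request-file> is a placeholder)
sqlmap -r requests/<request-file>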
Changelog v2.1
Better stability
More features (proxy, timeouts, JSON output, page size limits)
Installation
go get github.com/hakluke/hakrawler
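If the hakrawler command is not found afterwards, your Go binary directory may not be on your PATH; assuming a default Go setup, one way to add it is:
# add the Go binary directory (typically ~/go/bin) to PATH for the current session
export PATH=$PATH:$(go env GOPATH)/bin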
Usage
$ hakrawler -h
Usage of hakrawler:
  -all
        Include everything in output - this is the default, so this option is superfluous (default true)
  -auth string
        The value of this will be included as a Authorization header
  -cookie string
        The value of this will be included as a Cookie header
  -depth int
        Maximum depth to crawl, the default is 1. Anything above 1 will include URLs from robots, sitemap, waybackurls and the initial crawler as a seed. Higher numbers take longer but yield more results. (default 1)
  -domain string
        The domain that you wish to crawl (for example, google.com)
  -forms
        Include form actions in output
  -js
        Include links to utilised JavaScript files
  -outdir string
        Directory to save discovered raw HTTP requests
  -plain
        Don't use colours or print the banners to allow for easier parsing
  -robots
        Include robots.txt entries in output
  -schema string
        Schema, http or https (default "http")
  -scope string
        Scope to include:
          strict = specified domain only
          subs = specified domain and subdomains
          fuzzy = anything containing the supplied domain
          yolo = everything (default "subs")
  -sitemap
        Include sitemap.xml entries in output
  -subs
        Include subdomains in output
  -urls
        Include URLs in output
  -usewayback
        Query wayback machine for URLs and add them as seeds for the crawler
  -wayback
        Include wayback machine entries in output
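Some illustrative invocations built from the flags above (example.com, the token and the cookie value are placeholders rather than defaults):
# plain output of URLs and subdomains, reading the target from stdin
echo example.com | hakrawler -plain -urls -subs
# deeper crawl of example.com and its subdomains, also printing robots.txt and sitemap.xml entries
hakrawler -domain example.com -depth 2 -scope subs -robots -sitemap
# crawl over https, sending an Authorization header and a session cookie
hakrawler -domain example.com -schema https -auth "Bearer <token>" -cookie "session=<value>"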