Scraper example using bible.com as the source.

Steve Thielemann 9ed732fee2 Fixed verse -f 1 week ago
src 9ed732fee2 Fixed verse -f 1 week ago
.gitignore f6790cbbbc Making a single request to httpbin.org. 2 weeks ago
Cargo.lock 507c90360f Added user agent updater. 2 weeks ago
Cargo.toml 507c90360f Added user agent updater. 2 weeks ago
README.md 736758e4bd Added extract --output for JSON output. 1 week ago
initial.config 507c90360f Added user agent updater. 2 weeks ago

README.md

Rust - Reqwest and Scraper example

This is an example web scraper project. It downloads and extracts the Bible from https://www.bible.com.

User-Agent

The program will update the user-agent string it uses via https://www.mozilla.org/en-US/firefox/releases/ because I can.

-a or --agent-update

Bible versions

  • ESV
  • NIV
  • YLT98
  • KJV
  • NASB2020
  • MKJV

-v NIV

Relative/absolute URLs

See relative_to_absolute(url: &str, href: &str) -> Result to properly handle those.

Using

  • Copy initial.config to app.config.
  • Make a bible directory.
  • Run the fetch command.
  • Run the extract command. Try the --output option.
  • Run the verse command.