Scraper example using bible.com as the source.

Steve Thielemann 9ed732fee2 Fixed verse -f 1 week ago
src 9ed732fee2 Fixed verse -f 1 week ago
.gitignore f6790cbbbc Making a single request to httpbin.org. 3 weeks ago
Cargo.lock 507c90360f Added user agent updater. 2 weeks ago
Cargo.toml 507c90360f Added user agent updater. 2 weeks ago
README.md 736758e4bd Added extract --output for JSON output. 1 week ago
initial.config 507c90360f Added user agent updater. 2 weeks ago

README.md

Rust - Reqwest and Scraper example

This is an example web scraper project. It downloads and extracts the Bible from https://www.bible.com.

User-Agent

The program will update the user-agent string it uses via https://www.mozilla.org/en-US/firefox/releases/ because I can.

Pass -a or --agent-update to run the update.
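
As a rough illustration of what such an update could involve, the sketch below picks the highest version from a list of release numbers and builds a Firefox-style user-agent string from it. This is not the project's code: the helper names `parse_version`, `latest_version` and `firefox_user_agent`, the Linux platform token, and the idea that version strings have already been pulled from the releases page are all assumptions made for the example.

```rust
/// Parse a dotted version such as "127.0.2" into its numeric parts.
/// Returns None for anything that is not purely numeric (e.g. "128.0b3").
fn parse_version(s: &str) -> Option<Vec<u32>> {
    s.trim().split('.').map(|part| part.parse().ok()).collect()
}

/// Pick the highest version from a list of candidate strings.
/// Hypothetical helper: assumes the strings were already extracted
/// from the releases page by some other code.
fn latest_version<'a>(candidates: &[&'a str]) -> Option<&'a str> {
    candidates
        .iter()
        .copied()
        .filter_map(|s| parse_version(s).map(|v| (v, s)))
        .max_by(|a, b| a.0.cmp(&b.0)) // lexicographic compare on the numeric parts
        .map(|(_, s)| s)
}

/// Build a Firefox-style user-agent string for the given version.
/// The platform token (X11; Linux x86_64) is an assumption for this sketch.
fn firefox_user_agent(version: &str) -> Option<String> {
    let major = *parse_version(version)?.first()?;
    Some(format!(
        "Mozilla/5.0 (X11; Linux x86_64; rv:{major}.0) Gecko/20100101 Firefox/{major}.0"
    ))
}
```

The resulting string could then be stored in the config file and sent as the User-Agent header on later requests.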

Bible versions

  • ESV
  • NIV
  • YLT98
  • KJV
  • NASB2020
  • MKJV

Select a version with -v, for example: -v NIV

Relative/absolute URLs

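Links found on a page may be relative or absolute. See relative_to_absolute(url: &str, href: &str) -> Result, which turns an href into an absolute URL based on the URL of the page it came from.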
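
To show the idea, here is a simplified, standard-library-only sketch of such a function. It is not the project's implementation: the `Result<String, String>` return type and the `has_scheme` helper are assumptions, and it skips edge cases such as fragment-only links and mailto: links. In a real project the `url` crate's `Url::join` is probably the more robust choice.

```rust
/// True when `s` starts with a scheme such as `https://`.
/// Hypothetical helper for this sketch.
fn has_scheme(s: &str) -> bool {
    match s.split_once("://") {
        Some((scheme, _)) => {
            !scheme.is_empty()
                && scheme
                    .chars()
                    .all(|c| c.is_ascii_alphanumeric() || matches!(c, '+' | '-' | '.'))
        }
        None => false,
    }
}

/// Resolve `href` against the page `url` (simplified sketch).
fn relative_to_absolute(url: &str, href: &str) -> Result<String, String> {
    // Already absolute: return unchanged.
    if has_scheme(href) {
        return Ok(href.to_string());
    }
    let (scheme, rest) = url
        .split_once("://")
        .ok_or_else(|| format!("base URL has no scheme: {url}"))?;
    // Protocol-relative link (//host/path): reuse the base scheme.
    if let Some(stripped) = href.strip_prefix("//") {
        return Ok(format!("{scheme}://{stripped}"));
    }
    // Drop the base URL's query and fragment, then split host from path.
    let rest = rest.split(|c: char| c == '?' || c == '#').next().unwrap_or(rest);
    let (host, base_path) = match rest.find('/') {
        Some(i) => (&rest[..i], &rest[i..]),
        None => (rest, "/"),
    };
    if host.is_empty() {
        return Err(format!("base URL has no host: {url}"));
    }
    // Root-relative links replace the whole path; others replace the last segment.
    let joined = if href.starts_with('/') {
        href.to_string()
    } else {
        let dir_end = base_path.rfind('/').unwrap_or(0);
        format!("{}{}", &base_path[..=dir_end], href)
    };
    // Keep the href's query/fragment out of the segment handling.
    let (path, suffix) = match joined.find(|c: char| c == '?' || c == '#') {
        Some(i) => joined.split_at(i),
        None => (joined.as_str(), ""),
    };
    // Resolve "." and ".." segments.
    let mut segments: Vec<&str> = Vec::new();
    for segment in path.split('/').skip(1) {
        match segment {
            "." => {}
            ".." => {
                segments.pop();
            }
            other => segments.push(other),
        }
    }
    Ok(format!("{scheme}://{host}/{}{suffix}", segments.join("/")))
}
```

With a made-up base of https://www.bible.com/a/b/c, the href ../d?x=1 resolves to https://www.bible.com/a/d?x=1, and /e resolves to https://www.bible.com/e.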

Using

  • Copy initial.config to app.config.
  • Make a bible directory.
  • Run the fetch command.
  • Run the extract command. Try the --output option.
  • Run the verse command.