modulepackage
0.0.0-20240801035318-e3b44bdf5619
Repository: https://github.com/alphastrata/scrapethisfor.git
Documentation: pkg.go.dev
# README
scrapeThisFor
A command line tool for scraping URLs that contain a given keyword and downloading their contents.
Roadmap
- Nil, this is feature complete for what I needed.
Requirements
-
Clone this repo:
git clone https://github.com/alphastrata/scrapeThisFor.git
-
Golang >= 1.20
-
A working internet connection (for
go mod
to work) and for the app to download content. -
Some command line/terminal proficiency.
-
A browser if you want to read the documentation generated by godoc.
Note: You may need to restart your machine/re-login to your shell (depending on your OS).
Building
- Run
go mod tidy
to download the required packages. - Run
go build main.go
to build the application.
Example Usage
-
This command:
go run . https://huggingface.co/bigscience/bloom/tree/main model_000
-
will produce:
https://huggingface.co/bigscience/bloom/blob/main/model_00001-of-00072.safetensors
https://huggingface.co/bigscience/bloom/resolve/main/model_00001-of-00072.safetensors
... Snipped for brevity ...
https://huggingface.co/bigscience/bloom/blob/main/model_00045-of-00072.safetensors
https://huggingface.co/bigscience/bloom/resolve/main/model_00045-of-00072.safetensors
Run it with the 'download' arg to make it download all of the hits in parallel:
go run . https://huggingface.co/bigscience/bloom/tree/main model_000 download
Installation
go build -o scrapeThisFor main.go
Or:
go install github.com/alphastrata/scrapeThisFor@latest
# Functions
No description provided by the author