# README

Metascraper

Metascraper is a web scraping utility. It transforms valid HTML markup into a hierarchy of Go structs. In addition to capturing the raw HTML at the given endpoint, metascraper will pull out meta tags from the page's head, and also extracts schema.org metadata embedded in the document body.

Usage

p, err := metascraper.Scrape(url)
if err != nil {
    log.Fatal(err)
}
log.Println(p.Title)
pretty.Print(p.MetaData())
pretty.Print(p.SchemaData())

See API documentation

Released under the MIT License

# Functions

AttrMap

AttrMap parses the attributes of the current element into a friendly map.

Scrape

Scrape creates a new page and populates its fields from the content found at the given URL.

# Structs

ItemProp

ItemProp represents a simple schema.org itemprop.

ItemScope

ItemScope represents a schema.org itemscope.

# Interfaces

TokenReader

TokenReader presents a lightweight version of the usual SAX parser interface, with methods for handling the typical events in a token stream.