This is an HTTP API you can use to unfurl and extract content from any web page as JSON. You can get the title, description, open graph, embed content or any other information available at a given public URL (check examples below). You can use this for building: Chatbots

Rich text inputs (auto-linking)

Browser extensions

Site monitoring & validation tools

Sales & marketing tools

Data scrapers for research You will need an access token to use Page.REST API. An access token costs $5 for 10,000 requests.

Why use this?

You might be wondering why you should use Page.REST API rather than coding it yourself. Here are some reasons: It handles the nitty gritty edge cases (HTML parsing is 😤 😰 😩 )

You save network bandwidth (only download what you need from a page)

Hosted using Google Cloud Functions - so it will have high availability

You want to hack something quickly!

How to use

Try the examples to see what API returns. You can edit the code to try different URLs. (alternatively, you can run it on Postman

Basic

The default request grabs site’s title, description, logo, favicons, canonical URL, status code, and Twitter handle.

JS

cURL ▶︎

Selector queries

most useful feature. You can use This is probably thefeature. You can use CSS selectors to retrieve content from matching elements. In the example, we use selectors to retrieve the businesses and their founders featured in IndieHackers. (You can use up to 10 selector queries.)

JS

cURL ▶︎

Pre-render content

Append &prerender=1 to the request URL to extract content from pages that render on client-side using JavaScript. In the example, we extract currently available engineering jobs from Tesla’s career page which is built using React. The selector won’t return anything if you drop the prerender parameter.

JS

cURL ▶︎

Embed content

&embed=1 to the request URL to get the Appendto the request URL to get the oEmbed content for the page as part of the response (only if available).

JS

cURL ▶︎

Open Graph

&og=1 to the request URL to get the Appendto the request URL to get the OpenGraph content for the page as part of the response (only if available).

JS

cURL ▶︎

Response headers

Get any HTTP headers defined in the response. In the example, we check security headers of github.com.

JS

cURL ▶︎

Sorry, not accepting new customers for the service.