Scrape Web Page

Extracts raw HTML content from any URL. Simply provide the target URL and the API returns the complete HTML source code. Supports optional JavaScript rendering via Google Chrome headless browser for dynamic websites (React, Vue, Angular apps that load content via JavaScript). Use proxy rotation to avoid IP-based rate limiting and blocks by rotating through different IP addresses on each request. Include cookies to scrape authenticated content or maintain session state. Returns the raw HTML which can be parsed with libraries like Cheerio, BeautifulSoup, or DOM parsers to extract specific data (product prices, article text, metadata, links, images, etc.). Perfect for price monitoring, content aggregation, competitive intelligence, lead generation, SEO analysis, and web automation tasks.

MethodGET

Parameters4 parameters

CategoryWeb Scraping & Data

Action

Scrape Web Page

Input & Output Details

A clear overview of the data fields required to trigger Scrape Web Page action and the information you'll receive in return.

Input Fields

url

Required

string

url

The URL to extract data from. Must include the full HTTP protocol (http:// or https://). If your URL has parameters, encode special characters (e.g., & becomes %26). Examples: "https://news.ycombinator.com", "https://example.com/page?id=123&category=tech".

render_js

boolean

render_js

If true, the request will render JavaScript on the target site using a Google Chrome headless browser. Use this for dynamic websites that load content via JavaScript (React, Vue, Angular apps). Defaults to false. Note: Enabling this increases processing time and resource usage.

use_proxy

boolean

use_proxy

If true, the request will use a different IP address on each call to avoid rate limiting and IP blocks. Useful for scraping sites with strict rate limits or IP-based restrictions. Defaults to false.

cookie_jar

array

cookie_jar

An array of cookie objects to include with the request. Used to make authenticated requests or maintain session state. Each cookie object must have: path, value, name, and domain. Useful for scraping content behind login walls or personalized pages.

Output Fields

html_content

Required

string

html_content

The raw HTML content from the scraped web page. Contains the complete HTML source code including all tags, scripts, styles, and text. Can be parsed with libraries like Cheerio, BeautifulSoup, or DOM parsers to extract specific data.

url

Required

string

url

The URL that was scraped.

status_code

Required

number

status_code

The HTTP status code of the response (e.g., 200 for success, 404 for not found, 500 for server error).

content_type

string

content_type

The Content-Type header from the response (e.g., "text/html; charset=utf-8").

content_length

number

content_length

The size of the HTML content in bytes.

render_js_used

boolean

render_js_used

Whether JavaScript rendering was used for this request.

proxy_used

boolean

proxy_used

Whether a proxy was used for this request.

More Actions from Web Scraping (AbstractAPI)

Explore the powerful actions available for Web Scraping (AbstractAPI)

All Actions

Scrape Web Page

Action

Learn More about Web Scraping (AbstractAPI)

Extract data from any website with ease. Scrape HTML content from URLs with optional JavaScript rendering via headless Chrome browser, proxy rotation to avoid IP blocks and rate limits, and cookie support for authenticated sessions. Returns raw HTML that can be parsed to extract specific data. Perfect for price monitoring, content aggregation, competitor analysis, lead generation, and web automation. Handles dynamic JavaScript-heavy sites (React, Vue, Angular) and static HTML pages.

FAQs about Web Scraping (AbstractAPI) integration

Connect Other Integrations

Explore more tools and services you can connect with Integrations

AddressZen

Address lookup, verification, and validation service for USA, UK, Canada, Ireland, and Australia. Provides real-time address autocomplete, postcode lookup, and address verification APIs.

Analytics & Reports

AgentMail

API platform providing AI agents with email inboxes to send, receive, and act upon emails for communication with services, people, and other agents.

Email & Messaging

AI/ML API

Unified API for accessing 100+ AI/ML models including GPT-4, Claude, Stable Diffusion, DALL-E, and more. Supports text generation, image creation, video generation, embeddings, and audio transcription.

AI & Developer Tools

Airtable

Airtable integration for managing bases, tables, records, and comments. Enables creating, reading, updating, and deleting data in Airtable workspaces.

Productivity & Notes

Work & Collaboration

Amdoren API

Access weather forecasts, currency conversion, timezone information, time conversion, and location data. Supports 5-day weather forecasts, 150+ currencies, and worldwide timezone conversions.

Anchor Browser

Browser automation platform for managing profiles, sessions, and automated browsing tasks.

AI & Developer Tools

Automation & Workflows

AnyAPI

Access 13+ APIs through a single platform: email validation, currency exchange, holidays, IBAN/VAT validation, IP geolocation, phone validation, web scraping, screenshots, sentiment analysis, and more.

Custom APIs & Webhooks

AI & Developer Tools

API Ninjas

Access 50+ APIs for data and utilities including jokes, facts, quotes, weather, nutrition, city data, QR codes, barcodes, trivia, and more. All through a single API key.

AI & Developer Tools

ApiFlash

Simple yet powerful screenshot API for capturing pixel-perfect website screenshots at scale. Built on AWS Lambda with up-to-date Chrome browser.

AI & Developer Tools

How Platoona Works

Build, Automate & Scale — In One Smart Workspace.

Agents assemble & collaborate
The right specialists come together automatically — Writer, Researcher, CMO, CTO, Ops — working as a real team.

Start Working with Agents

Tell Platoona what you need
Just chat normally. Your Platoona Agents understand your goal instantly and know what outcome you're aiming for.

Start Your Journey

Marketplace of Agents, Integrations & Flows
Explore ready-made agents, tools, and automation flows created by the community and the Platoona team.

Explore Marketplace

Connect with 250+ integrations
Tools like Slack, Notion, Gmail, HubSpot, Twilio, Drive, and more — all connected so agents can act directly.

Connect Your Tools

Set up workflows & automation
Create recurring tasks and automated workflows. Your AI platoon runs them on schedule or trigger, keeping work flowing without your constant input.

Automate Your Workflow