Render raises $100M at a $1.5B valuation

Read the announcement

Web Scraper & Summarizer AI Agent

An intelligent web scraping and summarization tool powered by AI.

Why deploy Web Scraper & Summarizer AI Agent on Render?

Web Scraper & Summarizer AI Agent is a tool that crawls websites and generates AI-powered summaries of their content. It solves the problem of manually reviewing multiple web pages by automatically extracting and synthesizing information about companies, products, and services into structured insights.

This template wires together a Next.js frontend and Express backend with WebSocket communication already configured for real-time crawling updates—no need to manually set up the bidirectional streaming between services. You just supply your Browserbase and Anthropic API keys, and Render's one-click deploy handles the service linking and environment variable injection across both components. Skip the Docker Compose orchestration and CORS configuration you'd wrestle with locally, and get straight to a production URL you can share.

Architecture

What you can build

After deploying, you'll have a web app where you can submit any URL and get an AI-generated summary of the site's company info, products, and services—with real-time progress updates as pages are crawled. The app retains session context for 30 minutes, so you can ask follow-up questions about the analyzed content without re-crawling.

Key features

  • Browserbase Cloud Browsers: Uses Browserbase SDK with Playwright for cloud-hosted browser automation that handles web crawling without managing browser infrastructure.
  • Claude AI Summarization: Integrates Anthropic Claude API (Sonnet, Haiku, Opus models) to generate structured summaries with company overview, products, and insights from scraped content.
  • WebSocket Real-time Updates: Bidirectional WebSocket communication streams crawling progress and status updates to the frontend as pages are processed.
  • Follow-up Chat Sessions: Maintains 30-minute session retention allowing users to ask follow-up questions about analyzed websites after initial scraping.
  • One-click Render Deploy: Includes render.yaml blueprint for automated production deployment with pre-configured service networking, health checks, and SSL.

Use cases

  • Sales rep researches prospect company before a discovery call
  • Product manager analyzes competitor websites to identify feature gaps
  • Journalist quickly summarizes unfamiliar company for article background
  • Investor screens startup websites during deal sourcing research

What's included

Service
Type
Purpose
browser-research-agent-backend
Web Service
Handles API requests and business logic
browser-research-agent-frontend
Web Service
Serves the user interface

Prerequisites

  • Browserbase API Key: API key for Browserbase cloud browser infrastructure that powers the web scraping functionality.
  • Browserbase Project ID: Your project identifier from Browserbase to associate browser sessions with your account.
  • Anthropic API Key: API key for Anthropic's Claude AI that generates intelligent summaries and insights from scraped content.

Next steps

  1. Open the frontend URL and enter a test website URL like https://render.com — You should see real-time crawling progress messages appear in the chat interface as pages are discovered and analyzed
  2. Test the WebSocket connection by watching the status indicator while a scrape runs — You should see live updates showing each page being crawled and the final AI-generated summary with company overview, products, and insights
  3. Configure the ANTHROPIC_MODEL environment variable in Render dashboard to switch AI models if needed — You should see the change reflected in the summary quality and response time on your next scrape

Stack

express
node.js
claude
websockets
next.js

Tags

ai
web scraper
realtime