Scrapegraph-ai/examples/document_scraper_graph
2025-01-07 14:48:40 +01:00
..
ollama refactoring examples 2025-01-07 14:48:40 +01:00
openai refactoring examples 2025-01-07 14:48:40 +01:00
.env.example refactoring examples 2025-01-07 14:48:40 +01:00
README.md refactoring examples 2025-01-07 14:48:40 +01:00

Document Scraper Graph Example

This example demonstrates how to use Scrapegraph-ai to extract data from various document formats (PDF, DOC, DOCX, etc.).

Features

  • Multi-format document support
  • Text extraction
  • Document parsing
  • Metadata extraction

Setup

  1. Install required dependencies
  2. Copy .env.example to .env
  3. Configure your API keys in the .env file

Usage

from scrapegraphai.graphs import DocumentScraperGraph

graph = DocumentScraperGraph()
content = graph.scrape("document.pdf")

Environment Variables

Required environment variables:

  • OPENAI_API_KEY: Your OpenAI API key