Scrapegraph-ai/scrapegraphai
Harsh Abasaheb Chavan e81a4ed745 feat: Add configurable timeout to FetchNode
- Add timeout parameter to FetchNode (default: 30 seconds)
- Apply timeout to requests.get() calls to prevent indefinite hangs
- Implement timeout for PDF parsing using ThreadPoolExecutor
- Propagate timeout to ChromiumLoader via loader_kwargs
- Add comprehensive unit tests for timeout functionality
- Fully backward compatible (timeout can be disabled with None)

Fixes issue with requests.get() and PDF parsing blocking indefinitely
on slow/unresponsive servers or large documents.

Usage:
  node_config={'timeout': 30}  # Custom timeout
  node_config={'timeout': None}  # Disable timeout
  node_config={}  # Use default 30s timeout
2025-11-01 09:08:13 +00:00
..
builders codebeaver/pre/beta-963 - . 2025-04-14 07:50:46 +00:00
docloaders codebeaver/pre/beta-963 - . 2025-04-14 07:50:46 +00:00
graphs fix: url redirect 2025-10-23 19:11:16 -07:00
helpers feat: update model tokens 2025-10-22 09:45:34 -07:00
integrations feat: ⛏️ enhanced contribution and precommit added 2025-01-06 15:10:35 +01:00
models fix: grok integration and add new grok models 2025-05-31 00:13:44 +07:00
nodes feat: Add configurable timeout to FetchNode 2025-11-01 09:08:13 +00:00
prompts Merge pull request #993 from ScrapeGraphAI/main 2025-06-24 17:29:48 +02:00
telemetry feat: ⛏️ enhanced contribution and precommit added 2025-01-06 15:10:35 +01:00
utils Fix critical schema transformation bugs and improve logging 2025-07-25 13:33:48 +05:00
__init__.py feat: update logs 2025-06-07 16:53:55 +02:00