From 15421eff7009b80293f7d84df5086d22944dfb99 Mon Sep 17 00:00:00 2001 From: Marco Perini Date: Wed, 12 Jun 2024 01:23:01 +0200 Subject: [PATCH] feat(merge): add scriptcreatormulti, rag cache and semchunk --- README.md | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 08cf2150..8fe3a692 100644 --- a/README.md +++ b/README.md @@ -43,11 +43,14 @@ The documentation for ScrapeGraphAI can be found [here](https://scrapegraph-ai.r Check out also the Docusaurus [here](https://scrapegraph-doc.onrender.com/). ## 💻 Usage -There are three main scraping pipelines that can be used to extract information from a website (or local file): +There are multiple standard scraping pipelines that can be used to extract information from a website (or local file): - `SmartScraperGraph`: single-page scraper that only needs a user prompt and an input source; - `SearchGraph`: multi-page scraper that extracts information from the top n search results of a search engine; - `SpeechGraph`: single-page scraper that extracts information from a website and generates an audio file. -- `SmartScraperMultiGraph`: multiple page scraper given a single prompt +- `ScriptCreatorGraph`: single-page scraper that extracts information from a website and generates a Python script. + +- `SmartScraperMultiGraph`: multi-page scraper that extracts information from multiple pages given a single prompt and a list of sources; +- `ScriptCreatorMultiGraph`: multi-page scraper that generates a Python script for extracting information from multiple pages given a single prompt and a list of sources. It is possible to use different LLM through APIs, such as **OpenAI**, **Groq**, **Azure** and **Gemini**, or local models using **Ollama**.