Commit Graph

327 Commits

Author SHA1 Message Date
Alok Saboo
58289eccc5 feat: Add new models and update existing ones 2024-05-22 15:07:27 -04:00
Marco Vinciguerra
f2dffe534f fix: pdf scraper bug 2024-05-22 11:54:55 +02:00
Marco Vinciguerra
1ef63217dd add deepseek embeddings 2024-05-21 10:41:01 +02:00
Marco Vinciguerra
659fad770a fix: add deepseek embeddings 2024-05-21 10:36:40 +02:00
Mobin Chowdhury
7f30da9930 Update models_tokens.py 2024-05-18 23:32:27 +06:00
Marco Vinciguerra
a0125b7e90 add orca mini integration 2024-05-18 09:31:40 +02:00
Marco Perini
d4d913c8a3 fix(deepcopy): switch whether we have obj in the config 2024-05-17 10:36:39 +02:00
Marco Perini
999c930f42 fix(deepcopy): reaplced to shallow copy 2024-05-15 16:00:42 +02:00
VinciGit00
cc5adefd29 fix: come back to the old version 2024-05-15 15:54:00 +02:00
VinciGit00
5587a64d23 fix: removed unused 2024-05-15 15:51:27 +02:00
VinciGit00
7ced0d0564 remove asdt 2024-05-15 15:49:09 +02:00
Marco Vinciguerra
cffcf80a75 Merge branch '88-blockscraper-implementation' into main 2024-05-15 15:20:12 +02:00
Marco Vinciguerra
22cd9e3605 Merge branch 'search_link_context' into main 2024-05-15 15:16:57 +02:00
Marco Perini
096b665c01 fix(searchgraph): used shallow copy to serialize obj 2024-05-15 15:05:47 +02:00
Marco Vinciguerra
932df8d491 Merge pull request #238 from VinciGit00/gpt4-omni 2024-05-14 17:09:23 +02:00
mayurdb
a458ec4b9f Update the prompt for the search_link_node 2024-05-14 16:59:16 +02:00
Marco Perini
a6e1813ddd fix(fetch_node): bug in handling local files 2024-05-14 16:51:10 +02:00
Marco Perini
fcb3abb01d feat(omni-search): added omni search graph and updated docs 2024-05-14 16:26:43 +02:00
Marco Perini
a296927624 feat(omni-scraper): working OmniScraperGraph with images 2024-05-14 13:46:49 +02:00
Marco Perini
90955ca52f feat(gpt-4o): image to text single node test 2024-05-14 11:43:21 +02:00
Federico Aguzzi
52a4a3b22d feat: add gpt-4o
closes #232
2024-05-14 11:11:34 +02:00
VinciGit00
d6f5ca874f Merge branch 'main' into pre/beta 2024-05-14 10:24:29 +02:00
Marco Perini
367dea5cbd Merge branch 'pre/beta' into feat/parallel-node-execution 2024-05-13 23:50:40 +02:00
Marco Perini
a8d5e7db05 feat(batchsize): tested different batch sizes and systems 2024-05-13 23:49:48 +02:00
Alok Saboo
d9752b1619 chore: update models_tokens.py with new model configurations 2024-05-13 17:22:32 -04:00
JGalego
859c5d5ba5 Refactored to include custom AWS client for bedrock; Added missing Anthropic class 2024-05-13 17:57:53 +01:00
Marco Perini
dedc733047 fix(asyncio): replaced deepcopy with copy due to serialization problems 2024-05-13 18:46:34 +02:00
Marco Vinciguerra
f3d44c09ae Merge pull request #228 from JGalego/fix/bedrock-support
fix: Amazon Bedrock Support
2024-05-13 14:57:23 +02:00
VinciGit00
d0167dee71 fix: bug for claude 2024-05-13 13:43:42 +02:00
JGalego
39be38f292 Fixed anthropic/bedrock conflict; Removed duplicate class Claude; Updated BedrockEmbeddings class 2024-05-13 12:35:42 +01:00
Marco Perini
0c1594737f fix(fetch-node): removed isSoup from default 2024-05-13 12:09:55 +02:00
Marco Perini
7e8acd8e6a Merge branch 'pre/beta' into fix/fetch-node-proxybroker 2024-05-13 11:17:37 +02:00
Marco Perini
5d6d996e8f fix(proxy-rotation): removed max_shape duplicate 2024-05-13 07:26:43 +02:00
Marco Perini
1e9a564616 fix(proxy-rotation): removed duplicated arg and passed the loader_kwarhs correctly to the node 2024-05-12 18:39:03 +02:00
VinciGit00
e2350eda62 feat: add new prompt info 2024-05-12 11:14:30 +02:00
Federico Aguzzi
156b67b91e feat: add support for deepseek-chat
closes #222
2024-05-12 01:00:11 +02:00
mayurdb
dd29c16cbe Merge branch 'deepScrape' of github.com:mayurdb/Scrapegraph-ai into deepScrape 2024-05-11 16:58:54 +05:30
mayurdb
9a67a26cd3 Update documentation 2024-05-11 16:57:22 +05:30
Mayur Bhosale
ea3b545573 Merge branch 'pre/beta' into deepScrape 2024-05-11 16:54:40 +05:30
mayurdb
8f1fbe7e10 minor changes 2024-05-11 16:52:36 +05:30
mayurdb
df271b6451 Add search link node that can find out relevant links in the webpage 2024-05-11 16:39:55 +05:30
Federico Minutoli
b54d984c13 fix(chromium-loader): ensure it subclasses langchain's base loader 2024-05-11 11:44:56 +02:00
VinciGit00
dc91719365 Update cleanup_html.py 2024-05-11 10:49:16 +02:00
Marco Vinciguerra
b752499fab Merge pull request #217 from mayurdb/fetchLinkFix
Fetch links in the page while parsing html
2024-05-11 09:42:40 +02:00
mayurdb
300fd5d253 Fetch links in the page while parsing html 2024-05-11 09:46:51 +05:30
Eric Page
0683e78e78 Merge branch 'pre/beta' into fix-GenerateScraperGraph 2024-05-11 01:59:28 +02:00
Eric Page
24c3b0580c Removed nonfunctional RAG node from ScriptCreatorGraph 2024-05-11 01:38:40 +02:00
Eric Page
aac51ba290 Removed dead code, allows GenerateScraperNode to generate scraper with one chunk of context 2024-05-11 01:34:51 +02:00
Eric Page
40884747c7 Added parse_html option in parse_node 2024-05-11 00:32:01 +02:00
Federico Minutoli
627cbeeb20 feat(parallel-exeuction): add asyncio event loop dispatcher with semaphore for parallel graph instances
TODO: still untested
2024-05-11 00:13:27 +02:00