Commit Graph

102 Commits

Author SHA1 Message Date
Marco Vinciguerra
f7ba1f30de refactoring of the code 2024-08-23 11:33:22 +02:00
Federico Aguzzi
b48ee825ee Merge branch 'pre/beta' into support_structured_output_shema_openai 2024-08-19 13:31:01 +02:00
Federico Aguzzi
683bf57d89 fix(ParseNode): leave room for LLM reply in context window 2024-08-19 11:33:09 +02:00
Marco Vinciguerra
8b8d8f09b7 refactoring of the code according to pylint style
Some checks are pending
/ build (push) Waiting to run
Release / Build (push) Waiting to run
Release / Release (push) Blocked by required conditions
2024-08-18 11:51:34 +02:00
Marco Vinciguerra
b115f9612f refactoring of code and documentation 2024-08-16 09:01:27 +02:00
Marco Vinciguerra
8b2c266aff refactoring of the code
Co-Authored-By: Matteo Vedovati <68272450+vedovati-matteo@users.noreply.github.com>
2024-08-10 17:44:35 +02:00
Marco Vinciguerra
886bab78e7
Merge branch 'main' into temp-1 2024-08-06 10:04:57 +02:00
Marco Vinciguerra
9355507a2d feat: refactoring of the code 2024-08-02 12:00:00 +02:00
Marco Perini
5c08eea189 docs: prev version
Some checks failed
CodeQL / Analyze (python) (push) Has been cancelled
/ build (push) Has been cancelled
Release / Build (push) Has been cancelled
Release / Release (push) Has been cancelled
2024-07-24 23:37:57 +02:00
Marco Vinciguerra
e08b30495d boh
Some checks are pending
CodeQL / Analyze (python) (push) Waiting to run
/ build (push) Waiting to run
Release / Build (push) Waiting to run
Release / Release (push) Blocked by required conditions
2024-07-24 14:33:55 +02:00
Marco Perini
1d41f6eafe
fix: md conversion 2024-07-23 19:27:18 +02:00
Marco Perini
5a45e9f2d8
fix: md conversion 2024-07-23 18:09:15 +02:00
Marco Perini
12b5eada6e
fix(md_conversion): add absolute links md, added missing dependency 2024-07-23 15:34:12 +02:00
Marco Vinciguerra
602dd00209 feat: refactoring_to_md function 2024-07-16 12:39:48 +02:00
Federico Aguzzi
7ba2f6ae0b chore: correct search engine name
Some checks failed
/ build (push) Has been cancelled
2024-07-14 22:39:34 +02:00
Marco Vinciguerra
7e5789baa3 Update research_web.py 2024-07-14 19:31:22 +02:00
Federico Aguzzi
104d8692d6
Merge branch 'pre/beta' into support
Some checks failed
/ build (push) Has been cancelled
2024-07-04 18:36:34 +02:00
Marco Vinciguerra
30ca15ca28
Merge branch 'md_scraper_integration' into integration_markdown
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-30 16:58:37 +02:00
Marco Vinciguerra
2804434a9e feat: add integrations for markdown files
Some checks are pending
/ build (3.10) (push) Waiting to run
2024-06-29 13:35:39 +02:00
Marco Vinciguerra
3a537eec6f fix: add test 2024-06-23 17:54:09 +02:00
Marco Vinciguerra
9917972c11 fixed request
Some checks are pending
/ build (3.10) (push) Waiting to run
2024-06-22 21:39:37 +02:00
Marco Vinciguerra
7af411aa99 add trigger
Co-Authored-By: Matteo Vedovati <68272450+vedovati-matteo@users.noreply.github.com>
2024-06-21 13:36:27 +02:00
Marco Vinciguerra
5d6123847e add new convert function
Some checks are pending
/ build (3.10) (push) Waiting to run
Co-Authored-By: Federico Minutoli <40361744+DiTo97@users.noreply.github.com>
2024-06-20 21:15:16 +02:00
Marco Vinciguerra
8bb560a489 add convert function 2024-06-19 20:17:45 +02:00
Marco Vinciguerra
aa2160c108 feat: add research with bing + test function
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-18 21:28:29 +02:00
Marco Vinciguerra
073d226723 feat: add new search engine avaiability and new tests 2024-06-18 14:35:13 +02:00
Marco Perini
283b61fafc docs: better logging 2024-06-13 18:13:47 +02:00
Marco Vinciguerra
a6757aca57
Merge pull request #380 from Kshitij-Jande/main
Strip out the scheme from the proxy server address URI.
2024-06-13 13:23:26 +02:00
Kshitij Jande
2a9ab69f4b Strip out the scheme from the server address URI 2024-06-13 12:18:41 +05:30
Marco Vinciguerra
5065aa00e1
Merge branch 'pre/beta' into supercoder-327 2024-06-12 12:12:15 +02:00
supercoder-dev
879c94a2b5
Update cleanup_html.py 2024-06-12 14:30:02 +05:30
seyf97
acece72c28
Update cleanup_html.py
Remove redundant lines in Links extraction
2024-06-04 13:49:00 +03:00
Marco Perini
8be27bad80 fix(3.9): python 3.9 logging fix 2024-05-31 22:32:20 +02:00
Marco Perini
5fb9115330 feat(version): python 3.12 is now supported 🚀 2024-05-26 08:49:27 +02:00
Marco Perini
5684578fab fix(kg): removed unused nodes and utils 2024-05-25 00:44:14 +02:00
Marco Vinciguerra
b913b51cca Merge branch 'logger-integration' into pre/beta 2024-05-24 12:39:14 +02:00
Federico Minutoli
d1394809d7 fix(logging): source code citation 2024-05-24 01:08:08 +02:00
Marco Perini
c75e6a06b1 feat(kg): working rag kg 2024-05-18 10:26:25 +02:00
Marco Perini
0196423bde feat(knowledgegraph): add knowledge graph node 2024-05-17 23:41:44 +02:00
VinciGit00
cc5adefd29 fix: come back to the old version 2024-05-15 15:54:00 +02:00
VinciGit00
29d284e497 Merge branch 'main' into logger-integration 2024-05-15 15:28:20 +02:00
Marco Vinciguerra
cffcf80a75
Merge branch '88-blockscraper-implementation' into main 2024-05-15 15:20:12 +02:00
VinciGit00
e53766b16e feat: add logger integration 2024-05-14 15:20:39 +02:00
Marco Perini
a296927624 feat(omni-scraper): working OmniScraperGraph with images 2024-05-14 13:46:49 +02:00
Marco Perini
7e8acd8e6a
Merge branch 'pre/beta' into fix/fetch-node-proxybroker 2024-05-13 11:17:37 +02:00
Marco Perini
5d6d996e8f fix(proxy-rotation): removed max_shape duplicate 2024-05-13 07:26:43 +02:00
Marco Perini
1e9a564616 fix(proxy-rotation): removed duplicated arg and passed the loader_kwarhs correctly to the node 2024-05-12 18:39:03 +02:00
VinciGit00
dc91719365 Update cleanup_html.py 2024-05-11 10:49:16 +02:00
mayurdb
300fd5d253 Fetch links in the page while parsing html 2024-05-11 09:46:51 +05:30
Federico Minutoli
fc2aa3ac1c Merge branch 'pre/beta' of https://github.com/DiTo97/Scrapegraph-ai into fix/fetch-node-proxybroker 2024-05-10 21:20:40 +02:00