Commit Graph

290 Commits

Author SHA1 Message Date
Federico Aguzzi
720f18729b Merge branch 'fireworks_integration' into support 2024-07-04 18:17:38 +02:00
Marco Vinciguerra
30ca15ca28
Merge branch 'md_scraper_integration' into integration_markdown
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-30 16:58:37 +02:00
Marco Vinciguerra
2804434a9e feat: add integrations for markdown files
Some checks are pending
/ build (3.10) (push) Waiting to run
2024-06-29 13:35:39 +02:00
Marco Vinciguerra
9b45ebcdcf modify fetch node with no cut mode 2024-06-28 14:38:36 +02:00
Marco Vinciguerra
228a1de2be add new force 2024-06-27 18:57:27 +02:00
Marco Vinciguerra
4b56604413 add examples + test
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-25 10:32:29 +02:00
Marco Vinciguerra
df0e310829 feat: add fireworks integration 2024-06-24 23:11:28 +02:00
Marco Vinciguerra
92cabe1da6 add load examples from a yml file
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-23 13:02:35 +02:00
Marco Vinciguerra
d8fcb6ccd1 add new examples 2024-06-22 20:59:53 +02:00
Marco Vinciguerra
6549915962 Update Readme.md
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-21 15:00:31 +02:00
Marco Vinciguerra
5d6123847e add new convert function
Some checks are pending
/ build (3.10) (push) Waiting to run
Co-Authored-By: Federico Minutoli <40361744+DiTo97@users.noreply.github.com>
2024-06-20 21:15:16 +02:00
Marco Vinciguerra
2f02830c81 refactoring of fetch node
Some checks are pending
/ build (3.10) (push) Waiting to run
2024-06-20 13:44:42 +02:00
Marco Vinciguerra
23bc6332d0 fixed a bug 2024-06-19 21:46:31 +02:00
Marco Vinciguerra
6d783755ce add benchmark 2024-06-19 21:11:15 +02:00
Marco Perini
0cb46f5f30
Merge pull request #392 from VinciGit00/391-switch-between-search-engines
Some checks failed
/ build (3.10) (push) Has been cancelled
Release / Build (push) Has been cancelled
Release / Release (push) Has been cancelled
feat: add new search engine avaiability and new tests
2024-06-18 23:46:28 +02:00
Jason Vertrees
aedda44868 fix: updated for schema changes
docs: updated for schema changes
2024-06-18 13:31:10 -05:00
Marco Vinciguerra
073d226723 feat: add new search engine avaiability and new tests 2024-06-18 14:35:13 +02:00
Marco Perini
080a318ff6 feat(telemetry): add telemetry module 2024-06-17 13:00:33 +02:00
Marco Vinciguerra
2419003999 fix: fix robot node
Some checks are pending
/ build (3.10) (push) Waiting to run
Release / Build (push) Waiting to run
Release / Release (push) Blocked by required conditions
2024-06-16 14:04:36 +02:00
Marco Perini
9b0e62742b changed source to text 2024-06-14 15:24:50 +02:00
Marco Perini
12f4386552
Merge branch 'pre/beta' into 349-problem-with-scrapegraphaigraphspdf_scraper_graphpy 2024-06-14 15:22:43 +02:00
Marco Perini
203de83405 fix(pdf): correctly read .pdf files 2024-06-14 15:20:30 +02:00
Marco Perini
91c5b5af43 fix(multi): updated multi pdf scraper with schema 2024-06-14 14:59:12 +02:00
Marco Perini
283b61fafc docs: better logging 2024-06-13 18:13:47 +02:00
Marco Vinciguerra
e45f159a31 enhanced performance and readibility 2024-06-12 14:59:10 +02:00
Marco Vinciguerra
58a257f05b update model tokens
Some checks are pending
/ build (3.10) (push) Waiting to run
Release / Build (push) Waiting to run
Release / Release (push) Blocked by required conditions
2024-06-12 12:41:58 +02:00
Marco Perini
a10b060409
Merge pull request #361 from VinciGit00/multi_scraper_implementation
Multi scraper implementation
2024-06-12 00:49:18 +02:00
Marco Perini
5d692bff9e feat(schema): merge scripts to follow pydantic schema 2024-06-12 00:48:08 +02:00
Marco Vinciguerra
9326637cc3 Merge branch 'main' into pre/beta 2024-06-09 17:04:11 +02:00
Marco Vinciguerra
bde02492c0 add examples 2024-06-09 15:26:56 +02:00
Marco Vinciguerra
fe8083fe48 Update pdf_scraper_graph_haiku.py
Some checks failed
/ build (3.10) (push) Has been cancelled
Release / Build (push) Has been cancelled
Release / Release (push) Has been cancelled
2024-06-09 10:02:29 +02:00
Marco Vinciguerra
5dc6165881 add example
Some checks failed
CodeQL / Analyze (python) (push) Has been cancelled
/ build (3.10) (push) Has been cancelled
Release / Build (push) Has been cancelled
Release / Release (push) Has been cancelled
2024-06-09 09:25:37 +02:00
Marco Vinciguerra
c14fb88fca add examples
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-09 08:58:47 +02:00
Marco Vinciguerra
cb00c4fb17 changed model 2024-06-08 12:22:50 +02:00
Marco Vinciguerra
1981230e6f add multi scraper integration 2024-06-08 12:13:18 +02:00
Marco Perini
f41a755519
Merge pull request #356 from VinciGit00/321-integration-with-indexify
fixed pydantic schema
2024-06-07 20:27:25 +02:00
Marco Vinciguerra
dd2b3a8f59 add examples
Some checks failed
/ build (3.10) (push) Has been cancelled
Release / Build (push) Has been cancelled
Release / Release (push) Has been cancelled
2024-06-05 21:08:00 +02:00
Marco Perini
5d1fbf806a feat(indexify-node): add example
Some checks failed
/ build (3.10) (push) Has been cancelled
2024-06-05 18:45:37 +02:00
Marco Vinciguerra
95725789ff add earnie example 2024-06-05 13:21:32 +02:00
Marco Vinciguerra
4f53b09bf1 add examples for schema
Some checks are pending
/ build (3.10) (push) Waiting to run
Release / Build (push) Waiting to run
Release / Release (push) Blocked by required conditions
2024-06-05 10:43:57 +02:00
Marco Vinciguerra
74fd530914
Merge branch 'pre/beta' into 332-pydantic-schema-validation 2024-06-05 09:05:19 +02:00
Marco Perini
376f758a76 feat(pydantic): added pydantic output schema 2024-06-04 23:07:49 +02:00
Marco Vinciguerra
fff89f431f feat: refactoring of abstract graph 2024-06-04 19:41:11 +02:00
Marco Vinciguerra
8de720d379 feat: removed a bug 2024-06-03 21:45:37 +02:00
Marco Vinciguerra
1dde43cdeb add new examples 2024-06-03 21:03:13 +02:00
Marco Perini
79ace115c7
Merge pull request #323 from VinciGit00/refactoring-pdf_scraper
Refactoring pdf scraper and json scrape
2024-06-03 13:26:11 +02:00
Marco Vinciguerra
743dfe1191 add all possible examples 2024-06-03 12:19:43 +02:00
Marco Vinciguerra
b4086550cc feat: add csv scraper and xml scraper multi 2024-06-02 22:57:33 +02:00
Marco Vinciguerra
fa9722d2b9 add examples 2024-06-02 14:43:02 +02:00
Marco Vinciguerra
5cfc10178a feat: add forcing format as json 2024-06-02 12:24:54 +02:00