Commit Graph

970 Commits

Author SHA1 Message Date
Vik Paruchuri
e966a20990 CI fix
Some checks failed
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2026-05-27 10:59:19 -04:00
Vik Paruchuri
c421e742b4 Finalize surya README 2026-05-27 10:49:59 -04:00
Vik Paruchuri
79246df837 Add screenshot app 2026-05-27 10:36:53 -04:00
Vik Paruchuri
80c2903ea2 Cleanups 2026-05-27 09:48:56 -04:00
Vik Paruchuri
3ba0cdf8a1 README fixes 2026-05-27 09:04:58 -04:00
Vik Paruchuri
f6e5b106e1 Fix repo names
Some checks failed
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2026-05-27 07:01:27 -04:00
Vik Paruchuri
05f007aa74 Merge dev 2026-05-26 18:38:15 -04:00
Vik Paruchuri
dba0419e2b Remove einops 2026-05-26 18:12:17 -04:00
Vik Paruchuri
0671165d5d Cleanups 2026-05-26 18:03:24 -04:00
Vik Paruchuri
91bccf73f2 Update README 2026-05-26 17:15:36 -04:00
Vik Paruchuri
48a5068d05 Fix metal 2026-05-18 09:16:48 -04:00
Vik Paruchuri
0299f2978d Update model paths 2026-05-14 17:32:26 -04:00
Vik Paruchuri
2becd37692 Cleanup README 2026-05-14 16:49:54 -04:00
Vik Paruchuri
9cf57d5884 Update README 2026-05-14 16:46:23 -04:00
Vik Paruchuri
9158557600 Cleanups 2026-05-14 16:26:39 -04:00
Vik Paruchuri
8e1d94ff7e Cleanup deps 2026-05-14 15:54:55 -04:00
Vik Paruchuri
05874ae9ad Heavy cleanups 2026-05-14 15:50:11 -04:00
Vik Paruchuri
8e40566be0 Clean up licensing 2026-05-14 15:29:07 -04:00
Vik Paruchuri
6510e7e459 Migrate surya 2026-05-14 15:29:07 -04:00
Tarun Menta
237e63e35d Bump pyproject 2026-05-14 15:29:07 -04:00
Ashish Uppala
ed2fac1340
update readme
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2026-04-22 17:00:25 -04:00
Ashish Uppala
a069e0e8ab update readme 2026-04-22 16:57:37 -04:00
Tarun Menta
3969d6b49d
Bump pyproject
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2026-01-30 16:43:42 -05:00
Tarun Menta
059d94aeaa
Merge pull request #479 from datalab-to/tarun/latex-hotfix
Fix latex detokenization bug - Unescaping sequences
2026-01-30 16:43:15 -05:00
Tarun Menta
c428222535
Better latex fixing 2026-01-30 16:30:41 -05:00
Tarun Menta
3e08b67e30
Fix latex detokenization bug - Unescaping sequences 2026-01-30 10:59:05 -05:00
Vik Paruchuri
fe3e7696cf Merge remote-tracking branch 'origin/dev' into dev
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-10-21 13:34:36 -04:00
Vik Paruchuri
f890b6d05b Add model license 2025-10-21 13:34:27 -04:00
Vik Paruchuri
a2f46b80a4
Merge pull request #466 from wkpark/apache2-fix
restore apache2 license some part of code from original source code
2025-10-20 11:34:17 -04:00
Vik Paruchuri
25a00e8d92 Fix license 2025-10-20 11:32:29 -04:00
Won-Kyu Park
113fe128c2
restore apache2 license from original source code 2025-09-27 07:10:14 +09:00
github-actions[bot]
a46d448a1f
@wkpark has signed the CLA in datalab-to/surya#464
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-24 17:43:06 +00:00
Tarun Menta
869a321aa0
Bump version
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-23 17:38:46 -04:00
Tarun Menta
59fd62921b
Merge pull request #463 from datalab-to/dev
Dev
2025-09-23 17:31:25 -04:00
Tarun Menta
466aba72b9
Merge pull request #461 from datalab-to/layout-release
Layout Model Release
2025-09-23 17:30:08 -04:00
Tarun Menta
d3aecc0977
Pick correct dtype on T4 GPUs
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-23 17:22:34 -04:00
Tarun Menta
eb179cc543
Update layout batch sizes 2025-09-23 17:09:09 -04:00
Tarun Menta
9bee27c4e0
Fix tests 2025-09-23 16:57:27 -04:00
Tarun Menta
1d09025ea3
Bump foundation checkpoint 2025-09-23 16:39:10 -04:00
Tarun Menta
4d7be669ab
Models moved to S3 2025-09-23 16:33:05 -04:00
Tarun Menta
5811d072f4
Separate models for layout and OCR 2025-09-23 15:39:44 -04:00
Tarun Menta
c1719e982c
Update loaders with dtype instead of torch_dtype -- transformers
`torch_dtype` was deprecated, will silently fail
2025-09-22 12:49:51 -04:00
github-actions[bot]
e42aec0385
@Mohking1 has signed the CLA in datalab-to/surya#462
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-20 11:21:54 +00:00
Tarun Menta
42e016f49c
Update tqdm desc string based on founation model mode
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-19 12:51:01 -04:00
Tarun Menta
9ab25b3ede
Merge branch 'dev' into layout-release 2025-09-19 12:47:06 -04:00
Tarun Menta
20f9179503
Pin sliding window for layout 2025-09-18 15:51:32 -04:00
Tarun Menta
74e790ccd8
Tokenizer fix 2025-09-17 16:06:15 -04:00
Tarun Menta
9673ec6abe
Merge in recognition predictor changes from dev 2025-09-16 21:27:53 -04:00
Tarun Menta
2d5dd9b86f
Move back to old table_rec model for now 2025-09-16 21:18:37 -04:00
Vik Paruchuri
a7ffa7ee51
Merge pull request #457 from datalab-to/dev
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
Move flash attention funcs
2025-09-08 12:38:58 -04:00