Commit Graph

881 Commits

Author SHA1 Message Date
Zach Nussbaum
1c5cc17b3f
fix: don't break 2025-09-09 03:49:25 +00:00
Zach Nussbaum
9b015ea3ff
fix: don't use flash rope? 2025-09-09 03:48:24 +00:00
Vik Paruchuri
5234bc09c2 Move flash attention funcs
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-08 12:21:52 -04:00
Vik Paruchuri
ebf5ec72e1
Merge pull request #455 from datalab-to/vik/no_autoset
Get rid of attention method checks
2025-09-08 11:28:02 -04:00
Vik Paruchuri
a4c3c85279 Bump transformers 2025-09-08 09:58:46 -04:00
Vik Paruchuri
3c9dfe48bf Fix tests 2025-09-08 09:39:35 -04:00
Vik Paruchuri
605339d68b Better heuristics if implementation is None 2025-09-08 09:24:40 -04:00
Vik Paruchuri
4ed13823fc Get rid of attention method checks 2025-09-08 09:04:19 -04:00
Vik Paruchuri
cf6dbde72c Foundation predictor init
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-07 21:39:20 -04:00
Vik Paruchuri
35d2734412 Bump version 2025-09-07 21:12:42 -04:00
Vik Paruchuri
43e74bcd95
Merge pull request #451 from datalab-to/sdpa-fix
Foundation Model Performance Improvements
2025-09-07 20:52:46 -04:00
Vik Paruchuri
7da76d18cc Add attention implementation 2025-09-07 20:48:04 -04:00
Tarun Menta
b96ac10282
Sort by total image pixels, since we are in block mode now 2025-09-07 10:31:50 -04:00
Tarun Menta
179848b037
Fix cache positions for SDPA 2025-09-06 19:30:43 -04:00
Vik Paruchuri
2db254c7e8
Merge pull request #449 from datalab-to/master
Backport
2025-09-05 16:29:13 -04:00
Zach Nussbaum
dca1f48dd2
bump version 2025-09-05 19:58:31 +00:00
Zach Nussbaum
7b600a5886
bump version 2025-09-05 19:15:53 +00:00
Zach Nussbaum
13beff2723
Merge pull request #448 from datalab-to/dev
Dev
2025-09-05 15:13:33 -04:00
Zach Nussbaum
273040d8fc
Merge pull request #446 from datalab-to/multi-token
feat: multi-token decoding
2025-09-05 15:05:59 -04:00
Zach Nussbaum
8821050961
fix: update model
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-05 18:49:04 +00:00
Zach Nussbaum
b8a464c881
fix: opencv version fix 2025-09-05 18:32:51 +00:00
Ashish Uppala
1a32fa6cd1
Update README 2025-09-05 11:31:46 -04:00
github-actions[bot]
950559b5fd
@u-ashish has signed the CLA in datalab-to/surya#447 2025-09-05 15:16:58 +00:00
u-ashish
d9121e17bc Update README 2025-09-05 11:14:17 -04:00
Zach Nussbaum
691030445f
refactor: move min confidence to settings 2025-09-04 00:42:26 +00:00
Zach Nussbaum
9282294356
fix: don't do position rolling bc it's handled already 2025-09-03 21:37:26 +00:00
Zach Nussbaum
0ef78da9b4
Merge branch 'dev' into multi-token 2025-09-03 21:09:05 +00:00
Zach Nussbaum
2c172f2fb2
feat: not working conditional decode 2025-09-03 21:05:43 +00:00
github-actions[bot]
0410f44270
@davidxifeng has signed the CLA in datalab-to/surya#445
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-03 14:52:28 +00:00
Vik Paruchuri
4acbc8e86d
Merge pull request #444 from datalab-to/dev
Remove unused import
2025-09-02 14:25:01 -04:00
Vik Paruchuri
5715935c48 Remove unused import
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-09-02 13:56:46 -04:00
Tarun Menta
9234534274
Merge pull request #441 from datalab-to/dev
Dev
2025-08-29 15:03:22 -04:00
Tarun Menta
a9cd0a55ae
Bump version 2025-08-29 15:03:02 -04:00
Tarun Menta
65fed74c00
Merge pull request #440 from datalab-to/table-cell-updates
Model Update: New Tokenizer and Encoder
2025-08-29 15:01:11 -04:00
Tarun Menta
d4496f8caa
Fix bad test - Add real latex image
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-29 14:53:41 -04:00
Tarun Menta
e7ec40ecb4
Move new model to R2 2025-08-29 14:40:14 -04:00
Tarun Menta
cdc7b18af9
Merge branch 'table-cell-updates' of https://github.com/VikParuchuri/surya into table-cell-updates
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-28 10:42:55 -04:00
Zach Nussbaum
4cdf1080cd
fix: ignore on utf16 errors 2025-08-28 00:08:51 +00:00
Zach Nussbaum
5d1c369477
feat: new tokenizer 2025-08-28 00:08:51 +00:00
Tarun Menta
c37c42e72c
Merge branch 'vik/new-enc' into table-cell-newenc 2025-08-27 10:21:59 -04:00
Zach Nussbaum
bc7ee4895a
fix: ignore on utf16 errors
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-27 10:45:08 +00:00
Tarun Menta
a4ed5523d0
Filter more unwanted tags
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-26 15:16:31 -04:00
Zach Nussbaum
a37919feff
feat: new tokenizer
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-26 13:59:13 +00:00
Vik Paruchuri
cb82cc883c
Merge pull request #436 from datalab-to/dev
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
Dev
2025-08-23 12:52:58 -04:00
Vik Paruchuri
226bab71c7
Merge pull request #434 from sandy0kwon/update-license-openrail
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
update commercial license description in README and update LICENSE to…
2025-08-22 15:33:04 -04:00
github-actions[bot]
3429ad8a07
@n0kovo has signed the CLA in datalab-to/surya#435
Some checks failed
Integration test / build (push) Has been cancelled
Unit tests / build (t4_gpu) (push) Has been cancelled
Unit tests / build (ubuntu-latest) (push) Has been cancelled
Unit tests / build (windows-latest) (push) Has been cancelled
Test CLI scripts / build (push) Has been cancelled
2025-08-20 22:22:21 +00:00
github-actions[bot]
96da8a4de9
@sandy0kwon has signed the CLA in datalab-to/surya#434 2025-08-20 20:12:08 +00:00
sandy0kwon
c3884f6e3d
Merge branch 'master' into update-license-openrail 2025-08-20 16:11:55 -04:00
Sandy Kwon
b9f64ff8a4 update commercial license description in README and update LICENSE to reflect OpenRAIL update 2025-08-20 16:05:23 -04:00
Tarun Menta
78302facbf
Merge branch 'vik/new-enc' into table-cell-updates 2025-08-19 16:56:28 -04:00