Commit graph

14 commits

Author SHA1 Message Date
Brent Schroeter
24e59e0bfc properly cast null sql values 2026-01-15 21:44:45 +00:00
Brent Schroeter
f0a4ef253b re-run ocr as needed to screen false positives 2026-01-15 21:33:57 +00:00
Brent Schroeter
9a94719dc1 replace sharpness edge detection with gradient 2026-01-15 21:33:00 +00:00
Brent Schroeter
ce42ab58f1 fix sloppy llm code for pdf parsing 2026-01-15 21:33:00 +00:00
Brent Schroeter
1ca2238c5d switch from sqlite to phonograph 2026-01-15 21:31:55 +00:00
Brent Schroeter
d48b672e1b improve contrast norm and sharpness measurement 2026-01-15 21:31:55 +00:00
Brent Schroeter
3da76d4537 reuse pdf ocr when available 2025-12-20 02:17:03 +00:00
Brent Schroeter
ac7e93a75b add interchangeable ocr engines 2025-11-07 05:41:18 +00:00
Brent Schroeter
d5757e3811 rewrite data fetching into archive_item.py 2025-10-04 18:03:03 -07:00
Brent Schroeter
4d9161b043 rewrite to engine.py 2025-10-04 15:10:10 -07:00
Brent Schroeter
815934ad23 store results to sqlite 2025-08-18 20:31:55 -07:00
Brent Schroeter
d33a7dc515 crop detection tuning 2025-08-10 22:56:25 -07:00
Brent Schroeter
a5e3a2a429 add ocr crop warnings 2025-08-10 22:10:16 -07:00
Brent Schroeter
8dbcb19b43 init 2025-08-10 12:27:39 -07:00