tesseract-ocr/tessdata

Trained models with fast variant of the "best" LSTM models + legacy models

ocr, tesseract
These are the star and fork stats for the tesseract-ocr/tessdata repository. As of 06 May 2024, this repository has 5430 stars and 2023 forks.

tessdata

These language data files only work with Tesseract 4.0.0 and newer versions. They are based on the sources in tesseract-ocr/langdata on GitHub (still to be updated for 4.0.0 - 20180322). They contain models for the legacy Tesseract engine (--oem 0) as well as the new LSTM neural-net-based engine (--oem 1). The LSTM models (--oem 1) in these files have been updated to the integerized versions of tessdata_best on GitHub, so they should be faster but probably a little less accurate than tessdata_best. tessdata_fast...
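To make the two engine modes mentioned above concrete, here is a minimal sketch in Python that assembles a `tesseract` command line for either the legacy engine (`--oem 0`) or the LSTM engine (`--oem 1`). The helper name `build_tesseract_command`, the file names, and the tessdata path are hypothetical and chosen for illustration only. The sketch only builds the argument list and does not run Tesseract itself.

```python
import shlex
from typing import List, Optional

# Engine modes named in the README excerpt above.
OEM_LEGACY = 0  # legacy Tesseract engine
OEM_LSTM = 1    # LSTM neural-net-based engine


def build_tesseract_command(
    image_path: str,
    output_base: str,
    lang: str = "eng",
    oem: int = OEM_LSTM,
    tessdata_dir: Optional[str] = None,
) -> List[str]:
    """Return an argument list for the tesseract CLI (hypothetical helper).

    Only the two engine modes discussed in the excerpt are accepted here;
    this restriction is a choice made for the sketch.
    """
    if oem not in (OEM_LEGACY, OEM_LSTM):
        raise ValueError(f"unsupported engine mode for this sketch: {oem}")

    cmd = ["tesseract", image_path, output_base, "-l", lang, "--oem", str(oem)]
    if tessdata_dir is not None:
        # Point Tesseract at a local checkout of the language data files.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


if __name__ == "__main__":
    # Hypothetical paths; nothing is executed, the commands are only printed.
    legacy = build_tesseract_command("page.png", "out_legacy", oem=OEM_LEGACY,
                                     tessdata_dir="/path/to/tessdata")
    lstm = build_tesseract_command("page.png", "out_lstm", oem=OEM_LSTM,
                                   tessdata_dir="/path/to/tessdata")
    print(shlex.join(legacy))
    print(shlex.join(lstm))
```

With Tesseract installed, the returned list could be passed to `subprocess.run(cmd, check=True)`. Running the same image through both modes is one way to compare speed and accuracy on your own material.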
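| repo | techs | stars | weekly (stars) | forks | weekly (forks) |
|---|---|---|---|---|---|
| siyuan-note/siyuan | TypeScript, Go, JavaScript | 12.4k | 0 | 939 | 0 |
| ocrmypdf/OCRmyPDF | Python, Shell, Dockerfile | 10.1k | 0 | 775 | 0 |
| xushengfeng/eSearch | TypeScript, HTML, CSS | 2.3k | 0 | 204 | 0 |
| clovaai/donut | Python | 4.5k | 0 | 358 | 0 |
| maxent-ai/ocrpy | Jupyter Notebook, Python | 217 | 0 | 8 | 0 |
| Gabattal/Scripts-LeagueOfLegends | Python, Batchfile | 43 | 0 | 4 | 0 |
| axa-group/Parsr | JavaScript, TypeScript, Other | 5.4k | 0 | 292 | 0 |
| hiroi-sora/Umi-OCR | Python, Other | 9.3k | 0 | 923 | 0 |
| TheJoeFin/Text-Grab | C#, PowerShell | 2.5k | 0 | 171 | 0 |
| koreader/koreader-base | Lua, C, C++ | 109 | 0 | 96 | 0 |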