Which OCR engine is best for Hebrew?

For clean scans at 300 DPI or higher, and for privacy-sensitive data that must stay local, Tesseract with its LSTM engine is usually enough. For noisy scans, mixed form types, or variable quality, Google Cloud Vision or Azure Read usually gives better results. Claude Vision is a good fit when you also need the model to understand the document, not just extract text.
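
For the local option, a minimal sketch of calling the Tesseract command line with the Hebrew model and the LSTM engine might look like the following. It assumes the tesseract binary and the heb language data are installed. The helper names and the page segmentation mode (--psm 6) are illustrative choices, not settings taken from this skill.

```python
import subprocess


def build_tesseract_cmd(image_path, langs=("heb",), dpi=300, psm=6):
    """Build a Tesseract CLI command that prints recognized text to stdout.

    --oem 1 selects the LSTM-only engine. --psm 6 treats the page as a single
    uniform block of text (an assumption; dense forms may need another mode).
    """
    return [
        "tesseract", str(image_path), "stdout",
        "-l", "+".join(langs),
        "--oem", "1",
        "--psm", str(psm),
        "--dpi", str(dpi),
    ]


def ocr_image(image_path, **kwargs):
    """Run Tesseract and return the extracted text (needs tesseract on PATH)."""
    cmd = build_tesseract_cmd(image_path, **kwargs)
    result = subprocess.run(cmd, capture_output=True, encoding="utf-8", check=True)
    return result.stdout
```

Keeping command construction separate from execution makes the settings easy to inspect and log before a batch run.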

Why does OCR read an Israeli ID as 8 digits instead of 9?

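OCR often drops or misreads a single digit, and a leading zero is especially easy to lose. Israeli ID numbers are 9 digits including the check digit, and shorter numbers are left-padded with zeros. Always run the check-digit algorithm on the extracted number. If it fails, flag the document for manual review or rescan at a higher resolution (600 DPI).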
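
As an illustration, here is a small sketch of the commonly documented Luhn-style check for Israeli ID numbers: digits are weighted 1, 2, 1, 2, ... from the left, two-digit products are reduced by 9, and the total must be divisible by 10. The function names are hypothetical, and left-padding a short read with zeros is a heuristic that only recovers a lost leading zero.

```python
def normalize_israeli_id(raw):
    """Keep ASCII digits only and left-pad to 9; None if it cannot be an ID."""
    digits = "".join(ch for ch in raw if ch in "0123456789")
    if not digits or len(digits) > 9:
        return None
    return digits.zfill(9)


def is_valid_israeli_id(raw):
    """Return True if the number passes the check-digit test."""
    id9 = normalize_israeli_id(raw)
    if id9 is None:
        return False
    total = 0
    for i, ch in enumerate(id9):
        product = int(ch) * (1 if i % 2 == 0 else 2)
        # A two-digit product contributes the sum of its digits.
        total += (product - 9) if product > 9 else product
    return total % 10 == 0
```

A failed check does not say which digit is wrong, so the sensible follow-up is still a rescan or manual review rather than automatic correction.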

Do these engines work on handwritten Hebrew?

Not reliably. OCR accuracy drops sharply on handwritten Hebrew. For forms with handwritten parts, extract the printed parts automatically and flag the handwritten sections for manual review.

How do I handle mixed Hebrew-Arabic text in government forms?

Run OCR with a heb+ara+eng language list. Hebrew and Arabic are both RTL but occupy different Unicode ranges, and English text and digits run LTR, so Bidi normalization is essential after extraction.
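
One possible post-extraction cleanup step is sketched below: it strips explicit directional control characters and tags each token by script using the basic Hebrew and Arabic Unicode blocks. This is only part of the job and does not implement the Unicode Bidirectional Algorithm. The function names are hypothetical.

```python
import re

# Explicit bidi controls: LRM, RLM, ALM, embeddings/overrides, isolates.
_BIDI_CONTROLS = re.compile("[\u200e\u200f\u061c\u202a-\u202e\u2066-\u2069]")


def strip_bidi_controls(text):
    """Remove invisible directional marks that OCR output may contain."""
    return _BIDI_CONTROLS.sub("", text)


def script_of(token):
    """Classify a token by its first strong character (basic blocks only)."""
    for ch in token:
        cp = ord(ch)
        if 0x0590 <= cp <= 0x05FF:
            return "hebrew"
        if 0x0600 <= cp <= 0x06FF:
            return "arabic"
        if ch.isascii() and ch.isalpha():
            return "latin"
    return "neutral"  # digits, punctuation, or anything outside these blocks


def tag_tokens(line):
    """Split a cleaned line on whitespace and label each token's script."""
    clean = strip_bidi_controls(line)
    return [(tok, script_of(tok)) for tok in clean.split()]
```

Tagging runs by script makes it easier to route Hebrew and Arabic fields to separate validators and to spot digits that ended up on the wrong side of a label.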

Hebrew Ocr Forms

Verified · 94/100

Process and extract data from scanned Israeli government forms using OCR. Supports Tabu (land registry), Tax Authority forms, Bituach Leumi documents, and other official Israeli paperwork. Use when user asks to OCR Hebrew documents, extract data from Israeli forms, "lesarek tofes", parse Tabu extract, read scanned tax form, or process Israeli government documents. Includes Hebrew OCR configuration, field extraction patterns, and RTL text handling. Do NOT use for handwritten Hebrew recognition (requires specialized models) or non-Israeli form processing.

The Problem

Israeli government forms still arrive in large volumes as scans and unstructured PDFs, with handwriting, stamps, and dense Hebrew text. Standard OCR tools struggle with Hebrew due to script direction, similar-looking characters, and low-quality scans. The result is hours of manual data entry and frequent errors.

skills-il Localization | 85 installs | 1,592 views

1.1.0 | MIT | GitHub

Updated: June 10, 2026 | Tags: ocr, hebrew, forms, government, tabu, tax, bituach-leumi

npx skills-il add skills-il/localization --skill hebrew-ocr-forms -a claude-code

Install on Claude.ai, Claude Desktop, ChatGPT, Manus, or other platforms

1. Click "Download ZIP" to download the skill files.
2. Open Claude Desktop and go to Customize > Skills.
3. Click "+" and select "Upload a skill", then upload the ZIP file.
4. Start a new conversation. The skill will activate automatically when relevant.

When to Apply

Extracting data from scanned Israeli forms (Tabu extract, Tofes 106, ishur nikui)
Batch-processing PDFs or images of Hebrew government documents, including multi-page PDFs
Detecting TIN, Israeli ID (teudat zehut), or gush/chelka from printed text
Choosing a Hebrew OCR engine (local Tesseract vs a cloud Vision API vs Claude Vision)
Validating extracted fields (Israeli ID check digit, date format)
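
As a rough sketch of field detection and date validation on OCR output, the snippet below pulls gush/chelka numbers from printed labels and checks a date string. The label layout ("גוש" and "חלקה" followed by a number, with an optional colon) and the DD/MM/YYYY format are assumptions about typical documents, and the function names are hypothetical.

```python
import re
from datetime import datetime

# Assumed label layout: the Hebrew label, an optional colon, then digits.
_GUSH = re.compile(r"גוש\s*:?\s*(\d+)")
_CHELKA = re.compile(r"חלקה\s*:?\s*(\d+)")


def extract_gush_chelka(text):
    """Return (gush, chelka) as strings, or None for a label that is missing.

    Only the first match of each label is used, so a later sub-parcel
    label that also contains "חלקה" is ignored.
    """
    gush = _GUSH.search(text)
    chelka = _CHELKA.search(text)
    return (
        gush.group(1) if gush else None,
        chelka.group(1) if chelka else None,
    )


def parse_il_date(value):
    """Parse a DD/MM/YYYY date; None if it is not a real calendar date."""
    try:
        return datetime.strptime(value.strip(), "%d/%m/%Y").date()
    except ValueError:
        return None
```

Returning None instead of raising keeps batch runs going, and the caller can flag any document with a missing or invalid field for manual review.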

Try These Prompts

Tax form OCR

How do I scan an income tax Form 106 and extract its data (salary, withheld tax, and deductions) into a JSON structure?

Tabu extract

How do I extract data from a scanned Tabu (land registry) document? I need to retrieve property owners, block and parcel numbers, liens, and registrations.

Document validation

How do I validate that a scanned government document (such as an ID card or business license) is authentic and that the extracted data is accurate?

Frequently Asked Questions

Related Skills

Hebrew Nlp Toolkit

Verified·97

Author: skills-il

v1.1.0 · Popular

Guide developers in using Hebrew NLP models and tools including DictaLM, DictaBERT, AlephBERT, and ivrit.ai. Use when user asks about Hebrew text processing, Hebrew NLP, "ivrit", Hebrew tokenization, Hebrew NER, Hebrew sentiment analysis, Hebrew speech-to-text, or needs to process Hebrew language text programmatically. Covers model selection, preprocessing, and Hebrew-specific NLP challenges. Do NOT use for Arabic NLP (different tools) or general English NLP tasks.

5.0 rating | 301 installs | 1,618 views

Claude Code, Cursor, GitHub Copilot, +5

Israeli UI Design System

Verified·91

Author: skills-il

v1.1.0 · Popular

Build RTL-first UI component libraries and design systems for Israeli applications with Hebrew typography. Use when user asks about Hebrew UI components, "itzuv" (design), Israeli design system, Hebrew font pairing, RTL component library, "tipografia ivrit" (Hebrew typography), or gov.il design patterns. Covers RTL-first component architecture, Hebrew font pairings (Heebo+Inter, Rubik+Source Sans Pro), gov.il design system patterns, Israeli formatting conventions (shekel sign, DD/MM/YYYY dates, 24-hour clock), and culturally appropriate UI for Israeli users. Do NOT use for general RTL CSS (use hebrew-rtl-best-practices) or accessibility audits (use israeli-accessibility-compliance instead).

0.0 rating | 132 installs | 1,434 views

Claude Code, Cursor, GitHub Copilot, +5

Hebrew RTL Best Practices

Verified·91

Author: skills-il

v1.2.0 · Popular

Implement right-to-left (RTL) layouts for Hebrew web and mobile applications. Use when user asks about RTL layout, Hebrew text direction, bidirectional (bidi) text, Hebrew CSS, "right to left", or needs to build Hebrew UI. Covers CSS logical properties, Tailwind RTL, React/Next.js RTL setup, Hebrew typography, and font selection. Do NOT use for Arabic RTL (similar but different typography) unless user explicitly asks for shared RTL patterns.

0.0 rating | 486 installs | 1,875 views

Claude Code, Cursor, GitHub Copilot, +5
