Hebrew Ocr Forms
Verified94/100Process and extract data from scanned Israeli government forms using OCR. Supports Tabu (land registry), Tax Authority forms, Bituach Leumi documents, and other official Israeli paperwork. Use when user asks to OCR Hebrew documents, extract data from Israeli forms, "lesarek tofes", parse Tabu extract, read scanned tax form, or process Israeli government documents. Includes Hebrew OCR configuration, field extraction patterns, and RTL text handling. Do NOT use for handwritten Hebrew recognition (requires specialized models) or non-Israeli form processing.
Trust score 94/100 (Verified) · 85+ installs · 2 GitHub contributors · MIT license
Israeli government forms still arrive in large volumes as scans and unstructured PDFs, with handwriting, stamps, and dense Hebrew text. Standard OCR tools struggle with Hebrew due to script direction, similar-looking characters, and low-quality scans. The result is hours of manual data entry and frequent errors.
npx skills-il add skills-il/localization --skill hebrew-ocr-forms -a claude-codeInstall on Claude.ai, Claude Desktop, ChatGPT, Manus, or other platforms
- 1. Click "Download ZIP" to download the skill files.
- 2. Open Claude Desktop and go to Customize > Skills.
- 3. Click "+" and select "Upload a skill", then upload the ZIP file.
- 4. Start a new conversation. The skill will activate automatically when relevant.
When to Apply
- Extracting data from scanned Israeli forms (Tabu extract, Tofes 106, ishur nikui)
- Batch-processing PDFs or images of Hebrew government documents, including multi-page PDFs
- Detecting TIN, Israeli ID (teudat zehut), or gush/chelka from printed text
- Choosing a Hebrew OCR engine (local Tesseract vs a cloud Vision API vs Claude Vision)
- Validating extracted fields (Israeli ID check digit, date format)
Try These Prompts
How do I scan an income tax form 106 and extract its data: salary, withheld tax, and deductions, into a JSON structure?
How do I extract data from a scanned Tabu (land registry) document? I need to retrieve property owners, block and parcel numbers, liens, and registrations.
How do I validate that a scanned government document (such as an ID card or business license) is authentic and that the extracted data is accurate?
Frequently Asked Questions
Related Skills
Guide developers in using Hebrew NLP models and tools including DictaLM, DictaBERT, AlephBERT, and ivrit.ai. Use when user asks about Hebrew text processing, Hebrew NLP, "ivrit", Hebrew tokenization, Hebrew NER, Hebrew sentiment analysis, Hebrew speech-to-text, or needs to process Hebrew language text programmatically. Covers model selection, preprocessing, and Hebrew-specific NLP challenges. Do NOT use for Arabic NLP (different tools) or general English NLP tasks.
Build RTL-first UI component libraries and design systems for Israeli applications with Hebrew typography. Use when user asks about Hebrew UI components, "itzuv" (design), Israeli design system, Hebrew font pairing, RTL component library, "tipografia ivrit" (Hebrew typography), or gov.il design patterns. Covers RTL-first component architecture, Hebrew font pairings (Heebo+Inter, Rubik+Source Sans Pro), gov.il design system patterns, Israeli formatting conventions (shekel sign, DD/MM/YYYY dates, 24-hour clock), and culturally appropriate UI for Israeli users. Do NOT use for general RTL CSS (use hebrew-rtl-best-practices) or accessibility audits (use israeli-accessibility-compliance instead).
Implement right-to-left (RTL) layouts for Hebrew web and mobile applications. Use when user asks about RTL layout, Hebrew text direction, bidirectional (bidi) text, Hebrew CSS, "right to left", or needs to build Hebrew UI. Covers CSS logical properties, Tailwind RTL, React/Next.js RTL setup, Hebrew typography, and font selection. Do NOT use for Arabic RTL (similar but different typography) unless user explicitly asks for shared RTL patterns.
Use at your own risk. Terms of Use · Security
Want to build your own skill? Try the Skill Creator · Submit a Skill