Our Products & Services
Via our GitHub, you can experiment with IAHLT's open-source annotated content and decide whether you would like to become an IAHLT member to access our large Hebrew & Arabic datasets and models.
Our products on the IAHLT GitHub
Services and Tools for Hebrew & Arabic
Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 300 contributors producing nearly 200 treebanks in over 100 languages.
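To make the three annotation layers concrete, here is a minimal sketch that reads one sentence in CoNLL-U, the tab-separated format in which UD treebanks are distributed (ten columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The sample sentence, the `Token` class, and the helper names are illustrative assumptions and are not taken from any IAHLT treebank or tool.

```python
from dataclasses import dataclass

# Illustrative English sentence in CoNLL-U; NOT taken from an IAHLT treebank.
SAMPLE = (
    "# text = Dogs bark.\n"
    "1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_\n"
    "2\tbark\tbark\tVERB\t_\tMood=Ind|Tense=Pres\t0\troot\t_\tSpaceAfter=No\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


@dataclass
class Token:  # hypothetical container, for illustration only
    id: int
    form: str
    lemma: str
    upos: str     # part of speech
    feats: dict   # morphological features
    head: int     # ID of the syntactic head (0 = root)
    deprel: str   # dependency relation to the head


def parse_feats(field):
    """Turn 'Mood=Ind|Tense=Pres' into a dict; '_' means no features."""
    if field == "_":
        return {}
    return dict(pair.split("=", 1) for pair in field.split("|"))


def parse_sentence(block):
    """Parse the basic word lines of one CoNLL-U sentence."""
    tokens = []
    for line in block.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        tokens.append(
            Token(int(cols[0]), cols[1], cols[2], cols[3],
                  parse_feats(cols[5]), int(cols[6]), cols[7])
        )
    return tokens


tokens = parse_sentence(SAMPLE)
for t in tokens:
    head = "ROOT" if t.head == 0 else tokens[t.head - 1].form
    print(f"{t.form}\t{t.upos}\t{t.feats}\t{t.deprel} -> {head}")
```

The sketch skips multiword-token ranges for brevity; a real Hebrew or Arabic pipeline would likely need to handle them, since a single written token can correspond to several syntactic words.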
IAHLT public contribution
The UD Hebrew-IAHLTWiki treebank consists of 5,000 contemporary Hebrew sentences drawn from a variety of Wikipedia entries.
Named-entity recognition (NER) (also known as (named) entity identification, entity chunking, and entity extraction) is a subtask of information extraction that seeks to locate and classify named entities mentioned in unstructured text into pre-defined categories such as person names, organizations, locations, medical codes, time expressions, quantities, monetary values, percentages, etc.
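As a rough illustration of "locate and classify", the sketch below converts token-level tags into labeled entity spans. It assumes the widely used BIO encoding (B- begins an entity, I- continues it, O is outside any entity). The sentence, the label set (PER, ORG, LOC), and the function name `bio_to_spans` are hypothetical and do not describe the output format of the IAHLT demos.

```python
def bio_to_spans(tokens, tags):
    """Group BIO-tagged tokens into (label, text) entity spans."""
    spans = []
    label, words = None, []
    for token, tag in zip(tokens, tags):
        starts_new = tag.startswith("B-") or (
            tag.startswith("I-") and tag[2:] != label  # stray I- tag: treat as a new entity
        )
        if starts_new:
            if words:
                spans.append((label, " ".join(words)))
            label, words = tag[2:], [token]
        elif tag.startswith("I-"):
            words.append(token)  # continue the current entity
        else:  # "O": outside any entity
            if words:
                spans.append((label, " ".join(words)))
            label, words = None, []
    if words:  # flush an entity that runs to the end of the sentence
        spans.append((label, " ".join(words)))
    return spans


# Made-up example sentence and tags, for illustration only.
tokens = ["Dana", "Cohen", "joined", "Acme", "Corp", "in", "Haifa"]
tags = ["B-PER", "I-PER", "O", "B-ORG", "I-ORG", "O", "B-LOC"]

spans = bio_to_spans(tokens, tags)
print(spans)
# [('PER', 'Dana Cohen'), ('ORG', 'Acme Corp'), ('LOC', 'Haifa')]
```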
IAHLT Automatic Annotation Demos
Automatic Hebrew NER demo
Automatic Arabic NER demo
Please contact us for any content creation or annotation needs (text and audio).
IAHLT Open Source Use
Forge with hosting for git/mailing list/CI and more
Code hosting/continuous integration/mailing list/issue tracking
Wikidata entity extractor
Entity linking and NE preannotation
Coreference annotation tool
UD parser and NE recognizer
UD parsing and NE recognition
Classical UD parser
Sentence segmentation (HE + AR) and lemmatization
Universal Dependencies annotation tool
Annotation for UD
Named entity annotation tool
Named entity annotation
Graph-based corpus search tool
Corpus search and validation for lemmatization and UD