Relation Extraction
Event-Argument Relation Extraction corpus and model (Open Source). We support three relations (has Agent, hasLocation, hasDate) between
events and 21 entity types (see
article).
Performance: WojoodHadath (93.99%) WojoodOutOfDoman
(74.90%)
Method: We formulated
the relation extraction task as a Natural Language
Inference task and fine-tuned a BERT model using a large
set of sentence pairs (NLI dataset) extracted from the
WojoodHadath Corpus. (see
article).
WojoodHadath Corpus:
We extended the Wojood nested NER corpus (550K tokens), by
manually annotating event entities with 3 relations.
WojoodOutOdDomain Corpus: New corpus with 80K tokens in MSA, manually annotated
with entities and relations as WojoodHadath. It
covers 10 domains (Economics, Finance, Politics, Science,
Technology, Art, Law, Agriculture,
History, and Sports).
Relations:
has Agent: participant(s) involved in the event (Domain:
Event, Range: PERS, ORG, OCC, NORP)
hasLocation: where the event occurred (Domain: event,
Range: GPE, LOC, FAC)
hasDate: when the event occurred (Domain: event,
Range: TIME, DATE)
SinaTools:
Relation Extraction module as python library.
GitHub:
training source code + sample data (~20 sentences with
event mentions).
Hugging Face:
fine-tuned BERT model using WojoodHadath.
WojoodHadath
(Corpus only)
WojoodOutOfDomain
(Corpus only)
Alaa Aljabari, Lina Duaibes, Mustafa Jarrar, Mohammed Khalilia: Event-Arguments Extraction Corpus and Modeling using BERT for Arabic. In Proceedings of the Second Arabic Natural Language Processing Conference (ArabicNLP 2024), Bangkok, Thailand. Association for Computational Linguistics.