
ACE — Reverse Analyzer Processing Flowchart

This diagram shows how the Reverse Analyzer decomposes a fully formed Arabic or Latin token back into its grammatical components. Conceptually, it walks the forward pipeline in reverse.

Input: Observed Token, Arabic (Unicode) OR Latin

1. Normalization: arabicToLatin(); normalize hamza / shadda / sukūn
2. Strip Conjunctions: wa / fa / bi / li / la / sa; particle 24–44 groups; record order & spacing
3. Article Detection: al / waal / bial / fal; sun vs moon letter; restore base consonant
4. Suffix & Pronoun Removal: verb suffix (perfect/imperfect); declension suffix (15–23); object / genitive pronouns
5. Stem Isolation: remove VN / prefixes; retain pure stem
6. Form & Root Recovery: Form I–X pattern match; extract r0, r1, r2 (+ weak rules)
7. Final Analysis: ace_assess(); form · tense · pronoun · voice
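The Article Detection step mentions sun vs moon letters and restoring the base consonant. The following is a minimal sketch of that idea only, not ACE's actual code: the function name detect_article, the plain ASCII transliteration, and the SUN_LETTERS list are all assumptions made for illustration. It handles the bare article after any conjunctions have already been stripped, and it does no lexicon check, so a word that merely begins with "al" would be split as well.

```python
# Hypothetical sketch; ACE's real transliteration scheme and rules may differ.
# Sun letters assimilate the "l" of the article (al + shams -> ash-shams).
# Emphatic sun letters are omitted because this toy scheme is ASCII-only.
SUN_LETTERS = ["sh", "th", "dh", "t", "d", "r", "z", "s", "l", "n"]  # digraphs first


def detect_article(token: str):
    """Return (has_article, remainder) for a Latin-script token.

    Assumes spellings without hyphens, e.g. "alqamar" (moon letter)
    or "ashshams" (sun letter, article assimilated).
    """
    # Unassimilated spelling: "al" + word (moon letters, and "al" + "l...").
    if token.startswith("al") and len(token) > 2:
        return True, token[2:]
    # Assimilated spelling: "a" + doubled sun letter, e.g. "a" + "sh" + "shams".
    if token.startswith("a"):
        for letter in SUN_LETTERS:
            doubled = letter + letter
            if token.startswith(doubled, 1) and len(token) > 1 + len(doubled):
                # Drop "a" and the assimilated copy; the base consonant remains.
                return True, token[1 + len(letter):]
    return False, token
```

For example, detect_article("ashshams") drops the "a" and the assimilated copy of "sh", leaving the base word "shams", while detect_article("alqamar") simply removes "al".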

Key idea: the Reverse Analyzer does not “guess randomly”. It removes layers in a deterministic order, mirroring the forward pipeline: particles → article → suffixes → stem → form → root.
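
The fixed ordering can be illustrated with a small sketch. This is an assumption-laden toy, not the ACE implementation: the names strip_prefix, strip_suffix, PIPELINE, and peel are hypothetical, the affix lists are tiny placeholders, and only the first three layers (particles, article, suffixes) are modeled. Stem, form, and root recovery are left out.

```python
from typing import Callable, List, Tuple

# A stage takes the remaining string and returns (removed_layer, remainder).
Stage = Callable[[str], Tuple[str, str]]


def strip_prefix(options: List[str]) -> Stage:
    """Build a stage that removes the first matching prefix, if any."""
    def stage(rest: str) -> Tuple[str, str]:
        for opt in options:
            if rest.startswith(opt) and len(rest) > len(opt):
                return opt, rest[len(opt):]
        return "", rest
    return stage


def strip_suffix(options: List[str]) -> Stage:
    """Build a stage that removes the first matching suffix, if any."""
    def stage(rest: str) -> Tuple[str, str]:
        for opt in options:
            if rest.endswith(opt) and len(rest) > len(opt):
                return opt, rest[:-len(opt)]
        return "", rest
    return stage


# Fixed order: particles -> article -> suffixes (toy affix lists only).
PIPELINE: List[Tuple[str, Stage]] = [
    ("particle", strip_prefix(["wa", "fa"])),
    ("article", strip_prefix(["al"])),
    ("suffix", strip_suffix(["un", "u"])),
]


def peel(token: str) -> Tuple[List[Tuple[str, str]], str]:
    """Apply every stage once, in order, recording what each one removed."""
    layers: List[Tuple[str, str]] = []
    rest = token
    for name, stage in PIPELINE:
        removed, rest = stage(rest)
        layers.append((name, removed))
    return layers, rest
```

Because the stage list is fixed and each stage tries its options in a fixed order, the same token always yields the same layers. With these toy rules, peel("waalkitabu") records "wa", then "al", then "u", and leaves "kitab" as the remainder.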