ACE — Reverse Analyzer Flowchart (Current Approach)

ACE — Reverse Analyzer Processing Flowchart

This diagram shows how the current Reverse Analyzer processes an Arabic or Latin token. The current engine is not a simple one-way “strip everything” pipeline. Instead, it preserves context, protects exact/non-derived items, routes the token through competing analysis families, and then ranks the resulting candidates.

Observed Token: Arabic (Unicode) OR Latin transliteration

Normalization: lowercase / trim / tokenize cleanly; preserve digraphs, long vowels, hamza logic

Context Window: check previous token / nearby particles; may strengthen perfect / subjunctive / jussive

Left-Edge Proclitic Handling: wa / fa / bi / li / la / sa; also combined forms: wal / fal / bial ...; record prefix separately; do not mix with root

Exact / Non-Derived Protection: particles, pronouns, demonstratives, relatives; test full token and valid stripped remainder; protect strong exact matches from false morphology

Controlled Right-Edge Testing: object pronouns + family-aware suffix variants; perfect / imperfect / imperative / nominal; no single blind strip stage

Candidate Family Routing: Exact / Perfect / Imperfect / Imperative / Nominal; families compete; each may emit one or more candidates; candidate bundle keeps prefix, suffix, stem, root, form, confidence

Verb Routers: perfect active / passive; imperfect active / passive + mood logic; imperative as its own family

Nominal Routers: active/passive participles, masdar, adjectives; instruments, colors/defects, broken plurals; strict nominal evidence required

Root / Form / Weak Analysis: Form I–X pattern checks; strong, hollow, defective, assimilated, doubled; allow Form VIII assimilation and short imperatives

Translation / Dictionary Enrichment + Ranking: root support, gloss support, particle-context fit; rank competing candidates; best candidate displayed first; retain alternates when ambiguity is genuine

Final Reverse Analysis Output: token · prefix · rest · POS · category · tense/voice · form · pattern · root · suffix · removed tail · pronoun · confidence
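As an illustration of the left-edge proclitic handling and exact-protection stages, here is a minimal Python sketch for Latin-transliterated tokens. It is not ACE's actual code: the names `Split`, `split_left_edge`, `PROCLITICS`, and `EXACT_ITEMS`, the tiny lexicon, and the two-character minimum remainder are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List, Optional

# Proclitics named in the flowchart, combined forms first so that "wal" is
# tried alongside "wa". The ordering is an assumption of this sketch.
PROCLITICS = ("wal", "fal", "bial", "wa", "fa", "bi", "li", "la", "sa")

# Hypothetical exact lexicon; a real one would cover particles, pronouns,
# demonstratives, and relatives far more completely.
EXACT_ITEMS = {
    "min": "particle",
    "lakin": "particle",
    "huwa": "pronoun",
    "hadha": "demonstrative",
    "alladhi": "relative",
}


@dataclass
class Split:
    prefix: str                    # recorded separately, never merged into the root
    rest: str                      # remainder passed on to later stages
    exact_category: Optional[str]  # set when the remainder is a protected exact item


def normalize(token: str) -> str:
    """Trim and lowercase a transliterated token (a simplified stand-in)."""
    return token.strip().lower()


def split_left_edge(token: str) -> List[Split]:
    """Return every plausible prefix/remainder split instead of stripping blindly."""
    token = normalize(token)
    # Exact protection on the full token: "lakin" must not become la + kin.
    if token in EXACT_ITEMS:
        return [Split("", token, EXACT_ITEMS[token])]
    splits = [Split("", token, None)]  # the unsplit reading always survives
    for proclitic in PROCLITICS:
        rest = token[len(proclitic):]
        # Assumed guard: require at least two characters after the proclitic.
        if token.startswith(proclitic) and len(rest) >= 2:
            # Exact protection on the stripped remainder, e.g. wa + min.
            splits.append(Split(proclitic, rest, EXACT_ITEMS.get(rest)))
    return splits
```

With this sketch, `split_left_edge("lakin")` yields a single protected reading, while `split_left_edge("walkitab")` keeps the unsplit token alongside the `wal` and `wa` splits so that later stages can decide between them.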

Key idea: the current Reverse Analyzer is not just reversing the forward pipeline step by step. Instead, it uses a router-based, evidence-weighted approach: preserve context, protect exact items, test family-specific analyses, then rank the resulting candidates.
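The router-based, evidence-weighted idea can be sketched in Python as follows. Everything here is an assumption made for illustration: the `Candidate` fields, the toy regular-expression patterns (single-letter consonants only, no digraphs or long vowels), the base confidence of 0.5, the bonus weights, and the small context table are not taken from ACE itself.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional

C = "([^aiu])"  # toy "consonant": any single character that is not a short vowel


@dataclass
class Candidate:
    family: str
    stem: str
    root: str
    form: str
    confidence: float
    evidence: List[str] = field(default_factory=list)


def perfect_router(stem: str) -> List[Candidate]:
    # Toy Form I perfect, e.g. kataba
    m = re.fullmatch(f"{C}a{C}[aiu]{C}a", stem)
    return [Candidate("perfect", stem, "-".join(m.groups()), "I", 0.5)] if m else []


def imperfect_router(stem: str) -> List[Candidate]:
    # Toy Form I imperfect with an optional mood vowel, e.g. yaktubu / taktub
    m = re.fullmatch(f"(?:ya|ta|na|a){C}{C}[aiu]{C}[aiu]?", stem)
    return [Candidate("imperfect", stem, "-".join(m.groups()), "I", 0.5)] if m else []


def nominal_router(stem: str) -> List[Candidate]:
    # Toy nominal patterns: passive participle maCCuC, Form II masdar taCCiC
    out = []
    for pattern, form in ((f"ma{C}{C}u{C}", "I"), (f"ta{C}{C}i{C}", "II")):
        m = re.fullmatch(pattern, stem)
        if m:
            out.append(Candidate("nominal", stem, "-".join(m.groups()), form, 0.5))
    return out


ROUTERS = (perfect_router, imperfect_router, nominal_router)
KNOWN_ROOTS = {"k-t-b", "d-r-s"}  # hypothetical dictionary support
# Hypothetical context table: previous token -> family it strengthens
CONTEXT_FIT = {"lam": "imperfect", "lan": "imperfect", "qad": "perfect", "fi": "nominal"}


def analyze(stem: str, prev_token: Optional[str] = None) -> List[Candidate]:
    """Let every family compete, add evidence bonuses, and rank best first."""
    candidates = [c for router in ROUTERS for c in router(stem)]
    for c in candidates:
        if c.root in KNOWN_ROOTS:
            c.confidence += 0.2
            c.evidence.append("root support")
        if prev_token is not None and CONTEXT_FIT.get(prev_token) == c.family:
            c.confidence += 0.3
            c.evidence.append("particle-context fit")
    # Alternates are retained; only the order changes.
    return sorted(candidates, key=lambda c: c.confidence, reverse=True)
```

In this toy setup the unvowelled string `tadris` is genuinely ambiguous: `analyze("tadris", "lam")` ranks the imperfect reading first, `analyze("tadris", "fi")` ranks the nominal reading first, and both readings stay in the returned list either way.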

Current summary: normalize → context → proclitic handling → exact protection → family routing → family-aware suffix testing → root/form analysis → enrichment → ranking → final display