EVEREDGE
Research-Grade Linguistic Analysis Platform
Case study
EducationWeb Application8 weeks

An Arabic-language education initiative

EducationWeb Application8 weeks

A corpus of 676 Arabic doubled-verb conjugations needed to be explorable by researchers and students alike. Existing academic tools required CSV wrangling and produced static PNGs that nobody could interact with.

The challenge

What they came to us with.

The initiative maintains a 676-entry corpus of Arabic doubled-verb (fi‘l muda‘‘af) conjugations indexed by fāʾ / ʿayn letter combinations. Existing academic tooling required manual CSV wrangling and produced static PNG outputs — fine for a paper, useless for a student trying to explore the space.

They needed something researchers could publish from and students could browse — same data, two audiences — without forcing either to write Python.

Our approach

How we built it.

The front end is an interactive Next.js matrix of all 676 entries. Click any cell and the platform surfaces its conjugation variants, phonetic attributes, and the cluster it belongs to. Behind it sits a Python pipeline that extracts phonetic features from the sifat strings, computes UPGMA hierarchical clusters using Jaccard distance, reduces to 2D via UMAP for visualization, and selects k by silhouette score.

The deliberate choice here was to auto-discover phonetic features rather than hand-code them. The initial hand-engineered feature set was concealing real cluster structure; letting the data reveal what was load-bearing surfaced fāʾ / ʿayn co-occurrence patterns that the hand-coded version had flattened out.

The outcome

What shipped. What changed.

Time to explore corpus

Before: Days (CSV wrangling)After: Minutes (interactive)
~100x

Feature discovery method

Before: Hand-codedAfter: Auto-discovered from sifat
Automated

Cluster count selection

Before: Manual tuningAfter: Silhouette-guided
Automated

Output reproducibility

Before: Ad-hoc PNGsAfter: Deterministic exports
Citable

Ready to achieve similar results?

Let's discuss how we can help transform your business.