Introduction
KalimaLab is the most comprehensive Arabic language intelligence API for developers. 117 endpoints across 25 modules — dictionary, morphology, Quran, NLP, and AI — via REST API or 4 official SDKs.
What is KalimaLab?
KalimaLab is a comprehensive Arabic language intelligence API — programmatic access to the deepest Arabic language data ever offered commercially. Whether you are building a word game, an NLP pipeline, an educational app, a Quran study tool, or a voice assistant, KalimaLab provides the linguistic infrastructure you need.
Unlike general-purpose dictionary APIs, KalimaLab is designed Arabic-first. Every endpoint is built around Arabic morphology — roots (جذور), patterns (أوزان), إعراب, and classical grammar. Usage is metered in Data Points (DP) rather than plain request counts, so cost aligns with the value of data extracted.
Key Features
477,000 Lemmas
Dictionary-grade depth: 70+ fields per entry sourced from Ibn Manẓūr, Farāhīdī, and modern corpora — fully searchable with four data tiers.
Morphology & إعراب
Full conjugation tables, root extraction, POS tagging, and complete إعراب parse trees with classical Arabic grammar labels.
Quran Module
114 surahs, 6,236 ayahs, word-by-word morphology, 7 reciters' audio, full-text search, and juz/hizb/page navigation.
NLP Pipeline
Diacritization, dialect identification (26 cities), prosody meter, spell check, transliteration, and text normalization.
AI Layer
Semantic search via pgvector, multilingual embeddings, machine translation (NLLB-200), TTS (Azure Neural + Google Chirp3), and ASR (Whisper).
4 Official SDKs
TypeScript, Python, Go, and PHP — all open source, fully typed, with automatic retries and async pagination.
Built for
Word Games
Arabic Wordle, Scrabble helpers, crossword generators — filter by letter count, root, or pattern.
Education
Build Arabic language learning apps with vocabulary quizzes, root exploration, and grammar exercises.
NLP Pipelines
Enrich Arabic NLP datasets with morphological annotations, roots, and POS tags.
Content Moderation
Verify that user input is valid Arabic, detect script, and normalize text before processing.
Language Support
The KalimaLab dataset covers Modern Standard Arabic with extended classical coverage. All words include transliteration, English meanings, and morphological metadata. Response text preserves full Arabic diacritics (tashkeel) where available.
30-second example
Install the SDK, add your API key, and make your first request:
npm install kalimalabimport { KalimaLab } from 'kalimalab'const client = new KalimaLab({ apiKey: process.env.KALIMALAB_API_KEY! })// Get today's word of the dayconst word = await client.words.daily()console.log(word.lemma) // كَتَبَconsole.log(word.primaryGloss) // to writeconsole.log(word.primaryPos) // verb✓Free Tier
Consistent Response Format
Every KalimaLab response follows the same envelope structure:
{ "data": { ... }, "error": null, "meta": { "requestId": "req_01j9abc...", "responseTimeMs": 12, "page": 1, "total": 477000, "totalPages": 12868 }}data — The requested resource(s). Always present on success.error — null null on success, or an error object with code and message.meta — Request metadata including pagination info and timing.Get started in 5 minutes
Follow the quickstart guide to make your first API request in under 5 minutes.