Introduction

KalimaLab is the most comprehensive Arabic language intelligence API for developers. 117 endpoints across 25 modules — dictionary, morphology, Quran, NLP, and AI — via REST API or 4 official SDKs.

What is KalimaLab?

KalimaLab is a comprehensive Arabic language intelligence API — programmatic access to the deepest Arabic language data ever offered commercially. Whether you are building a word game, an NLP pipeline, an educational app, a Quran study tool, or a voice assistant, KalimaLab provides the linguistic infrastructure you need.

Unlike general-purpose dictionary APIs, KalimaLab is designed Arabic-first. Every endpoint is built around Arabic morphology — roots (جذور), patterns (أوزان), إعراب, and classical grammar. Usage is metered in Data Points (DP) rather than plain request counts, so cost aligns with the value of data extracted.

Key Features

477,000 Lemmas

Dictionary-grade depth: 70+ fields per entry sourced from Ibn Manẓūr, Farāhīdī, and modern corpora — fully searchable with four data tiers.

Morphology & إعراب

Full conjugation tables, root extraction, POS tagging, and complete إعراب parse trees with classical Arabic grammar labels.

Quran Module

114 surahs, 6,236 ayahs, word-by-word morphology, 7 reciters' audio, full-text search, and juz/hizb/page navigation.

NLP Pipeline

Diacritization, dialect identification (26 cities), prosody meter, spell check, transliteration, and text normalization.

AI Layer

Semantic search via pgvector, multilingual embeddings, machine translation (NLLB-200), TTS (Azure Neural + Google Chirp3), and ASR (Whisper).

4 Official SDKs

TypeScript, Python, Go, and PHP — all open source, fully typed, with automatic retries and async pagination.

Built for

Word Games

Arabic Wordle, Scrabble helpers, crossword generators — filter by letter count, root, or pattern.

Education

Build Arabic language learning apps with vocabulary quizzes, root exploration, and grammar exercises.

NLP Pipelines

Enrich Arabic NLP datasets with morphological annotations, roots, and POS tags.

Content Moderation

Verify that user input is valid Arabic, detect script, and normalize text before processing.

Language Support

Arabic-first477,000 lemmasModern Standard ArabicClassical Arabic

The KalimaLab dataset covers Modern Standard Arabic with extended classical coverage. All words include transliteration, English meanings, and morphological metadata. Response text preserves full Arabic diacritics (tashkeel) where available.

30-second example

Install the SDK, add your API key, and make your first request:

npm install kalimalab

import { KalimaLab } from 'kalimalab'const client = new KalimaLab({ apiKey: process.env.KALIMALAB_API_KEY! })// Get today's word of the dayconst word = await client.words.daily()console.log(word.lemma)        // كَتَبَconsole.log(word.primaryGloss) // to writeconsole.log(word.primaryPos)   // verb

✓Free Tier

The free plan includes 500 DP/day — more than enough to explore the API and build a prototype. Common words cached at 0 DP. No credit card required.

Consistent Response Format

Every KalimaLab response follows the same envelope structure:

{  "data": { ... },  "error": null,  "meta": {    "requestId": "req_01j9abc...",    "responseTimeMs": 12,    "page": 1,    "total": 477000,    "totalPages": 12868  }}

data — The requested resource(s). Always present on success.

error — null null on success, or an error object with code and message.

meta — Request metadata including pagination info and timing.

Get started in 5 minutes

Follow the quickstart guide to make your first API request in under 5 minutes.

Quickstart guide Get a free API key

Edit this page on GitHub

NextQuickstart