French IPA Transcription - Help

← Back to French transcription ↑ All Languages

About This Implementation

This French transcription app uses the Wiktionary French Pronunciation Module to generate phonemic transcriptions for French text.

The system uses comprehensive Wiktionary data dumps[1][2] as a lexicon to first retrieve transcriptions from the dictionary. When a word is not found in the lexicon, it falls back to generating transcriptions using the pronunciation module's rule-based approach.

Dialects Supported

Transcription Forms

Technical Information

This implementation combines the Wiktionary French pronunciation module with a comprehensive lexicon extracted from Wiktionary data dumps, providing both high accuracy for common words and broad coverage through rule-based generation.

Wiktionary French pronunciation module Pipeline Overview

1. Initialization & Normalization

The input string is prepared and normalized to remove orthographic variance that does not affect pronunciation.

2. Early Consonant Mutations

Fixing ambiguous consonants based on the following vowel.

2.1. The "Capital E" Protection

The digraphs œu and oeu are converted to a placeholder Eu.

Why? This prevents the letter C from softening.
Example: cœurcEu... → C remains hard /k/ because it is not followed by a lowercase 'e' or 'œ'.

2.2. C and G Logic

2.3. Consonant Devoicing

Rule: The letter b is devoiced to p when immediately followed by s or t.

3. Verb Morphology Overrides (pos="v")

Applied strictly when the module input is tagged as a verb.

4. Orthographic Standardization

Mapping lexical exceptions and fixed spelling patterns.

4.1 Assibilation (T → S)

Rule: ti becomes si before on, en, al, el.

Exceptions (Blockers): Assibilation is cancelled if the T is preceded by:

4.2. Special Prefix/Suffix Logic

Pattern Logic Example
eu- / neuro- Force eu to closed /ø/ even before R. Europe/ø.ʁɔp/
hyper- / super- Force er to open /ɛʁ/ (preventing /e/). supermarché/sy.pɛʁ.../
trans- / intrans- Force s to voiced /z/ before vowel. intransigeant/ɛ̃.tʁɑ̃.zi.ʒɑ̃/
(h)ex- Initial ex or in+ex/gz/ (Must be followed by vowel/h). examen/ɛg.za.mɛ̃/
désh- Force s to voiced /z/. déshabiller/de.za.bi.je/

4.3. Basic Digraphs

4.4. Posterior A Shift

5. Vowels, Ligatures, and Nasalization

5.1. Base Mappings

5.2. Nasalization Engine

Logic: The script scans for Vowel + n/m followed by a Consonant or End-of-Word.

The "Coin" Exception: If the sequence is specifically oi + n (and satisfying nasal conditions), the script overrides the standard output to /wɛ̃/.
Example: point/pwɛ̃/ (Not /pwɑ̃/).

5.3. Special Consonant Handling in Nasals

6. Structural Logic: Y and ILL

Handling the complex context-dependency of Y and the "Liquid L" digraphs.

6.1. The Y Hierarchy

The script determines whether y functions as a vowel /i/ or a semi-vowel /j/.

6.2. The ILL Logic (Liquid L)

The script processes ill sequences in a strict order of operations:

  1. Consonant + uill:/ɥij/.
    Example: cuillère/kɥi.jɛʁ/.
  2. Vowel + ill:/j/.
    Example: travailler/tʁa.va.je/ (The L is absorbed).
  3. Standard ill:/ij/.
    Example: fille/fij/.
  4. Final Vowel + il:/j/.
    Example: soleil/sɔ.lɛj/.

7. Word Endings (Deletions)

Removing silent letters at word boundaries ().

7.1. Standard List

Delete: d, g, p, t, s, z.

7.2. Complex Deletions

7.3. Protections

7.4 Special Endings

8. E-Rules & Syllabification

8.1. E + Double Consonant

Rule: e + CC → è + C.

8.2. Syllabification Engine

8.3. Schwa (ə) Logic

9. Glides and Reversion (Cluster Protection)

Converting high vowels to semi-vowels contextually.

9.1. Conversion

9.2. The Reversion Rule (Missing from most guides)

Script Safety Check: If converting a vowel to a glide creates an unpronounceable onset (Consonant + Liquid + Glide), the script reverts it to a full vowel.

10. Final Harmonization

Final polish of the IPA string.

11. Liaison Mutations (Sandhi)

When a silent final consonant becomes pronounced due to liaison (), or when vowels clash at boundaries, mutations occur.

11.1 Consonant Mutations

Letter Mutation Example IPA Result
D Becomes /t/ grand‿homme /gʁɑ̃.tɔm/
F Becomes /v/ neuf‿heures /nœ.vœʁ/
S, X Becomes /z/ beaux‿arts /bo.zaʁ/
I Inserts /j/ y a-t-il (i‿a) /i.ja.t‿il/

11.2 The Nasal Bridge

The script distinguishes between words that keep their nasal quality in liaison and those that denasalize.

12. Reference: Complete IPA Symbol Map

This table lists every phonetic symbol generated by the script, mapping the French orthography to the specific IPA character output.

Category IPA Symbol Description Typical Orthography
Oral Vowels /a/ Anterior A a, à (standard)
/ɑ/ Posterior A â, as, az
/e/ Closed E é, er, ez, ai (final verb)
/ɛ/ Open E è, ê, ai, ei, e (+consonant)
/ə/ Schwa (Mute E) e (unaccented)
/i/ High Front Vowel i, î, y
/y/ High Front Rounded u, û
/o/ vs /ɔ/ Closed O vs Open O /o/: ô, au, eau, o (+z)
/ɔ/: o (standard)
/ø/ vs /œ/ Closed EU vs Open EU /ø/: eu, œu (final/z/t)
/œ/: eu, œu (standard)
/u/ High Back Vowel ou, où
Nasal Vowels /ɑ̃/ Nasal A an, en, am, em
/ɛ̃/ Nasal I in, ain, ein, yn
/ɔ̃/ Nasal O on, om
/œ̃/ Nasal U un, um (often merges with /ɛ̃/)
Semi-Vowels /j/ Yod glide y, il, ill, i (+vowel)
/w/ W glide oi (/wa/), ou (+vowel)
/ɥ/ U glide u (+vowel)
Complex Consonants /ʃ/ Sh sound ch, sh
/ʒ/ Soft J j, g (+e/i/y)
/ɲ/ Palatal N gn
/ŋ/ Velar Nasal ng (in -ing)
/ʁ/ Uvular R r, rr, rh
Standard Consonants /b/ /d/ /f/ /ɡ/ /k/ /l/ /m/ /n/ /p/ /s/ /t/ /v/ /z/ Invariant Mapping b, d, f, g (hard), c (hard)/k/qu, l, m, n, p, s/ç, t, v, z/s(voiced)

For technical issues or suggestions, please visit our GitHub repository.