Sangheili (language)

"Their language doesn't translate in a literal manner, and each word has multiple meanings."

- Cortana



Sangheili describes multiple dialects of a language spoken by the species of the same name. Though the member species of the Covenant had their own unique languages, an advanced dialect of Sangheili, known as basic Sangheili, came to serve as the lingua franca throughout the entire Covenant Empire. And as such, many names of member species are derived from the Sangheili. Even after the fragmentation of the Covenant, many former client species of the Covenant continue to use the Sangheili language in lieu of their native tongues; Sangheili has even replaced their native language for many. A specific trade pidgin also existed within the Covenant fringe.

Description
Text, or writing, in Sangheili appears to be mostly triangle shapes and composed almost completely of equilateral triangles. The triangular characters have been seen oriented both in the horizontal left-to-right direction and in a vertical right-to-left direction.

Though originating with the Sangheili, an advanced dialect of their language has come to be the Covenant lingua franca, used to connect the different races and species. Curiously, in several instances of Forerunner-related terminology, the Sangheili appear to use the English translations of the original Forerunner terms, as for "Forerunner", "Requiem", "Reclaimer", "Didact", and "Librarian". During the San'Shyuum-Sangheili War, the San'Shyuum obtained a comprehensive understanding of the Sangheili language by brutally torturing and interrogating prisoners of war. After establishing peace with the Sangheili, the San'Shyuum of the fledgling Covenant relied on translation software built into their anti-gravity chairs to understand the language. The Ussans developed in isolation from other Sangheili, and their language evolved into a different form of the Sangheili dialect. It possessed similarities to Old Sangheili, but the language developed into a new dialect that could not be readily understood by the "average" Sangheili.

Structure
Sangheili words generally consists of simply structured syllables. Most syllables consist of a single consonant (e.g. /s/) or a consonant complex (e.g. /t͡s/) followed by a long vowel or a diphthong or in rarer cases a short vowel or a short vowel followed by "n". Other types of syllables include a moraic "nn", a short vowel, a single diphthong, a triphthong, or a long vowel. Finally, some syllables in the middle or, more commonly, at the end of their word might have their vowel component devoiced.

Intonation
Sangheili generally seems to possess word stress, though the possibility of dialects featuring pitch accent shouldn't necessarily be excluded. Sangheili words usually possess only a primary stress with longer words sometimes possessing a secondary stress in the first syllable. The primary stress usually falls on the penult (second from last syllable). However, in many words the stress might fall on the ultima (last syllable) or antepenult (third from last syllable).

Phonology
The Sangheili language includes 8 main vowels (/ä/, /ɑ/, /i/, /ɪ/, /e̞/, /ɔ/, /o/ and /u/) and 18 main consonants (/s/, /z/, /ʂ/, /ʐ/, /q/, /ɢ/, /ʈ/, /ɖ/, /f/, /b̪/, /ħ/, /j/, /ɰβ/, /ɱ/, /ɳ/, /ɴ/, /ɻ/ and /ʔ/). Three more consonants also appear in words borrowed from other Sangheili dialects or alien languages. These are /p̪/, /ɺɽ~ɭ/ and /ʋ/. In certain instances, the sounds /o̞/, /ɕ/, /ʑ/, /w/ and /ɽ/ can appear as allophones of /ɔ/, /ʂ/, /ʐ/, /ɰβ/ and /ɻ/ respectively. Furthermore, /ɯβ/ and /ʊ/ are allophones of /u/ while /ə/ can serve as allophone of /ä/ and /e̞/ when those two sounds are weakened.

Consonants /ɳ/, /ɱ/ and /ɴ/ can in some cases be moraic, in which case they are geminated and pronounced as /ɳ:/, /ɱ:/ and /ɴ:/. In addition some consonant complexes can be formed. These are /ʈ͡ʂ/, /ʈ͡ɕ/, /ɖ͡ʐ/, /ɖ͡ʑ/, /t͡s/, /ʈɰβ/, /ɖɰβ/, /ɳɰβ/ and /ɳɱ/.

/ɳɱ/, /ɳ:/ and /ɱ:/ appear only when /ɳ:/ precedes a labiodental /b̪/ or /ɱ/ resulting in /ɳ:/ being pronounced as either /ɳɱ/, /ɳ:/ or /ɱ:/ for phonological reasons.

When followed by /i/, consonants /ʂ/, /ʐ/, /ʈ͡ʂ/ and /ɖ͡ʐ/ can be pronounced as /ɕi/, /ʑi/ /qji/, /ɢji/, /ħji/, /ʈ͡ɕi/ and /ɖ͡ʑi/ respectively.

When followed by /i/, consonants /q/, /ɢ/ and /ħ/ can be pronounced as /qji/, /ɢji/ and /ħji/ respectively. When followed by /e̞/, consonants /q/, /ɢ/ and /ħ/ can be pronounced as /qje̞/, /ɢje̞/ and /ħje̞/ respectively.

/ɴ/ and /ɴ:/ appear before or after the uvular consonants /q/ and /ɢ/ in place of /ɳ/ and /ɳ:/. /ɺɽ~ɭ/ can sometimes be heard as either ɺɽ or ɭ but is generally an intermediate sound. The sound /ɺɽ~ɭ/ itself has no human equivalent. /ɻ/ and /ɽ/ allophones of the same consonant. In reality they are only approximates of the Sangheili r sounds which has no human equivalent. Most users prefer to use /ɻ/. /ä/ is near front and closer to the Japanese /ä/ but some speakers pronounce it as a central vowel like the Italian /ä/. Some speakers use /ä/ insead of /ɑ/. /e̞/ and /i/ are near front vowels and not front vowels. /ɔ/ and /o̞/ are allophones of the same vowel. As such their use depends on the speaker’s preference. /ʊ/ can be used instead of /u/ when /u/ is short only. /ɯβ/ can be used instead of /u/ regardless of whether /u/ is short or long. The sound itself is compressed and neither rounded nor fully unrounded. It is pronounced like the Japanese u. /ɰβ/ is a compressed labiovelar approximant. It can also be symbolized as /wβ/. It is pronounced like the Japanese w.

Sometimes /ɖ͡ʐ/ can be pronounced as /ʐ/ or /ɖ͡ʐ~ʐ/.

Sangheili also makes use of the following diphthongs:

/e̞ɪ/, can be pronounced as /e̞j/ before vowels

/ou/ also pronounced as /oɯβ/, /oʊ/, /o̞u/, /o̞ɯβ/ or /o̞ʊ/

/äɪ/

/ɔɪ/ also pronounced as /o̞ɪ/ and sometimes pronounced as /ɔj/ or /o̞j/ before vowels.

/äu/ also pronounced as /äɯβ/ or /äʊ/ (rare)

Phonetic transcription
In order to phonetically transcribe Sangheili, 343 Industries uses the following transliteration system:


 * a e correspond to /ä/ and /e̞/ respectively. Both can correspond to /ə/ if the syllable is weakened.
 * i corresponds to /ɪ/.
 * ah corresponds to /ɑ:/. Some speakers pronounce it as /ä:/ though.
 * eh corresponds to /e̞:/
 * o corresponds to /ɔ/ and /o̞/ depending on the speaker's preference.
 * oh corresponds to /ɔ:/ and /o̞:/ depending on the speaker's preference.
 * u corresponds to /u/. Some speakers might also pronounce it as /ɯβ/ or /ʊ/.
 * uh corresponds to /u:/. Some speakers might pronounce it as /ɯβ:/.
 * aa corresponds to /ɑ:/. Some speakers pronounce it as /ä:/ though.
 * ee corresponds to /i:/ but can be pronounced as /ɪ:/ or /i~ɪ:/
 * uu corresponds to /u:/. Some speakers might also pronounce it as /ɯβ:/.
 * oo corresponds to /u:/. Some speakers also pronounce it as /ɯβ:/ or /o:/.
 * y corresponds to /j/ when it is between two vowels, or at the beginning of a word. When it is at the end of a word or before a consonant or after any consonant except nn it corresponds to /ɪ/.
 * t corresponds to /ʈ/.
 * d corresponds to /ɖ/.
 * k corresponds to /q/ or /k~q/, but is pronounced as /qj/ when followed by /i/ or /e̞/.
 * g corresponds to /ɢ/or /g~ɢ/, but is pronounced as /ɢj/ or /g~ɢ j / when followed by /i/ or /e̞/.
 * gh corresponds to /ɢ/ or /ɢh/ depending on the speaker's preference. When followed by e (thus forming ghee) the resulting sounds will be /ɢje̞:/ and /ɢhe̞:/. When followed by ei (thus forming ghei) the resulting sounds will be /ɢje̞ɪ/, /ɢhe̞ɪ/, /ɢji:/ and /ɢhe:/.
 * r corresponds to /ɻ/ and /ɽ/ depending on the speaker's preference.
 * w corresponds to /ɰβ/. Sometimes some speakers can pronounce it as /w/.
 * f corresponds to /f/
 * b corresponds to /b̪/
 * ch corresponds to /ʈ͡ʂ/, but can be pronounced as /ʈ͡ɕ/ when found before /i/.
 * sh corresponds to /ʂ/, but can be pronounced as /ɕ/ when found before /i/.
 * j corresponds to /ɖ͡ʐ/, but can be pronounced as /ɖ͡ʑ/ when found before /i/. The sounds /ɖ͡ʐ/ and /ɖ͡ʑ/ can be pronounced as /ʐ/or /ɖ͡ʐ~ʐ/ and /ʑ/ or /ɖ͡ʑ~ʑ/, respectively.
 * s corresponds to /s/.
 * z corresponds to /z/.
 * ts corresponds to /t͡s/.
 * n corresponds to /ɳ/. When n is followed or preceded by the uvular consonants /q/ and /ɢ/ it corresponds to /ɴ/ and when followed by any labiodental consonant ( /ɱ/, /b̪/, /p̪/, /f/, /ʋ/) it corresponds to /ɳɱ/ or /ɳ~ɱ/.
 * nn corresponds to /ɳ:/. When n is followed by the uvular consonants /q/ and /ɢ/ it corresponds to /ɴ:/ or /ɳɴ/ or /ɳ:/ and when followed by any labiodental consonant ( /ɱ/, /b̪/, /p̪/, /f/, /ʋ/) it corresponds to /ɱ:/ or /ɳɱ/ or /ɳ:/.
 * m corresponds to /ɱ/.
 * h corresponds to /ħ/, but is pronounced as /ħj/ when followed by /i/.
 * tw_, dw_, nw_ correspond to /ʈɰβ/, /ɖɰβ/ and /ɳɰβ/, respectively. The symbol [_] represents any vowel.
 * ' corresponds to /ʔ/
 * - is used after h to eliminate confusion by separating syllables. For instance is pronounced as uh-eh as /u:i:/ while uheh as /uħe:/
 * ae corresponds to /äə/ or /äe̞/ or /e̞:/. In the latter two cases it is not a diphthong.
 * ai corresponds to /äɪ/ or /e̞ɪ/ but is usually pronounced /äɪ/.
 * au corresponds to /äu/, /äɯβ/ or /äʊ/ depending on the speaker's preference.
 * ay corresponds to /e̞ɪ/. When followed by a vowel it corresponds to /e̞j/. It may rarely be pronounced as /äj/ and not be a diphthong when y is part of a different syllable but that is unknown.
 * ei corresponds to /e̞ɪ/ or /i:/ (rarely) depending on the speaker's preference.
 * ey corresponds to /e̞ɪ/. When followed by a vowel it corresponds to /e̞j/.
 * ie corresponds to /äɪ/.
 * ou corresponds to /ou/, /oɯβ/, /oʊ/, /o̞u/, /o̞ɯβ/ or /o̞ʊ/ depending on the speaker's preference.
 * oy corresponds to /ɔɪ/ or /o̞ɪ/ depending on the speaker's preference. When followed by a vowel it corresponds to /ɔj/ and /o̞j/ respectively.
 * eay corresponds to /ɪe̞ɪ/. In this case the diphthong is /e̞ɪ/ and /ɪ/ is syllabic.
 * l corresponds to /ɺɽ~ɭ/. It can sometimes be pronounced as ɭ or ɺɽ ɺɽ by the same speaker due to phonological reasons. Most of the times though l is pronounced identically to r.
 * p corresponds to /p̪/.
 * v corresponds to /ʋ/.

Vocabulary
Note: All words in the following list should not be pronounced as if they were English. Please refer to the phonetic transcription system above.

Ancient words, names, and words of extinct or other Sangheili languages.

Modern words with known or assumed meanings based on context.

Transliterated Sangheili
—Nnse-kooree-koocha nee-ey-mawoo.

Dieduckt gahkaboonoh Liebuh-Rahrian musuyano. Kaboonzaywah wohchitah kneekohsoh woorumahtwo.

Nntahbonwon sayoh. Gah-eymayoh Reecleymah toymeh-ushou zosuerohkoh!

Nnkahchee kahnohmoh keenoh ruh-ehnahsheewah cheeruh-eh tayruh-ah. Wahshahteh rohneeahkeh nohkoh wahnahkohroh neeoh-yohnoh sahgahkay. Tahgee wahkeetoh ruhnehdah-ee nnkah-ee tahgee tahruh-yah. Promethean wahzeguhkah kahjee mah.

Neetohmoh neekohnoh.

Cheenoh-ee. Keeoh-eesay. Geeohneechuh.

Cheennsay rehmah-oh. Cheennsay nnteh-hahdeh.

Kaidon sohruh kaysuhtah.

Neeshoh eesah Aabeetah tay shee shoh hyuhmahnn. 'Mdama wooeeee kehnndoh.

Nnrah kahwah ahkeeteh yoh-uh yoh-uh.

Sheennshee roh-ehtahmahnay. Kraken bayoh-hoh.

Promethean ee-hahshee-hoo 'Mdama eerayrah cheennshee. Wahrahtsuah uhtohkah nohkoh eekahtahtah rohkehmah.

Ruhnahshuh ee-hah sheewoo Sangheili ruh-ahnee. Prophet ruh-ah-ee.

Suhrahnahkah sohkuhfoo sohruhsuh.

Gay uhsoh. 'Mdama zurahnahkahgah zohkuh-oh beechee.

Nn-ee eeseh kaysah.

Nn-nee nnseh shuhkohroh.

Didact sah-ee nay.

Non-native speakers of Sangheili
Several species, including humans, Mgalekgolo, Unggoy, and Kig-Yar are able to speak Sangheili. Not all races can speak the same language due to evolutionary design restrictions; for example, the Yanme'e could only communicate through a cacophony of high-pitched clicks and screeches. To facilitate easier communications between member species, translation software is used on Covenant ships to decipher words. During and after the Human-Covenant War, several humans were able to understand Sangheili :
 * Catherine Halsey
 * Evan Phillips
 * Luther Mann
 * Melody Azikiwe
 * Mike Spenser
 * Olympia Vale
 * Jacob Keyes

Symbol types
There are four known written types of the Sangheili language.

Triangle type (original cipher)
This triangle variant of the written language has a few key features. They are a big triangle surrounded by numerous smaller triangles. These smaller triangles are known to be closer to the corners of the bigger triangle, and can either be floating above the corner, or very close to it. The direction of the bigger triangle is not set, leading to there being numerous possibilities to what one symbol could mean. The bigger triangle is also known to have semi-circles cut out of the middle of its sides, sometimes filled with circles. Sometimes the triangles points are not there and are replaced by a circle. Sometimes rectangles are involved in the symbols. There is also thinner and longer triangle symbols in this type of symbols. These can have a smaller triangle at its smaller side or not.

There are a few different translations for these types of symbols. They are seen often in transmissions and on control panels. These have been in use as early as February 4th 2531 in transmissions during the Harvest Campaign.

Bumped triangles type
These triangle symbols are similar to the base triangle type. They are big triangles with a spike coming out near the corners. Smaller triangles, unlike the triangle type symbols, are closer to the middle of the sides, touching the spike coming out near the corner.

There is sometimes a smaller triangle taken out the sides of the bigger triangles, and sometimes triangle taken out the middle of them also.

These are often seen in transmissions and also on control panels. These have been in use as early as April 26, 2526 in transmissions during the Battle of Circinius IV.

Armor
These are all the symbols and patterns related to them.

Forerunner symbol type
These symbols are often borrowed or adapted from actual Forerunner symbols for use in speech, on Sangheili armor, and even on Covenant technology.

Ancient Sangheili triangle type
An ancient version of the Sangheili script appears in Sangheili ruins and ceremonial curveblades. Symbols similar to the modern triangular symbols appear. They can also be seen on modern Swords of Sanghelios flags.

Ancient Sangheili circle type
Usually next to ancient Sangheili triangle types on ancient Sangheili scriptures.

Mural type
This type is numerous symbols and images.

Production notes
Before the release of Halo 2, the official website at Halo2.com was made to look like a Covenant computer complete with the Sangheili language. This language was a simple cipher with the triangular characters. In addition to Halo2.com, a released wallpaper contained triangular characters that made use of this cipher. After the Halo2.com site, the cipher changed. Two wallpapers were released with an entirely new cipher still using the triangular characters. Neither this cipher or the previous one have been used subsequently, though the triangular characters are still commonly used.

The Covenant speech in Halo: Reach is actually able to be translated. A common useful word to understand in gameplay is "awugapu," which is used to announce that a grenade is being thrown.

The languages in their original form are heard in Halo: Combat Evolved. In Halo 2, their words are translated for the convenience of the player. Elites in Halo: CE spoke a deep, warbling tongue. This was achieved by reversing the voice acting of David Scully. The ever-popular "Wort, wort, wort!" shouted by many Elites during gameplay is actually Sergeant Johnson’s famous "Go, go, go!" reversed and sped up. The hissing-like language of the Jackals is actually the English language reversed. This, and the other Covenant languages (Drones, Hunters), have remained the same since Halo 2.

According to voice actor David Sobolov, who played Ripa 'Moramee in Halo Wars and Jul 'Mdama in Halo 4, the fictional language spoken by the Elites in Halo 4 was a "made-up language that was based on Japanese," that started out as being improvised by the voice actors. Except due to legal concerns, the voice lines and language were intentionally scripted instead. Because of the complexity of the fictional language, "it took almost ten minutes per line to record" and required extra time to record than usual.

In Halo 5: Guardians, Sangheili hieroglyphics were inspired by Islamic, Mongolian, and Indian art, as well as Egyptian and Mayan hieroglyphics.

A Covenant language was created by David Peterson, creator of the Dothraki language used in Game of Thrones, for Halo: The Television Series. Peterson posts transcripts for the words and their translations in a series of posts on Archive of Our Own, accessible here.

List of appearances

 * Halo: The Fall of Reach
 * Halo: Combat Evolved
 * Halo: First Strike
 * Halo 2
 * Halo Graphic Novel
 * Halo: Ghosts of Onyx
 * Halo: Landfall
 * Halo: Arms Race
 * Halo: Combat
 * Halo: Last One Standing
 * Halo: Uprising
 * Halo 3
 * Halo: Contact Harvest
 * Halo: The Cole Protocol
 * ''Halo Wars: Genesis
 * Halo Wars
 * Halo 3: ODST
 * Halo Legends
 * The Package
 * Origins
 * Halo: Blood Line


 * Halo: Evolutions - Essential Tales of the Halo Universe
 * Stomping on the Heels of a Fuss
 * Blunt Instruments
 * The Mona Lisa
 * The Return
 * Halo: Reach
 * Halo: Glasslands
 * Halo: Combat Evolved Anniversary
 * Halo: The Thursday War
 * Halo 4: Forward Unto Dawn
 * Halo 4
 * Terminals
 * Spartan Ops
 * Halo: Mortal Dictata
 * Halo: Broken Circle
 * Halo: Nightfall
 * Halo 2: Anniversary
 * Halo: Hunters in the Dark
 * Hunt the Truth
 * Halo 5: Guardians
 * Halo: Shadow of Intent
 * Halo: Tales from Slipspace
 * Hunting Party
 * Knight Takes Bishop
 * Halo: Bad Blood
 * Halo: The Television Series
 * Contact
 * Unbound
 * Emergence
 * Homecoming
 * Transcendence
 * Halo: The Rubicon Protocol