List of Wikipedias

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Screenshot of Wikipedia's portal wikipedia.org showing the different language editions sorted by article count.

This is a list of many of the different language editions of Wikipedia; as of August 2012, there are 285 Wikipedias. For their number of articles, see the main list.

Wikipedia edition codes[change | change source]

Each Wikipedia has a code, which is used as a subdomain below wikipedia.org. Interlanguage links are sorted by that code.

The codes represent the language codes defined by ISO 639-1 and ISO 639-3, and the decision of which language code to use is usually determined by the IETF language tag policy.

One code is not a language code ('be-x-old') but refers to a specific orthography.

Some deviations include:

WP edition name WP code Code meaning in ISO 639
Albanian sq 'sq' is the ISO code as a macrolanguage with four individual languages, but Tosk Albanian is mainly used. 'Als' is the individual code for Tosk Albanian, but this code is already used for Alemannic Wikipedia.
Alemannic als 'als' is the ISO code for Tosk Albanian[1] (an altogether unrelated language); should be moved to one of valid codes such as 'gsw'.[a]
Aromanian roa-rup Not an ISO code. 'roa' is Romance (Other); should be moved to valid 'rup'.
Banyumasan map-bms Not an ISO code. 'map' is Austronesian (Other), bms is Bilma Kanuri, a language of Niger.
Dutch Low Saxon nds-nl Not an ISO code. nds is 'Low Saxon', restricted to Germany in Ethnologue. The Low Saxon dialects in the Netherlands have their own ISO codes.
Belarusian (Taraškievica) be-x-old Not an ISO code; should be moved to valid 'be-tarask'.
Bihari bh collective code 'ISO 639:bih' includes Bhojpuri bho, Maithili mai, Magahi mag and nine others[2]
Cantonese zh-yue Not an ISO code; should be moved to valid 'yue'.
Chinese zh The code is for Chinese in general, but Chinese Wikipedia is written in modern written vernacular Chinese with four standard forms.[b] Modern written vernacular Chinese is based on modern Mandarin (cmn).
Classical Chinese zh-classical Not an ISO code; should be moved to valid 'lzh'.
Malay ms Macrolanguage that includes more than 30 individual languages/dialects. However, the wiki excludes Indonesian because Indonesian Wikipedia (id) exists independently.
Min Nan zh-min-nan Not an ISO code; should be moved to valid 'nan'. 'min' is unrelated.
Norwegian Bokmål no Norwegian in general, i.e. Bokmal ('nb'/'nob') and Nynorsk ('nn'/'nno'). Nynorsk correctly uses 'nn'.
Ripuarian ksh Kölsch, one variety of the Ripuarian languages
Samogitian bat-smg Not an ISO code; should be moved to valid 'sgs'. 'bat' is Baltic (Other), 'smg' is Simbali language.
Simple English simple Not an ISO code
Tarantino roa-tara Not an ISO code. 'roa' is Romance (Other).
Võro fiu-vro Not an ISO code; should be moved to valid 'vro'.
Zamboanga Chavacano cbk-zam Not an ISO code, 'cbk' is Chavacano. zam is unrelated Miahuatlán Zapotec.

List[change | change source]

Size is given in decadic logarithm of number of articles. '6' means more than 1 000 000, '5' more than 100 000, '4' more than 10 000.

Notes[change | change source]

  1. 'gsw' is the ISO 639-2 and -3 code for Swiss German, Alemannic and Alsatian. 'gct' is the ISO 639-3 code for Colonia Tovar dialect, 'swg' is for Swabian German and 'wae' is for Walser German.
  2. They are the standards for Mainland Chinese and Singaporean (written in simplified Chinese characters), and Taiwanese and Hong Kong/Macau (in traditional Chinese characters). They are automatically converted by the wiki machine. See Chinese Wikipedia.
  3. The Romansh Wikipedia was established in December 2003. According to an article from July 5, 2008 in Swissinfo.ch, the first two years were financed by a contribution of SFr60,000 ($58,502) from the canton and the federal government.

References[change | change source]

Other websites[change | change source]