CEDICT
The CEDICT project was started by Paul Denisowski in 1997 and is maintained by a team on mdbg.net under the name CC-CEDICT, with the aim to provide a complete Chinese to English dictionary with pronunciation in pinyin for the Chinese characters.
Content
CEDICT is a text file; other programs are needed to search and display it. This project is considered a standard Chinese-English reference on the Internet and is used by several other Chinese-English projects. The Unihan Database uses CEDICT data for most of its information about character compounds, but this is auxiliary and is explicitly not a part of the main Unicode database.Features:
- Traditional Chinese and Simplified Chinese
- Pinyin
- American English
- , it had 114,087 entries in UTF-8.
Traditional Simplified /American English equivalent 1/equivalent 2/
漢字 汉字 /Chinese character/CL:個|个/
Example of a simple egrep search:
$ egrep -i 有勇無謀 cedict.txt
有勇無謀 有勇无谋 /bold but not very astute/
History
Year | Event |
1991 | EDICT Japanese dictionary project was started by Jim Breen. |
1997 | CEDICT project started by Paul Denisowski, on the model of EDICT. |
2007 | MDBG started a new project called which continues the CEDICT project with a new license: , allowing more projects to use it. Additionally a work flow has been set up to streamline the process of submitting, reviewing and processing new entries. |
Related projects
CEDICT has shown the way to some other projects:- :de:HanDeDict|HanDeDict
- :en:CFDICT|CFDICT for French
- Some older CEDICT data is also found in the Adsotrans dictionary.
- February 2012: , the Spanish-Chinese free dictionary starts
- May 2017: CHDICT for Hungarian
- CC-Canto is Pleco Software's addition of Cantonese language readings in Jyutping transcription to CC-CEDICT
- Cantonese CEDICT features Cantonese language readings in Yale transcription and has Cantonese-specific words, many of which were taken from "A Dictionary of Cantonese Slang" in possible copyright infringement.