CIDE ePub/Kindle conversion

I have produced a free ePub/Kindle English - English dictionary. There are two versions.

  1. Wiktionary only
  2. latest ed. 2011-11-29
  3. Wikt entries from 2011-11-21 (yyyy-mm-dd)
  4. Wiktionary and CIDE v.0.51
  5. latest ed. 2012-06-23
  6. Wikt entries from 2012-06-15 (yyyy-mm-dd)

CIDE stands for The Collaborative International Dictionary of English and is based on Webster's 1913 Unabridged with additions from WordNet.

Hosted in Windows Live

  1. Wiktionary only ePub, circa 15 MB
  2. Wiktionary only mobi, circa 42 MB
  3. Wikt + CIDE ePub, circa 13 MB
  4. Wikt + CIDE mobi, circa 77 MB

Source 1: English Wiktionary

Source 2: CIDE v.0.51 and Micra Inc.'s main site.

Source 3: US Gazetteer 2010 Places file.

Source 4: Moby thesaurus II.

Licenses

  1. Wiktionary
  2. Portions which are from Wiktionary are licensed under the
  3. Creative Commons Attribution/Share-Alike License 3.0 (Unported).
  4. CIDE
  5. CIDE is copyrighted (C) 1996, 1998 by MICRA, Inc. of Plainfield, NJ.
  6. This electronic version may be used freely for personal use or for research, and may be freely distributed provided that the entire set of files are copied, and the headers and copyright notices are not modified or deleted. The inclusion of more than five per cent of the text of this dictionary in a product for sale requires the express written permission of MICRA Inc. Sale of entire copies, including all headers and copyright notices, will not be considered a violation of this provision.
  7. Portions from Wordnet are in the public domain.
  8. Portions from US Gazetteer 2010 are in the public domain.
  9. Portions from Moby thesaurus II are GPL.

Author's Notes

  1. New versions of the Wikt + CIDE version cannot currently be published in Amazon as their policy limits mobi files to 50 MB size. The newest Wikt + CIDE v. is available at my Win Live site
  2. Starting from edition 2011-08-04, the Wikt + CIDE version of the dictionary is based on Wiktionary and CIDE 0.51. Previously GCIDE 0.46 or the GNU version of CIDE was used instead of CIDE
  3. Special chars are now handled much better for CIDE entries than they were for GCIDE and at least partly better for Wiktionary entries
  4. There are yet some imperfections in handling multiple words defined in the same section. Mostly that words beyond the first should be indexed too
  5. A few dozen orthography entries are still missing
  6. Some tables are partial in CIDE and most aren't formatted properly in this edition
  7. CIDE contains some images, which have not been included
  8. The Wiktionary and CIDE entries have merely been sorted together, not combined in any other way. The sorting order was somewhat fixed and changed in edition 2011-08-28 so that UTF-8 sorting with the English locale is used. Special chars, namely mostly Greek chars, are now sorted last (after Z)
  9. Words from Wiktionary may have either a capital or lowercase initial. They have "src. Wiktionary". The words from Webster's 1913 have usually a capital initial. Words from later sources in CIDE, such as Wordnet, don't. CIDE entries have source info such as "src. 1913 Webster" or "src. Webster's 1913 Quotations etc." or "src. WordNet 1.5". PJC stands for Patrick J. Cassidy, JK for Joel Korhonen
  10. The Wiktionary entries are the simple definitions from Wiktionary dumps, not the full article texts. Therefore such data as synonyms, hyponyms etc. is not included
  11. Starting from the edition of 2012-04-27, the Wiktionary dump parser has been rewritten. As the first new feature, the entries now include the quote data
  12. Starting from the edition of 2012-06-23, the dictionary includes data for also the Anglo-Saxon, French, German, Middle-English and Old English terms from Wiktionary as these may be of use while reading English books. More languages may be supported in the future

Original CIDE statement

An electronic field-marked version of:

The Collaborative International Dictionary of English

derived from

Webster's Revised Unabridged Dictionary

Version published 1913

by the C. & G. Merriam Co.

Springfield, Mass.

Under the direction of

Noah Porter, D.D., LL.D.


and from

WordNet(R), a semantic network created by

the Cognitive Science Department

of Princeton University

under the direction of

Prof. George Miller


[and is being updated and supplemented by

an open coalition of volunteer collaborators from

around the world.]*


Contributions of data, time, and effort are requested from any person

willing to assist creation of a comprehensive and organized knowledge base

for free access on the internet.

Users should be acutely aware that most of the text was created before 1890, and it reflects the knowledge and prejudices of the editors of that era.

This version is only a first typing, and has numerous typographic errors, including errors in the field-marks. Assistance in bringing this dictionary to a more accurate and useful state will be greatly appreciated.


Patrick Cassidy

cassidy@micra.com

735 Belvidere Ave.

Plainfield, NJ 07062

Office: (908)668-5252

(908) 561-3416

* Development of CIDE was ceased in the early 2000's due to the publishment of various online Internet dictionaries / JK

CIDE versions note

CIDE 0.51 main body was last edited September 29, 2002.

CIDE stands for The Collaborative International Dictionary of English. There are also GNU versions available called GCIDE which are XML formatted. The former editions of this compilation were based solely on GCIDE. GCIDE V.0.46 was translated by Michael Dyck (jmdyck@metalab.unc.edu) on June 16, 2002.

There is also a version Micra published in Project Gutenberg, Letters A & B being Etext #660. The files are named pgw050*.zip. The file pgw050ab.txt or letters A and B lists version as 0.50 and last edited date as Feb 11th, 1999. Some notes were added to the files in April 2004. This version seems to be close to GCIDE but the text layout is different; there are more forced newlines.