[KS] RESOURCE>Major Upgrade of CJKV-E/DDB Dictionaries (Muller)

Charles Muller cmuller-lst at jj.em-net.ne.jp
Mon Nov 1 22:01:11 EDT 2010


Subject: "CJKV-E/DDB 2.0"
From: Charles Muller

Dear Colleagues,

After almost ten years of operation since Michael Beddow's initial
creation of the programming structure for the online CJKV-E/DDB
dictionaries, we are delighted to announce a major upgrade of these
web services.

The most basic components of this upgrade are (1) a move to a
dedicated server which will be able to deliver more power to search
functions and greater stability to Unicode-related programming, and
(2) an entire rewriting of the underlying search and indexing
routines, resulting in a noticeable increase in speed and variety of
search results, and links to both internal and external
resources. Some major specific additions and enhancements include:

A. Basic Search

    (1) A middle-level of search results, showing a list of head words
    that contain the search term. Previously, searches for a term would
    produce only the headword itself (when it existed), along with a
    long, scattered list of entry body matches.


    (2) The list of body entry matches, which was previously delivered
    without any particular ordering, is now sorted according to
    traditional ascending radical + stroke count (basically equivalent
    to Unicode hex number).

    (3) The list of matched body entries now includes a snippet of
    context, to give the user some hint of the usefulness of each
    listed match.

    (4) Head word searches via Pinyin, Hangeul, Korean romanization,
    Katakana, and Japanese romanization. Previously, searches for
    headwords via their various renderings in East Asian and romanized
    syllabaries would only yield matches as body entries. Now,
    dedicated search indexes for Pinyin, etc. will yield head word
    matches in a very fast search.

    (5) Searches with or without diacritics are equally and
    transparently supported. Searches employing those romanization
    systems that use diacritics may also be made with or without
    diacritical marks (though in the nature of things the latter may
    produce some false positives). This also applies to searches for
    Sanskrit and Pāli terms in entry bodies.


B. Entry results

    (1) Previously, hyperlinks to terms within displayed entries
    sometimes lacked actual targets, or led to the comprehensive
    external index in a roundabout manner. Now, if a term currently has
    no target in the dictionary concerned or (in the case of the DDB)
    in the external index, it will be shown without a hyperlink.

    (2) If the link goes to the comprehensive external index rather
    than the DDB itself, the user will be taken directly to that
    information, with no other message or page in between.

    (3) If  the headword of a DDB entry is also present in the CJKV-E,
    a hyperlink to that entry will automatically be added to the DDB
    entry when it is displayed. The converse applies to CJKV-E entries:
    if the DDB has an entry for the same headword, a link to it will be
    added to the CJKV-E entry on the display.

    (4) A link for a direct search to the SAT Taishō Database will
    automatically be generated for DDB entries (we are  also able and
    willing to generate links directly into other web-based canonical
    collections if the administrators of those collections are willing
    to provide us with the requisite code for such links).


  C. Behind-the-scenes. There are other enhancements which, while not
visible to users, will greatly improve the function of both
dictionaries. Most importantly:

    (1) The two main indexes (on headwords and fulltext) previously
    used have been completely re-implemented to give faster and more
    flexible matching. In addition, a number of specialized
    supplementary indexes have been added which are automatically
    invoked alongside or instead of the main indexes as and when
    appropriate.

    (2) Index updating has been made significantly faster and
    extensively automated. This means that all the indexes can be
    regenerated as frequently as desired. So from now on, corrections
    to existing entries, as well as newly-contributed entries, will be
    browsable and searchable in their entirety very shortly after
    editorial acceptance (assuming of course, that the Human in-charge
    is not indisposed for some reason or other!).

    (3) Great care has been taken to ensure that hyperlinks on external
    sites to DDB and CJKV-E entries which employ the syntax of the
    previous implementation of the Dictionaries continue to function
    exactly as before.  No existing external links made in accordance
    with the methods previously specified for creating such links will
    be broken as a result of the new infrastructure.


D. CJKV-E

    (1) In the process of preparing this upgrade, a great amount of
    work has been put into improving the structure and content of the
    CJKV-E dictionary, which has stayed pretty much on the back burner
    for the past decade or so. Greater attention will henceforth be
    given to the development of this resource.

    (2) In fact, I am presently working with a small grant that will
    have the effect of drastically increasing the coverage of the
    CJKV-E over the next few years.

I would like to take this opportunity to offer my deepest thanks to
those scholars who have provided staunch and enduring support for the
DDB over the past decade. Most importantly to Michael, who has,
without any monetary remuneration whatsoever, provided state-of-the
art programming of these dictionaries (along with web security and all
other related functions), buttressed by a matched level of
understanding of lexicographical and linguistic principles that has
provided us with so much of the structure and precision that these
online references currently exhibit. Many of the technical
enhancements are based on Michael's work on the Anglo-Norman
Dictionary (http://www.anglo-norman.net/), funded by the Arts and
Humanities Research Council of the United Kingdom, whose indirect but
significant support is gratefully acknowledged.

There is also a core group of approximately 25 scholars, many of them
recognized as leading figures in their own areas of expertise, who
have continued to generously contribute large amounts of material from
their own research notes and glossaries. They have also spent much
time in scouring previously-existent entries, amending, appending, and
entirely rewriting, such that the DDB and CJKV-E are in a steady state
of growth in size and accuracy (the names of these scholars can be
browsed at <http://www.buddhism-dict.net/credits/credits-ddb.html>). I
would also like to thank those scholars who have convinced their
libraries of the value of an institutional subscription. The resulting
funds, albeit modest, have been invaluable to help pay for
infrastructure, web hosting, and the employment of part-time
assistants to do input and editing.

I believe we can say that there are few, if any, other examples in the
academic humanities field where a body of scholars, bonded by
overlapping interests but spread across the globe, have contributed to
a central resource on a such a scale, upholding rigorous standards of
composition, accreditation, and citation, and providing an eminently
practical and useful example of how we can collaborate to build
resources that are far  more substantial than mere anonymous
aggregations.

Digital Dictionary of Buddhism: http://buddhism-dict.net/ddb
CJKV-E Dictionary: http://buddhism-dict.net/dealt

Chuck

----------------------
A. Charles Muller
Center for Evolving Humanities
University of Tokyo
www.acmuller.net
----------------------

H-Buddhism (Buddhist Scholars Information Network)

Web Site: http://www.h-net.org/~buddhism
Posting Guidelines: http://www.h-net.org/~buddhism/posting_guidelines.html
Account Handling: http://h-net.msu.edu/cgi-bin/wa?SUBED1=h-buddhism






More information about the Koreanstudies mailing list