« Return to Thread: Afbrekingen

Afbrekingen

by Ruud Baars :: Rate this Message:

Reply to Author | View in Thread

Vandaag, op de mailing list voor taal-specialisten van OOo, is een nieuwe
versie van de afbreekroutines, die een aantal voor ons handige
verbeteringen kent.

Onder andere woordsamenstellingen in relatie tot afbreken. Ik moet me
vanavond nog inlezen, maar wellicht dat iemand zich ook geroepen voelt tot
deze materie.

Het blijkt dat het aangeven van de woordgrenzen dus in elk geval hiervoor
nuttig zou kunnen worden.

Het ziet er veeelbelovend uit.

mvg
Ruud
-----------------------------------------

Hi,

New version of the Hyphen hyphenator has default hyphenmin and
optional compound word hyphenation support, also improved en_US
hyphenation patterns.

The Hyphen hyphenator (standalone version of OpenOffice.org ALTLinux
Libhnj) is the default hyphenator of OpenOffice.org on several
platforms (Debian, Fedora, Ubuntu). Integration with OpenOffice.org
(also the improved hyphenation patterns) is under development.

Source distribution:
http://downloads.sourceforge.net/hunspell/hyphen-2.4.tar.gz

Release notes:

2008-05-01 Hyphen 2.4 release:
  - compound word hyphenation support by recursive pattern matching
    based on two hyphenation pattern sets, see README.compound.
    Especially useful for languages with arbitrary number of compounds
(Danish,
    Dutch, Finnish, German, Hungarian, Icelandic, Norwegian, Swedish etc.).

  - new dictionary parameters (minimal character numbers for hyph.
distances):
    LEFTHYPHENMIN: minimal hyphenation distance from the left end of the word
    RIGHTHYPHENMIN: minimal hyphenation distance from the right end of the
word
    COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word
boundary
    COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word
boundary

  - new API function: hnj_hyphen_hyphenate3() (like hyphenate2(), but
    with hyphenmin options)

en_US hyphenation patterns:

  - extended hyph_en_US.dic with TugBoat hyphenation log (fix thousand
    incompletely or badly hyphenated words, for example acad-e-my, acro-nym,
    acryl-amide, adren-a-line, aero-space, am-phet-a-mine, anom-aly etc.)

  - fixed hyph_en_US.dic: set the right default hyphenation distance of
    the original TeX hyphenation patterns:
    LEFTHYPHENMIN 2
    RIGHTHYPHENMIN 3 (not 2!)
    It is not only a typographical issue. It seems, TeX hyphenation
    patterns are right only with these settings, for example,
    the bad "anoma-ly" is restricted in TeX only by the default
    \righthyphenmin=3 (but not restricted in OpenOffice.org, until now).

  - documentation (README_hyph_en_US.dic)

  - fixes for automake configuration, compiling and checking, see ChangeLog

On the practical usage of the new extension: see README.compound in
the source distribution. More documentation and development tools for
the extended hyphenation patterns are planned. It is suggested that
the (future) hyphenation dictionary developers of the related
languages collect all common non-compound words and sign compound word
boundaries in its hpyhenation dictionaries (the source of the
hyphenation patterns).

FSF.hu Foundation, Hungary (http://www.fsf.hu) was the main supporter
of the work.

Regards,
László Németh


_______________________________________________
Over de OpenTaal-mailinglist: http://opentaal.org/mailinglist.php
Zoeken in het mailinglistarchief: http://opentaal.org/zoeken.php
Juridische voorwaarden: http://opentaal.org/licentie.php

 « Return to Thread: Afbrekingen

LightInTheBox - Buy quality products at wholesale price