Vandaag, op de mailing list voor taal-specialisten van OOo, is een nieuwe
versie van de afbreekroutines, die een aantal voor ons handige
verbeteringen kent.
Onder andere woordsamenstellingen in relatie tot afbreken. Ik moet me
vanavond nog inlezen, maar wellicht dat iemand zich ook geroepen voelt tot
deze materie.
Het blijkt dat het aangeven van de woordgrenzen dus in elk geval hiervoor
nuttig zou kunnen worden.
Het ziet er veeelbelovend uit.
mvg
Ruud
-----------------------------------------
Hi,
New version of the Hyphen hyphenator has default hyphenmin and
optional compound word hyphenation support, also improved en_US
hyphenation patterns.
The Hyphen hyphenator (standalone version of OpenOffice.org ALTLinux
Libhnj) is the default hyphenator of OpenOffice.org on several
platforms (Debian, Fedora, Ubuntu). Integration with OpenOffice.org
(also the improved hyphenation patterns) is under development.
Source distribution:
http://downloads.sourceforge.net/hunspell/hyphen-2.4.tar.gzRelease notes:
2008-05-01 Hyphen 2.4 release:
- compound word hyphenation support by recursive pattern matching
based on two hyphenation pattern sets, see README.compound.
Especially useful for languages with arbitrary number of compounds
(Danish,
Dutch, Finnish, German, Hungarian, Icelandic, Norwegian, Swedish etc.).
- new dictionary parameters (minimal character numbers for hyph.
distances):
LEFTHYPHENMIN: minimal hyphenation distance from the left end of the word
RIGHTHYPHENMIN: minimal hyphenation distance from the right end of the
word
COMPOUNDLEFTHYPHENMIN: min. hyph. dist. from the left compound word
boundary
COMPOUNDRIGHTHYPHENMIN: min. hyph. dist. from the right comp. word
boundary
- new API function: hnj_hyphen_hyphenate3() (like hyphenate2(), but
with hyphenmin options)
en_US hyphenation patterns:
- extended hyph_en_US.dic with TugBoat hyphenation log (fix thousand
incompletely or badly hyphenated words, for example acad-e-my, acro-nym,
acryl-amide, adren-a-line, aero-space, am-phet-a-mine, anom-aly etc.)
- fixed hyph_en_US.dic: set the right default hyphenation distance of
the original TeX hyphenation patterns:
LEFTHYPHENMIN 2
RIGHTHYPHENMIN 3 (not 2!)
It is not only a typographical issue. It seems, TeX hyphenation
patterns are right only with these settings, for example,
the bad "anoma-ly" is restricted in TeX only by the default
\righthyphenmin=3 (but not restricted in OpenOffice.org, until now).
- documentation (README_hyph_en_US.dic)
- fixes for automake configuration, compiling and checking, see ChangeLog
On the practical usage of the new extension: see README.compound in
the source distribution. More documentation and development tools for
the extended hyphenation patterns are planned. It is suggested that
the (future) hyphenation dictionary developers of the related
languages collect all common non-compound words and sign compound word
boundaries in its hpyhenation dictionaries (the source of the
hyphenation patterns).
FSF.hu Foundation, Hungary (
http://www.fsf.hu) was the main supporter
of the work.
Regards,
László Németh
_______________________________________________
Over de OpenTaal-mailinglist:
http://opentaal.org/mailinglist.phpZoeken in het mailinglistarchief:
http://opentaal.org/zoeken.phpJuridische voorwaarden:
http://opentaal.org/licentie.php