
DDC-Concordance

This forum is an archive for the mailing list ddc-users@ddc-concordance.org. Messages posted here will be sent to this mailing list.

DDC-Concordance is an open-source (LGPL) search engine developed specifically to meet the needs of linguistic researchers. Its most relevant properties are:
  • Sentence-based or document-based searches
  • Statistical queries, not approximations
  • In addition to classical search engine features such as boolean operators (AND, OR, NOT), left and right truncation, and distance operators, DDC-Concordance can search for word forms: a search for "child" finds all documents containing word forms such as "child" and "children". This functionality is currently available for English, German, and Russian.
  • DDC-Concordance can index metadata from XML documents
  • Words can be indexed with searchable annotations, in particular word forms, lemmas, part-of-speech tags, and semantic categories
  • Interval searches, both directed and symmetric (e.g. FOLLOWED_BY and NEAR)
  • Phrase searches
  • Relevance-ranking operator for documents
  • DDC-Concordance is fast: indexing a 100-million-word corpus takes approximately 1.5 hours, and the first ten hits for simple queries are shown in about 0.2 seconds.
  • DDC-Concordance can handle huge corpora thanks to its distributed clustering architecture. The largest known corpus has about 1 billion tokens, and we have not reached a limit yet.
  • Currently available are a command-line client for Linux, a Perl interface, and a simple CGI script. We are working on a Python interface, too.
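
To make the word-form and interval searches listed above more concrete, here is a minimal Python sketch on a toy token list. It is not DDC's implementation and does not use DDC's query syntax; the function names (followed_by, near, matches_lemma) and the LEMMAS table are hypothetical and exist only for this illustration.

```python
# Toy illustration only: NOT DDC's implementation or query syntax.
# All names below are hypothetical and chosen for this example.

# Hypothetical lemma table; a real system would use a morphological analyzer.
LEMMAS = {"children": "child", "played": "play"}


def lemma(token):
    """Map a word form to its lemma (falls back to the token itself)."""
    return LEMMAS.get(token, token)


def matches_lemma(tokens, query):
    """Word-form search: return every token sharing the query's lemma."""
    return [tok for tok in tokens if lemma(tok) == lemma(query)]


def positions(tokens, word):
    """Return the indices at which `word` occurs in `tokens`."""
    return [i for i, tok in enumerate(tokens) if tok == word]


def followed_by(tokens, first, second, max_gap):
    """Directed interval search: `second` occurs after `first`,
    with at most `max_gap` tokens in between."""
    return any(
        0 < j - i <= max_gap + 1
        for i in positions(tokens, first)
        for j in positions(tokens, second)
    )


def near(tokens, a, b, max_gap):
    """Symmetric interval search: same distance limit, either order."""
    return followed_by(tokens, a, b, max_gap) or followed_by(tokens, b, a, max_gap)


sentence = "the children played near the old house".split()

print(matches_lemma(sentence, "child"))               # ['children']
print(followed_by(sentence, "children", "house", 4))  # True
print(followed_by(sentence, "house", "children", 4))  # False
print(near(sentence, "house", "children", 4))         # True
```

The directed search depends on word order, while the symmetric one accepts the pair in either order; that is the only difference between the two helpers in this sketch.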

Threads (12):
  • Re: ddc-users Digest, Vol 7, Issue 1 (started by ??????? ???????-2; 1 reply; last message by tobias roth-5)
  • Problems installing rpm of version 1.80 on Fedora (started by tobias roth-5; 3 replies; last message by Kai Zimmer-2)
  • IndicesToShow not shown in query '#within file' (started by tobias roth-5; 2 replies; last message by tobias roth-5)
  • Re: ddc-users Digest, Vol 6, Issue 1 (started by ??????? ???????-2; 0 replies; last message by ??????? ???????-2)
  • Case-insensitive matching in RE (started by tobias roth-5; 0 replies; last message by tobias roth-5)
  • Re: ddc-users Digest, Vol 5, Issue 1 (started by ??????? ???????-2; 1 reply; last message by Kai Zimmer-2)
  • sort by l/r context (started by Matej Durco; 0 replies; last message by Matej Durco)
  • DDC-PHP 0.02 release (started by Kai Zimmer-3; 0 replies; last message by Kai Zimmer-3)
  • integrate ddc in web-site (started by Matej Durco; 1 reply; last message by Kai Zimmer-2)
  • Re: ddc-users Digest, Vol 3, Issue 1 (started by ??????? ???????-2; 0 replies; last message by ??????? ???????-2)
  • threading in ddc? (started by Matej Durco; 0 replies; last message by Matej Durco)
  • DDC-Python 0.01 release. (started by Kai Zimmer-2; 0 replies; last message by Kai Zimmer-2)