Poliqarp
Poliqarp is an open source search engine designed to process text corpora, among others the National Corpus of Polish created at the Institute of Computer Science, Polish Academy of Sciences.
Features
- Custom query language
- Two-level regular expressions:
- operating at the level of characters in words
- operating at the level of words in statements/paragraphs
- Good performance
- Compact corpus representation (compared to similar projects)
- Portability across operating systems: Linux/BSD/Win32
- Lack of portability across endianness (current release works only on little endian devices)
gollark: The 290th will be an actual complete Macron parser and interpreter.
gollark: 289 in total.
gollark: ddg!eso macron
gollark: I wrote a macron compiler.
gollark: osmarkspythonbuildsystemâ„¢ can.
External links
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.