WiSE - Worldwide Internet & Intranet Search Engine
WiSE (Worldwide Internet & Intranet Search Engine) is a full-text search engine product, which is designed to index and search documents from various sources. Document files can be retrieved either from the intranet file servers or the internet web servers. Data records from RDBMS can also be handled
N-gram indexing method
WiSE deployes N-gram (n-successive character strings) based indexing method.
Two types of text cutting methods are applied to obtain best search results.
- N-gram cutting is best suited for Japanese, Korean, and Chinese texts.
- Word cutting by white space for other languages (e.g. English, French, Arabic, Russian, ...)
- Capable to find exact words or phrases from documents written in multiple languages.
- Also capable to apply N-gram cutting to alpha-numeric characters, in order to do between matching.
REST type Web search API
Application can throw search requests via HTTP, and receive results in XML format.
Following functions are available.
- simple search
- sort order, result pagenation, nearby texts, highliting, ...
- synonym&related words dictionary
- faceted navigation
- distribution (histogram) over numeric&date attributes
- multiple tree categories (configurable)
- candidate search words
- keyword driven recommendation links
- others ...
Multiple language support
- WiSE is suitable for use with global bodies for its multiple language support.
- UTF-8(Unicode) as the internal character encoding.
- Documents written in multiple languages are stored in a single index database.
- Search requests are given in UTF-8 code, so that it gives you the ability to search in multiple national languages.
Scalability
WiSE is designed to scale among distributed server environments. This feature is called "WiSE Clustering".
- Load-balancing
Multiple search servers can be configured to provide hardware redundancy.
With this option, search servers can also be scaled out to support heavy query workloads.
- Distributed-search
Search servers can be scaled out to support massive amount of documents.
- Dynamic re-configuration
N+1(or any) Hardware redundancy can be configured.
- Shared-data
This option allows deployment of NAS(Network Attached Storage) devices for reliability, managability and improved performance.