Search appliance

A search appliance (SA) is a type of computer appliance which is attached to a corporate network for the purpose of indexing the content shared across that network in a way that is similar to a web search engine.[1][2]

Architecture

A search appliance is usually made up of several components. These include a gathering component, a standardizing component, a data storage area, a search component, a user interface component, and a management interface component:[3]

  • The gathering component is usually a web crawler or file crawler that goes out on a network or the Internet and gathers files and data from specified locations. This might include SMB shared directories, NFS shared directories, databases, and web pages. The crawler might either copy files to the search appliance, or only copy the metadata about the file.
  • A standardizing component takes the data from the gathering component and transposes it into a standardized format for storage in the data storage component. It then places it in the data storage area.
  • The data storage component holds metadata about the files and might also contain copies of the actual file or data as well as the metadata about the file.
  • The search component searches through the stored metadata from the files and provides the information to the search interface in the form of query results. It also can provide links to the copies of the files stored on the search appliance, or it can provide links to the original files in the source locations.
  • The search interface is the component where users compose their search queries. It provides instructions to the search component and displays query results to the user.
  • The management interface lets administrators manage user accounts, permissions, adding and deleting search indexes, crawl job scheduling, and other relevant functions.

Commercial examples

  • Google Search Appliance was a SA from Google. It was supplied in two models: a 2U model (GB-7007) capable of indexing up to 10 million documents, and a 5U model (GB-9009) that was capable of indexing up to 30 million documents.[4] Google no longer sells a search appliance
  • The Mindbreeze InSpire Appliance is produced by the Austrian software vendor Fabasoft Mindbreeze.
  • The Perfect Search Appliance stores file metadata in an index on the appliance. A web server on the appliance uses that metadata to provide relevant search results in response to user queries, and provides a link to access the original files.
  • Clusterpoint Search Appliance is a software-only document and file indexing and search solution, that is based on Clusterpoint Server, a hybrid NoSQL database management system with a full text search engine and Document-oriented database.
  • Thunderstone Software is a US based company that makes a software and hardware based search appliance for enterprise and public use.

See also

References

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.