Elasticsearch

Elasticsearch is a search engine based on the Lucene library. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Elasticsearch is developed in Java. Following an open-core business model, parts of the software are licensed under various open-source licenses (mostly the Apache License),[2] while other parts[3] fall under the proprietary (source-available) Elastic License. Official clients are available in Java, .NET (C#), PHP, Python, Apache Groovy, Ruby and many other languages.[4] According to the DB-Engines ranking, Elasticsearch is the most popular enterprise search engine followed by Apache Solr, also based on Lucene.[5]

Elasticsearch
Original author(s)Shay Banon
Developer(s)Elastic NV
Initial release8 February 2010 (2010-02-08)
Stable release
6.x6.8.11 / 28 July 2020 (2020-07-28)[1]
7.x7.8.1 / 28 July 2020 (2020-07-28)[1]
Repositorygithub.com/elastic/elasticsearch
Written inJava
Operating systemCross-platform
TypeSearch and index
LicenseVarious (open-core model), e.g. Apache License 2.0 (partially; open source), Elastic License (proprietary; source-available)
Websitewww.elastic.co/products/elasticsearch 
Shay Banon talking about Elasticsearch at Berlin Buzzwords 2010

History

Shay Banon created the precursor to Elasticsearch, called Compass, in 2004.[6] While thinking about the third version of Compass he realized that it would be necessary to rewrite big parts of Compass to "create a scalable search solution".[6] So he created "a solution built from the ground up to be distributed" and used a common interface, JSON over HTTP, suitable for programming languages other than Java as well.[6] Shay Banon released the first version of Elasticsearch in February 2010.[7]

Elastic NV was founded in 2012 to provide commercial services and products around Elasticsearch and related software.[8] In June 2014, the company announced raising $70 million in a Series C funding round, just 18 months after forming the company. The round was led by New Enterprise Associates (NEA). Additional funders include Benchmark Capital and Index Ventures. This round brought total funding to $104M.[9]

In March 2015, the company Elasticsearch changed their name to Elastic.[10]

In June 2018, Elastic filed for an initial public offering with an estimated valuation of between 1.5 and 3 billion dollars.[11] On 5 October 2018, Elastic was listed on the New York Stock Exchange.[12]

Features

Elasticsearch can be used to search all kinds of documents. It provides scalable search, has near real-time search, and supports multitenancy.[4] "Elasticsearch is distributed, which means that indices can be divided into shards and each shard can have zero or more replicas. Each node hosts one or more shards, and acts as a coordinator to delegate operations to the correct shard(s). Rebalancing and routing are done automatically".[4] Related data is often stored in the same index, which consists of one or more primary shards, and zero or more replica shards. Once an index has been created, the number of primary shards cannot be changed.[13]

Elasticsearch is developed alongside a data collection and log-parsing engine called Logstash, an analytics and visualisation platform called Kibana, and Beats, a collection of lightweight data shippers. The four products are designed for use as an integrated solution, referred to as the "Elastic Stack" (formerly the "ELK stack").[14]

Elasticsearch uses Lucene and tries to make all its features available through the JSON and Java API. It supports facetting and percolating,[15][16] which can be useful for notifying if new documents match for registered queries. Another feature is called "gateway" and handles the long-term persistence of the index;[17] for example, an index can be recovered from the gateway in the event of a server crash. Elasticsearch supports real-time GET requests, which makes it suitable as a NoSQL datastore,[18] but it lacks distributed transactions.[19]

On 20 May 2019, Elastic made the core security features of the Elastic Stack available free of charge, including TLS for encrypted communications, file and native realm for creating and managing users, and role-based access control for controlling user access to cluster APIs and indexes.[20] The corresponding source code is available under the “Elastic License”, a source-available license.[21] In addition, Elasticsearch now offers SIEM [22] and Machine Learning [23] as part of its offered services.

Managed services

Developed from the Found acquisition by Elastic in 2015,[24] Elastic Cloud is a family of Elasticsearch-powered SaaS offerings which include the Elasticsearch Service, as well as Elastic App Search Service, and Elastic Site Search Service which were developed from Elastic’s acquisition of Swiftype.[25] In late 2017, Elastic formed partnerships with Google to offer Elastic Cloud in GCP, and Alibaba to offer Elasticsearch and Kibana in Alibaba Cloud.

Elasticsearch Service on Elastic Cloud is the official hosted and managed Elasticsearch and Kibana offering from the creators of the project since August 2018[26][27] Elasticsearch Service users can create secure deployments with partners, Google Cloud Platform (GCP)  and Alibaba Cloud.[28][29]

AWS offers Elasticsearch as a managed service since 2015.[30][31][32] Such managed services provide hosting, deployment, backup and other support.[33] Most managed services also include support for Kibana.

Elasticsearch is the basis of Pangeanic's contribution to the EU's Marie Curie research project "EXPERT"[34] called ActivaTM. Pangeanic built a bilingual database compatible with Computer-Assisted Translation tools, which could offer real-time access via API from a variety of tools. The project received further funding from the EU as the National and European Central Translation Memory project[35] under the Connecting Europe Facility (CEF) programme. NEC TM aims to centralise national translation assets in all the EU's Member States so countries can re-use bilingual translation data produced as a result of public procurement contracts.

Reported Elasticsearch data breaches

  • 2018-11-15 AWS Elasticsearch database belonging to VoxOx exposed tens of millions of text messages, including password reset links, two-factor codes, shipping notifications and more.[36]
  • 2018-11-27 Elasticsearch database belonging to Urban Massage exposed more than 309,000 user records, including names, email addresses and phone numbers.[37]
  • 2019-01-12 Elasticsearch server belonging to do-it-yourself chain, B&Q exposed personal details of individuals caught or suspected of stealing goods from stores.[38][39]
  • 2019-01-21 Elasticsearch database belonging to Youth-run agency AIESEC exposed over 4 million intern applications including the applicant’s name, gender, date of birth, and the reasons why the person was applying for the internship.[40]
  • 2019-01-23 Elasticsearch database belonging to Ascension Data and Analytics exposed 24 million financial and banking documents, representing tens of thousands of loans and mortgages from some of the biggest banks in the U.S.[41]
  • 2019-09-13 Elasticsearch database belonging to Dealer Leads exposed 198 million car buying records which contained the personal information of customers.[42]
  • 2019-10-26 Elasticsearch database belonging to Adobe exposed 7.5 million customer records which contained email addresses, Adobe member IDs (usernames), country of origin, and what Adobe products they were using.[43]
  • 2019-11-19 Elasticsearch database belonging to Conrad Electronic exposed 14 million customer records which contained postal addresses, in parts fax- and telephone numbers as well as IBANs on a fifth of the exposed data-records.[44]
gollark: What of the code-guessing thing there? That's sometimes relevant.
gollark: I have prepared things, now to check it against the messages in existence.
gollark: Okay, no, that wouldn't work as the message DOM nodes don't exist when scrolled down.
gollark: Maybe it would be easier to do this in JS.
gollark: This is going slower than expected due to annoying clipboard weirdness.

See also

References

  1. "Elasticsearch Releases". Retrieved 28 July 2020 via GitHub.
  2. GitHub - elastic/elasticsearch: Open Source, Distributed, RESTful Search Engine., elastic, 14 March 2019, retrieved 14 March 2019
  3. "No, Elastic X-Pack is not going to be open source - according to Elastic themselves -". Flax.co.uk. 2 March 2018. Retrieved 14 March 2019.
  4. "Official Website". Elasticsearch.org. Retrieved 4 February 2014.
  5. "DB-Engines Ranking - popularity ranking of search engines". db-engines.com. Retrieved 10 January 2016.
  6. Banon, Shay. "The Future of Compass & ElasticSearch".
  7. Banon, Shay (8 February 2010). "You Know, for Search". Archived from the original on 16 January 2013.
  8. "Immediate Insight from Data Matters". elastic.co. Retrieved 25 March 2015.
  9. "ElasticSearch Scores $70M In Series C To Fund Growth Spurt". TechCrunch. AOL. Retrieved 25 March 2015.
  10. "Elasticsearch Changes Name to Elastic to Reflect Wide Adoption Beyond Search". Elastic.co. Retrieved 19 October 2016.
  11. Schleifer, Theodore (21 June 2018). "The IPOs keep coming: The search company Elastic has filed to go public". Recode. Archived from the original on 5 October 2018. Retrieved 22 June 2018.
  12. Banon, Shay (5 October 2018). "Ze Bell Has Rung: Thank You Users, Customers, and Partners". Elastic (NV). Retrieved 24 October 2018.
  13. "How to monitor Elasticsearch performance".
  14. "Elastic brings order to its product line with Elastic Stack". Social.techcrunch.com. Retrieved 1 April 2019.
  15. "percolate at elasticsearch.org reference". Elasticsearch.org. Archived from the original on 2 October 2013. Retrieved 4 February 2014.
  16. "Percolating" is a term peculiar to Elasticsearch. Percolating is a reverse search: instead of returning all the documents that match a search query, percolating returns all the (stored) search queries that match a document as their output. Nunn, Xavier; "Detecting data leaks in real time with a custom percolator", Serena Capital blogs, 2019-January-8
  17. "elasticsearch Guide: Gateway". Elasticsearch.org. Retrieved 19 April 2013.
  18. "Elasticsearch as database". Karussell.wordpress.com. Retrieved 4 February 2014.
  19. "No transaction support". Elasticsearch-users.115913.n3.nabble.com. 8 July 2010. Retrieved 4 February 2014.
  20. "Security for Elasticsearch is now free". Elastic Blog. 20 May 2019. Retrieved 17 June 2019.
  21. "Doubling Down on Open". Elastic Blog. 27 February 2018. Retrieved 24 October 2019.
  22. "Introducing Elastic SIEM". Elastic Blog. 25 June 2019. Retrieved 2 March 2020.
  23. "Introducing Machine Learning for the Elastic Stack". Elastic Blog. 4 May 2017. Retrieved 2 March 2020.
  24. Oliver, Andrew C. (10 March 2015). "Elasticsearch buys into search as a service, rebrands as 'Elastic'". InfoWorld.com. Retrieved 1 April 2019.
  25. "Elastic acquires search startup Swiftype". Social.techcrunch.com. Retrieved 1 April 2019.
  26. "Open Source Search & Analytics · Elasticsearch - Elastic". Elastic.co. Retrieved 22 April 2019.
  27. "Elastic Cloud: Hosted Elasticsearch, Hosted Search | Elastic". Elastic.co. Retrieved 1 April 2019.
  28. Yegulalp, Serdar (7 April 2017). "Google Cloud to host open source Elasticsearch". InfoWorld.com. Retrieved 1 April 2019.
  29. "Alibaba Cloud to Offer Elasticsearch, Kibana, and X-Pack in China". Elastic.co. 13 October 2017. Retrieved 1 April 2019.
  30. "New – Amazon Elasticsearch Service". Amazon Web Services. 1 October 2015. Retrieved 22 April 2019.
  31. "Amazon Elasticsearch Service – Amazon Web Services (AWS)". Amazon Web Services, Inc. (in Latin). Retrieved 8 February 2019.
  32. "Hosted Elasticsearch & Kibana on AWS". Elastic.co. Retrieved 16 October 2016.
  33. "Elasticsearch Setup". Ctovision.com. Archived from the original on 21 August 2018. Retrieved 16 October 2016.
  34. "EXPERT (EXPloiting Empirical appRoaches to Translation" (PDF). Expert-itn.eu. Retrieved 13 February 2019.
  35. "National and European Central Translation Memory NEC TM". Nec-tm.eu. Retrieved 13 February 2019.
  36. Zack, Whittaker. "A leaky database of SMS text messages exposed password resets and two-factor codes". TechCrunch.com. Retrieved 24 January 2019.
  37. Zack, Whittaker. "Urban Massage exposed a huge customer database, including sensitive comments on its creepy clients". TechCrunch.com. Retrieved 24 January 2019.
  38. "B&Q 'exposed data about store thieves'". BBC News. Retrieved 28 January 2019.
  39. Lee Johnstone (24 January 2019). "When Security Fails, 70,000 Offender and Incident Logs Exposed". Ctrlbox Information Security. Ctrlbox Information Security. Retrieved 5 February 2019.
  40. Zack, Whittaker. "Youth-run agency AIESEC exposed over 4 million intern applications". TechCrunch.com. Retrieved 24 January 2019.
  41. Zack, Whittaker. "Millions of bank loan and mortgage documents have leaked online". TechCrunch.com. Retrieved 24 January 2019.
  42. Muncaster, Phil. "Marketer Exposes 198 Million Car Buyer Records". infosecurity-magazine.com. Retrieved 30 September 2019.
  43. Cimpanu, Catalin. "Adobe left 7.5 million Creative Cloud user records exposed online". zdnet.com. Retrieved 19 November 2019.
  44. Conrad, Werner. "Datenpanne: Conrad Electronic Gruppe informiert vorsorglich Kunden". conrad.de. Retrieved 19 November 2019.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.