Apache Solr vs. ES: A Comparison

阿新 • Published: 2019-02-15

http://solr-vs-elasticsearch.com/

Apache Solr vs Elasticsearch

The Feature Smackdown

API

Feature	Solr 6.2.1	ElasticSearch 5.0
Format	XML, CSV, JSON	JSON
HTTP REST API
Binary API	SolrJ	TransportClient, Thrift (through a plugin)
JMX support		ES specific stats are exposed through the REST API
Official client libraries	Java	Java, Groovy, PHP, Ruby, Perl, Python, .NET, JavaScript
Community client libraries	PHP, Ruby, Perl, Scala, Python, .NET, JavaScript, Go, Erlang, Clojure	Clojure, Cold Fusion, Erlang, Go, Groovy, Haskell, Java, JavaScript, .NET, OCaml, Perl, PHP, Python, R, Ruby, Scala, Smalltalk, Vert.x
3rd-party product integration (open-source)	Drupal, Magento, Django, ColdFusion, Wordpress, OpenCMS, Plone, Typo3, ez Publish, Symfony2, Riak (via Yokozuna)	Drupal, Django, Symfony2, Wordpress, CouchBase
3rd-party product integration (commercial)	DataStax Enterprise Search, Cloudera Search, Hortonworks Data Platform, MapR	SearchBlox, Hortonworks Data Platform, MapR, etc.
Output	JSON, XML, PHP, Python, Ruby, CSV, Velocity, XSLT, native Java	JSON, XML/HTML (via plugin)
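The Format and Output rows above can be made concrete with a small sketch of how a search request might be built for each engine. It only constructs the requests and sends nothing. The hosts, the `products` collection/index name, and the helper function names are assumptions for illustration, not part of the original comparison.

```python
import json
from typing import Tuple
from urllib.parse import urlencode

# Hypothetical hosts and collection/index names, for illustration only.
SOLR_BASE = "http://localhost:8983/solr/products"
ES_BASE = "http://localhost:9200/products"


def solr_select_url(query: str, rows: int = 10, wt: str = "json") -> str:
    """Build a Solr /select URL; `wt` chooses the response writer (json, xml, csv, ...)."""
    params = {"q": query, "rows": rows, "wt": wt}
    return SOLR_BASE + "/select?" + urlencode(params)


def es_search_request(query: str, size: int = 10) -> Tuple[str, str]:
    """Build the URL and JSON body for an Elasticsearch _search call."""
    body = {"query": {"query_string": {"query": query}}, "size": size}
    return ES_BASE + "/_search", json.dumps(body)


if __name__ == "__main__":
    print(solr_select_url("title:solr"))
    print(es_search_request("title:solr"))
```

In Solr the output format is a request parameter, so the same query can come back as JSON, XML, or CSV by changing `wt`, whereas the Elasticsearch request and response are both JSON.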

Infrastructure

Feature	Solr 6.2.1	ElasticSearch 5.0
Master-slave replication	Only in non-SolrCloud. In SolrCloud, behaves identically to ES.	Not an issue because shards are replicated across nodes.
Integrated snapshot and restore	Filesystem	Filesystem, AWS Cloud Plugin for S3 repositories, HDFS Plugin for Hadoop environments, Azure Cloud Plugin for Azure storage repositories

Indexing

Feature	Solr 6.2.1	ElasticSearch 5.0
Data Import	DataImportHandler - JDBC, CSV, XML, Tika, URL, Flat File	[DEPRECATED in 2.x] Rivers modules - ActiveMQ, Amazon SQS, CouchDB, Dropbox, DynamoDB, FileSystem, Git, GitHub, Hazelcast, JDBC, JMS, Kafka, LDAP, MongoDB, neo4j, OAI, RabbitMQ, Redis, RSS, Sofa, Solr, St9, Subversion, Twitter, Wikipedia
ID field for updates and deduplication
DocValues
Partial Doc Updates	with stored fields	with _source field
Custom Analyzers and Tokenizers
Per-field analyzer chain
Per-doc/query analyzer chain
Index-time synonyms		Supports Solr and Wordnet synonym format
Query-time synonyms	Technically, yes, but practically no because multi-word/phrase query-time synonyms are not supported. See ES docs and hon-lucene-synonyms blog for nuances.
Multiple indexes
Near-Realtime Search/Indexing
Complex documents
Schemaless	4.4+
Multiple document types per schema	One set of fields per schema, one schema per core
Online schema changes	Schemaless mode or via dynamic fields.	Only backward-compatible changes.
Apache Tika integration
Dynamic fields
Field copying		via multi-fields
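As a rough illustration of the Partial Doc Updates row, the sketch below builds the request bodies each engine expects. It assumes the Solr unique key field is named `id`; the helper names and sample fields are hypothetical.

```python
import json


def solr_atomic_update(doc_id: str, changes: dict) -> str:
    """Solr atomic update body: each changed field is wrapped in a {"set": value} modifier.

    Assumes the schema's unique key field is called "id".
    """
    doc = {"id": doc_id}
    for field, value in changes.items():
        doc[field] = {"set": value}
    # Solr's JSON update handler accepts a list of documents.
    return json.dumps([doc])


def es_partial_update(changes: dict) -> str:
    """Elasticsearch _update body: changed fields go under "doc" and are merged into _source."""
    return json.dumps({"doc": changes})


if __name__ == "__main__":
    print(solr_atomic_update("42", {"price": 19.99}))
    print(es_partial_update({"price": 19.99}))
```

The Solr body would be posted to the collection's update handler and the Elasticsearch body to the document's `_update` endpoint. In both cases the engine rebuilds the full document internally, which is why Solr relies on stored fields and Elasticsearch on `_source`.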

Searching

Feature	Solr 6.2.1	ElasticSearch 5.0
Lucene Query parsing
Structured Query DSL	Need to programmatically create queries if going beyond Lucene query syntax.
Span queries
Spatial/geo search
Multi-point spatial search
Faceting		Top N term accuracy can be controlled with shard_size
Geo-distance Faceting
Pivot Facets
More Like This
Boosting by functions
Boosting using scripting languages
Push Queries	Percolation. Distributed percolation supported in 1.0
Field collapsing/Results grouping
Wordlist-based Spellcheck
Autocomplete
Intra-index joins	via parent-child query	via has_child and top_children queries
Inter-index joins	Joined index has to be single-shard and replicated across all nodes.
Resultset Scrolling	New to 4.7.0	via scan search type
Filter queries		also supports filtering by native scripts
Filter execution order	local params and cache property
Alternative QueryParsers	DisMax, eDisMax	query_string, dis_max, match, multi_match etc
Negative boosting	but awkward. Involves positively boosting the inverse set of negatively-boosted documents.
Search across multiple indexes	it can search across multiple compatible collections
Result highlighting
Custom Similarity
Searcher warming on index reload
Term Vectors API
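The Negative boosting row is easier to follow with an example. The sketch below contrasts the "boost the inverse set" workaround, expressed as an eDisMax `bq` parameter in Solr, with a `boosting` query in the Elasticsearch Query DSL. The field names, boost values, and helper names are assumptions for illustration.

```python
def solr_demote(user_query: str, unwanted: str, boost: int = 10) -> dict:
    """Demote documents matching `unwanted` by boosting everything that does NOT match it.

    `(*:* -clause)` selects the inverse set, which then receives a positive boost.
    """
    return {
        "defType": "edismax",
        "q": user_query,
        "bq": "(*:* -{})^{}".format(unwanted, boost),
    }


def es_demote(user_query: str, field: str, value: str, negative_boost: float = 0.2) -> dict:
    """Demote documents directly: matches of the negative clause have their score
    multiplied by negative_boost."""
    return {
        "query": {
            "boosting": {
                "positive": {"query_string": {"query": user_query}},
                "negative": {"term": {field: value}},
                "negative_boost": negative_boost,
            }
        }
    }


if __name__ == "__main__":
    print(solr_demote("laptop", "category:clearance"))
    print(es_demote("laptop", "category", "clearance"))
```

The Solr version never lowers a score; it raises the score of every other document, which is the awkwardness the table refers to.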

Customizability

Feature	Solr 6.2.1	ElasticSearch 5.0
Pluggable API endpoints
Pluggable search workflow	via SearchComponents
Pluggable Analyzers/Tokenizers
Pluggable QueryParsers
Pluggable Field Types
Pluggable Function queries
Pluggable scoring scripts
Pluggable hashing
Pluggable webapps		[site plugins DEPRECATED in 5.x]
Automated plugin installation		Installable from GitHub, maven, sonatype or elasticsearch.org

Distributed

Feature	Solr 6.2.1	ElasticSearch 5.0
Self-contained cluster	Depends on separate ZooKeeper server	Only Elasticsearch nodes
Automatic node discovery	ZooKeeper	internal Zen Discovery or ZooKeeper
Partition tolerance	The partition without a ZooKeeper quorum will stop accepting indexing requests or cluster state changes, while the partition with a quorum continues to function.	Partitioned clusters can diverge unless discovery.zen.minimum_master_nodes is set to at least N/2+1, where N is the size of the cluster. If configured correctly, the partition without a quorum will stop operating, while the other continues to work.
Automatic failover	If all nodes storing a shard and its replicas fail, client requests will fail, unless requests are made with the shards.tolerant=true parameter, in which case partial results are returned from the available shards.
Automatic leader election
Shard replication
Sharding
Automatic shard rebalancing		It can be machine, rack, availability zone, and/or data center aware. Arbitrary tags can be assigned to nodes, and it can be configured not to assign a shard and its replicas to nodes with the same tags.
Change # of shards	Shards can be added (when using implicit routing) or split (when using compositeId). Cannot be lowered. Replicas can be increased anytime.	each index has 5 shards by default. Number of primary shards cannot be changed once the index is created. Replicas can be increased anytime.
Shard splitting
Relocate shards and replicas	Can be done by creating a replica of the shard on the desired node and then removing the shard from the source node.	Can move shards and replicas to any node in the cluster on demand.
Control shard routing	shards or _route_ parameter	routing parameter
Pluggable shard/replica assignment	Probabilistic shard balancing with Tempest plugin
Consistency	Indexing requests are synchronous with replication. An indexing request won't return until all replicas respond. No check for downed replicas. They will catch up when they recover. When new replicas are added, they won't start accepting and responding to requests until they are finished replicating the index.	Replication between nodes is synchronous by default, thus ES is consistent by default, but it can be set to asynchronous on a per document indexing basis. Index writes can be configured to fail if there are not sufficient active shard replicas. The default is quorum, but all or one are also available.
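The N/2+1 rule from the Partition tolerance row can be worked through with a few lines of arithmetic. This is a minimal sketch: the function names are hypothetical, and N is taken here as the number of master-eligible nodes, which is an assumption (the table says "the size of the cluster").

```python
def minimum_master_nodes(master_eligible: int) -> int:
    """Smallest strict majority of N nodes: N/2 + 1, using integer division."""
    return master_eligible // 2 + 1


def partition_has_quorum(partition_size: int, cluster_size: int) -> bool:
    """A partition may keep operating only if it holds a majority of the cluster."""
    return partition_size >= minimum_master_nodes(cluster_size)


if __name__ == "__main__":
    for n in (3, 4, 5):
        print(n, "nodes -> quorum of", minimum_master_nodes(n))
    # A 5-node cluster split 3/2: only the 3-node side keeps a quorum.
    print(partition_has_quorum(3, 5), partition_has_quorum(2, 5))
```

Because two disjoint partitions cannot both hold a strict majority, at most one side of a split can keep accepting writes, which is what prevents the divergence described in the table.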

Misc
