looly
diff --git a/120_Proximity_Matching/00_Intro.md b/120_Proximity_Matching/00_Intro.md
+38
diff --git a/120_Proximity_Matching/05_Phrase_matching.md b/120_Proximity_Matching/05_Phrase_matching.md
+120
diff --git a/120_Proximity_Matching/10_Slop.md b/120_Proximity_Matching/10_Slop.md
+61
diff --git a/120_Proximity_Matching/15_Multi_value_fields.md b/120_Proximity_Matching/15_Multi_value_fields.md
+83
diff --git a/120_Proximity_Matching/20_Scoring.md b/120_Proximity_Matching/20_Scoring.md
+54
@@ -0,0 +1,38 @@
+[[proximity-matching]]
+== Proximity Matching
+
+Standard full-text search with TF/IDF treats documents, or at least each field
+within a document, as a big _bag of words_.((("proximity matching")))  The `match` query can tell us whether
+that bag contains our search terms, but that is only part of the story.
+It can't tell us anything about the relationship between words.
+
+Consider the difference between these sentences:
+
+* Sue ate the alligator.
+* The alligator ate Sue.
+* Sue never goes anywhere without her alligator-skin purse.
+
+A `match` query for `sue alligator` would match all three documents, but it
+doesn't tell us whether the two words form part of the same idea, or even the same
+paragraph.
+
+Understanding how words relate to each other is a complicated problem, and
+we can't solve it just by using another type of query.
+We can, however, at least find words that appear to be related because they
+occur near each other, or even right next to each other.
+
+Each document may be much longer than the examples we have presented: `Sue`
+and `alligator` may be separated by paragraphs of other text. Perhaps we still
+want to return documents in which the words are widely separated, but
+give a higher relevance score to documents in which the words are close
+together.
+
+This is the province of _phrase matching_, or _proximity matching_.
+
+[TIP]
+==================================================
+
+In this chapter, we are using the same example documents that we used for
+the <<match-test-data,`match` query>>.
+
+==================================================
@@ -0,0 +1,120 @@
+[[phrase-matching]]
+=== Phrase Matching
+
+In the same way that the `match` query is the go-to query for standard
+full-text search, the `match_phrase` query((("proximity matching", "phrase matching")))((("phrase matching")))((("match_phrase query"))) is the one you should reach for
+when you want to find words that are near each other:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/my_type/_search
+{
+    "query": {
+        "match_phrase": {
+            "title": "quick brown fox"
+        }
+    }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/05_Match_phrase_query.json
+
+Like the `match` query, the `match_phrase` query first analyzes the query
+string to produce a list of terms. It then searches for all the terms, but
+keeps only documents that contain _all_ of the search terms, in the same
+_positions_ relative to each other.  A query for the phrase `quick fox`
+would not match any of our documents, because no document contains the word
+`quick` immediately followed by `fox`.
+
+[TIP]
+==================================================
+
+The `match_phrase` query can also be written as a `match` query with type
+`phrase`:
+
+[source,js]
+--------------------------------------------------
+"match": {
+    "title": {
+        "query": "quick brown fox",
+        "type":  "phrase"
+    }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/05_Match_phrase_query.json
+
+==================================================
+
+==== Term Positions
+
+When a string is analyzed, the analyzer returns not((("phrase matching", "term positions")))((("match_phrase query", "position of terms")))((("position-aware matching"))) only a list of terms, but
+also the _position_, or order, of each term in the original string:
+
+[source,js]
+--------------------------------------------------
+GET /_analyze?analyzer=standard
+Quick brown fox
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/05_Term_positions.json
+
+This returns the following:
+
+[role="pagebreak-before"]
+[source,js]
+--------------------------------------------------
+{
+   "tokens": [
+      {
+         "token": "quick",
+         "start_offset": 0,
+         "end_offset": 5,
+         "type": "<ALPHANUM>",
+         "position": 1 <1>
+      },
+      {
+         "token": "brown",
+         "start_offset": 6,
+         "end_offset": 11,
+         "type": "<ALPHANUM>",
+         "position": 2 <1>
+      },
+      {
+         "token": "fox",
+         "start_offset": 12,
+         "end_offset": 15,
+         "type": "<ALPHANUM>",
+         "position": 3 <1>
+      }
+   ]
+}
+--------------------------------------------------
+<1> The `position` of each term in the original string.
+
+Positions can be stored in the inverted index, and position-aware queries like
+the `match_phrase` query can use them to match only documents that contain
+all the words in exactly the order specified, with no words in between.
+
+==== What Is a Phrase
+
+For a document to be considered a((("match_phrase query", "documents matching a phrase")))((("phrase matching", "criteria for matching documents"))) match for the phrase ``quick brown fox,'' the following must be true:
+
+* `quick`, `brown`, and `fox` must all appear in the field.
+
+* The position of `brown` must be `1` greater than the position of `quick`.
+
+* The position of `fox` must be `2` greater than the position of `quick`.
+
+If any of these conditions is not met, the document is not considered a match.
+
+[TIP]
+==================================================
+
+Internally, the `match_phrase` query uses the low-level `span` query family to
+do position-aware matching. ((("match_phrase query", "use of span queries for position-aware matching")))((("span queries")))Span queries are term-level queries, so they have
+no analysis phase; they search for the exact term specified.
+
+Thankfully, most people never need to use the `span` queries directly, as the
+`match_phrase` query is usually good enough. However, certain specialized
+fields, like patent searches, use these low-level queries to perform very
+specific, carefully constructed positional searches.
+
+==================================================
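+
+As a rough illustration of what a hand-written positional query can look like,
+the following sketch uses `span_near` with two `span_term` clauses to find
+`quick` followed by `fox` with at most one position in between. It assumes the
+same `title` field as the earlier examples, analyzed with the `standard`
+analyzer. Because span queries are not analyzed, the terms are written in the
+lowercased form in which they would have been indexed. It is not meant to
+reproduce the exact query that `match_phrase` builds internally:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/my_type/_search
+{
+    "query": {
+        "span_near": {
+            "clauses": [
+                { "span_term": { "title": "quick" }},
+                { "span_term": { "title": "fox"   }}
+            ],
+            "slop":     1,
+            "in_order": true
+        }
+    }
+}
+--------------------------------------------------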
@@ -0,0 +1,61 @@
+[[slop]]
+=== Mixing It Up
+
+Requiring exact-phrase matches ((("proximity matching", "slop parameter")))may be too strict a constraint. Perhaps we _do_
+want documents that contain ``quick brown fox'' to be considered a match for
+the query ``quick fox,'' even though the positions aren't exactly equivalent.
+
+We can introduce a degree ((("slop parameter")))of flexibility into phrase matching by using the
+`slop` parameter:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/my_type/_search
+{
+    "query": {
+        "match_phrase": {
+            "title": {
+                "query": "quick fox",
+                "slop":  1
+            }
+        }
+    }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/10_Slop.json
+
+The `slop` parameter tells the `match_phrase` query how((("match_phrase query", "slop parameter"))) far apart terms are
+allowed to be while still considering the document a match. By _how far
+apart_ we mean _how many times do you need to move a term in order to make
+the query and document match_?
+
+We'll start with a simple example. To make the query `quick fox` match
+a document containing `quick brown fox` we need a `slop` of just `1`:
+
+
+                Pos 1         Pos 2         Pos 3
+    -----------------------------------------------
+    Doc:        quick         brown         fox
+    -----------------------------------------------
+    Query:      quick         fox
+    Slop 1:     quick                 ↳     fox
+
+Although all words need to be present in phrase matching, even when using `slop`,
+the words don't necessarily need to be in the same sequence in order to
+match. With a high enough `slop` value, words can be arranged in any order.
+
+To make the query `fox quick` match our document, we need a `slop` of `3`:
+
+                Pos 1         Pos 2         Pos 3
+    -----------------------------------------------
+    Doc:        quick         brown         fox
+    -----------------------------------------------
+    Query:      fox           quick
+    Slop 1:     fox|quick  ↵  <1>
+    Slop 2:     quick      ↳  fox
+    Slop 3:     quick                 ↳     fox
+
+<1> Note that `fox` and `quick` occupy the same position in this step.
+    Switching word order from `fox quick` to `quick fox` thus requires two
+    steps, or a `slop` of `2`.
+
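+The reversed-order walk-through can be tried directly. This sketch assumes the
+same index and `title` field as the previous request. With a `slop` of `3`,
+the query `fox quick` should match a document containing `quick brown fox`;
+lowering the `slop` to `2` should exclude it:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/my_type/_search
+{
+    "query": {
+        "match_phrase": {
+            "title": {
+                "query": "fox quick",
+                "slop":  3
+            }
+        }
+    }
+}
+--------------------------------------------------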
@@ -0,0 +1,83 @@
+=== Multivalue Fields
+
+A curious thing can happen when you try to use phrase matching on multivalue
+fields. ((("proximity matching", "on multivalue fields")))((("match_phrase query", "on multivalue fields"))) Imagine that you index this document:
+
+[source,js]
+--------------------------------------------------
+PUT /my_index/groups/1
+{
+    "names": [ "John Abraham", "Lincoln Smith"]
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/15_Multi_value_fields.json
+
+Then run a phrase query for `Abraham Lincoln`:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/groups/_search
+{
+    "query": {
+        "match_phrase": {
+            "names": "Abraham Lincoln"
+        }
+    }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/15_Multi_value_fields.json
+
+Surprisingly, our document matches, even though `Abraham` and `Lincoln`
+belong to two different people in the `names` array. The reason for this comes
+down to the way arrays are indexed in Elasticsearch.
+
+When `John Abraham` is analyzed, it produces this:
+
+* Position 1: `john`
+* Position 2: `abraham`
+
+Then when `Lincoln Smith` is analyzed, it produces this:
+
+* Position 3: `lincoln`
+* Position 4: `smith`
+
+In other words, Elasticsearch produces exactly the same list of tokens as it would have
+for the single string `John Abraham Lincoln Smith`. Our example query
+looks for `abraham` directly followed by `lincoln`, and these two terms do
+indeed exist, and they are right next to each other, so the query matches.
+
+Fortunately, there is a simple workaround for cases like these, called the
+`position_offset_gap`, which((("mapping (types)", "position_offset_gap")))((("position_offset_gap"))) we need to configure in the field mapping:
+
+[source,js]
+--------------------------------------------------
+DELETE /my_index/groups/ <1>
+
+PUT /my_index/_mapping/groups <2>
+{
+    "properties": {
+        "names": {
+            "type":                "string",
+            "position_offset_gap": 100
+        }
+    }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/15_Multi_value_fields.json
+
+<1> First delete the `groups` mapping and all documents of that type.
+<2> Then create a new `groups` mapping with the correct values.
+
+The `position_offset_gap` setting tells Elasticsearch that it should increase
+the current term `position` by the specified value for every new array
+element.  So now, when we index the array of names, the terms are emitted with
+the following positions:
+
+* Position 1: `john`
+* Position 2: `abraham`
+* Position 103: `lincoln`
+* Position 104: `smith`
+
+Our phrase query would no longer match a document like this because `abraham`
+and `lincoln` are now 100 positions apart. You would have to add a `slop`
+value of 100 in order for this document to match.
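+
+To see the gap at work, the earlier phrase query can be rerun with an explicit
+`slop`. This sketch assumes that the example document has been indexed again
+after the mapping change:
+
+[source,js]
+--------------------------------------------------
+GET /my_index/groups/_search
+{
+    "query": {
+        "match_phrase": {
+            "names": {
+                "query": "Abraham Lincoln",
+                "slop":  100
+            }
+        }
+    }
+}
+--------------------------------------------------
+
+A phrase query for `John Abraham`, whose terms come from the same array
+element, is unaffected by the gap and needs no `slop` at all.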
@@ -0,0 +1,54 @@
+=== Closer Is Better
+
+Whereas a phrase query simply excludes documents that don't contain the exact
+query phrase, a _proximity query_&#x2014;a ((("proximity matching", "proximity queries")))((("slop parameter", "proximity queries and")))phrase query where `slop` is greater
+than `0`&#x2014;incorporates the proximity of the query terms into the final
+relevance `_score`. By setting a high `slop` value like `50` or `100`, you can
+exclude documents in which the words are really too far apart, but give a higher
+score to documents in which the words are closer together.
+
+The following proximity query for `quick dog` matches both documents that
+contain the words `quick` and `dog`, but gives a higher score to the
+document((("relevance scores", "for proximity queries"))) in which the words are nearer to each other:
+
+[source,js]
+--------------------------------------------------
+POST /my_index/my_type/_search
+{
+   "query": {
+      "match_phrase": {
+         "title": {
+            "query": "quick dog",
+            "slop":  50 <1>
+         }
+      }
+   }
+}
+--------------------------------------------------
+// SENSE: 120_Proximity_Matching/20_Scoring.json
+
+<1> Note the high `slop` value.
+
+[source,js]
+--------------------------------------------------
+{
+  "hits": [
+     {
+        "_id":      "3",
+        "_score":   0.75, <1>
+        "_source": {
+           "title": "The quick brown fox jumps over the quick dog"
+        }
+     },
+     {
+        "_id":      "2",
+        "_score":   0.28347334, <2>
+        "_source": {
+           "title": "The quick brown fox jumps over the lazy dog"
+        }
+     }
+  ]
+}
+--------------------------------------------------
+<1> Higher score because `quick` and `dog` are close together.
+<2> Lower score because `quick` and `dog` are further apart.