## Analysis and Analyzers

**Analysis** is the process of:

* first, tokenizing a block of text into individual **terms** suitable for use in an inverted index,
* then normalizing these terms into a standard form to improve their "searchability" or **recall**.

This job is performed by **analyzers**. An **analyzer** is really just a wrapper that combines three functions into a single package:

### Character filters

First, the string is passed through any **character filters** in turn. Their job is to tidy up the string before tokenization. A character filter could be used to strip out HTML, or to convert `"&"` characters to the word `"and"`.

### Tokenizer

Next, the string is tokenized into individual **terms** by a **tokenizer**. A simple tokenizer might split the text up into terms whenever it encounters whitespace or punctuation.

### Token filters

Last, each term is passed through any **token filters** in turn, which can change terms (e.g. lowercasing `"Quick"`), remove terms (e.g. stopwords like `"a"`, `"and"`, `"the"`), or add terms (e.g. synonyms like `"jump"` and `"leap"`).

Elasticsearch provides many character filters, tokenizers and token filters out of the box. These can be combined to create custom analyzers suitable