Elasticsearch analyzers: character filter, tokenizer, token filter

2023-09-02 18:14 · 孫攀龍

Analyzers:
Normalization
Character filter
Tokenizer
Token filter

Every analyzer, whether built-in or custom, is assembled from three kinds of building blocks: character filters, tokenizers, and token filters. The built-in analyzers pre-package these blocks into analyzers suited to different languages and types of text.

Character filters

A character filter receives the original text as a stream of characters and can transform that stream by adding, removing, or changing characters. For example, a character filter could convert Arabic-Indic digits (٠١٢٣٤٥٦٧٨٩) into their Arabic-Latin equivalents (0123456789). An analyzer may have zero or more character filters, and they are applied in order. (PS: this resembles the filters or interceptors in a Servlet container; picture a filter chain.)

Tokenizer

A tokenizer receives a stream of characters, breaks it into individual tokens (usually individual words), and outputs a stream of tokens. For example, a whitespace tokenizer splits the text wherever it sees whitespace, so it converts "Quick brown fox!" into [Quick, brown, fox!]. (PS: the tokenizer is responsible for splitting text into individual tokens, where a token is simply one word. A piece of text is cut into several parts, much like String.split in Java.) The tokenizer also records the order or position of each term, and the start and end character offsets of the original word that the term represents. (PS: the output of tokenization is an array of terms.) An analyzer must have exactly one tokenizer.

Token filters

A token filter receives the token stream and may add, remove, or change tokens. For example, a lowercase token filter converts all tokens to lowercase, a stop token filter removes common words such as "the", and a synonym token filter introduces synonyms into the token stream. Token filters are not allowed to change the position or character offsets of each token. An analyzer may have zero or more token filters, and they are applied in order.

Summary & review

An analyzer is a package made of three parts: character filters, a tokenizer, and token filters.
An analyzer can have zero or more character filters.
An analyzer has exactly one tokenizer.
An analyzer can have zero or more token filters.
A character filter converts characters: it receives a character stream and outputs a character stream.
A tokenizer splits text: it receives a character stream and outputs a token stream (the words produced by splitting the text are called tokens).
A token filter filters tokens: it receives a token stream and outputs a token stream.
So the job of the whole analyzer is to split text into individual words: text ----> characters ----> tokens

1 normalization: document normalization, which improves recall

Stop words
Tense conversion
Case conversion
Synonyms
Interjections

    #normalization
    GET _analyze
    {
      "text": "Mr. Ma is an excellent teacher",
      "analyzer": "english"
    }

2 Character filter: preprocessing before tokenization, filtering out useless characters

HTML Strip
Mapping
Pattern Replace

HTML Strip

The html_strip character filter removes HTML elements and decodes HTML entities; tags listed in escaped_tags are left in place.

    ##HTML Strip Character Filter
    ###Test data: I'm so happy!
    # my_char_filter is a user-chosen name for the character filter
    DELETE my_index
    PUT my_index
    {
      "settings": {
        "analysis": {
          "char_filter": {
            "my_char_filter": {
              "type": "html_strip",
              "escaped_tags": ["a"]
            }
          },
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "keyword",
              "char_filter": ["my_char_filter"]
            }
          }
        }
      }
    }
    GET my_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": "I'm so happy!"
    }

Mapping

    ##Mapping Character Filter
    # Replaces each listed character with *; the test sentence is an insult ("You are garbage! Get lost")
    DELETE my_index
    PUT my_index
    {
      "settings": {
        "analysis": {
          "char_filter": {
            "my_char_filter": {
              "type": "mapping",
              "mappings": [
                "滚 => *",
                "垃 => *",
                "圾 => *"
              ]
            }
          },
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "keyword",
              "char_filter": ["my_char_filter"]
            }
          }
        }
      }
    }
    GET my_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": "你就是个垃圾!滚"
    }

Pattern Replace

    ##Pattern Replace Character Filter
    #17611001200
    # Masks the middle four digits of an 11-digit mobile number
    DELETE my_index
    PUT my_index
    {
      "settings": {
        "analysis": {
          "char_filter": {
            "my_char_filter": {
              "type": "pattern_replace",
              "pattern": "(\\d{3})\\d{4}(\\d{4})",
              "replacement": "$1****$2"
            }
          },
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "keyword",
              "char_filter": ["my_char_filter"]
            }
          }
        }
      }
    }
    GET my_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": "您的手机号是17611001200"
    }

3 Token filter: stop words, tense conversion, case conversion, synonym conversion, interjection handling, and so on. For example: has => have, him => he, apples => apple, the/oh/a => removed.

Case conversion
Tense conversion
Stop words
Synonyms
Interjections

Synonyms loaded from a file:

    #token filter
    DELETE test_index
    PUT /test_index
    {
      "settings": {
        "analysis": {
          "filter": {
            "my_synonym": {
              "type": "synonym_graph",
              "synonyms_path": "analysis/synonym.txt"
            }
          },
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "ik_max_word",
              "filter": ["my_synonym"]
            }
          }
        }
      }
    }
    GET test_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": ["蒙丢丢,大G,霸道,daG"]
    }
    GET test_index/_analyze
    {
      "analyzer": "ik_max_word",
      "text": ["奔驰G级"]
    }

Synonym matching with inline rules:

    DELETE test_index
    PUT /test_index
    {
      "settings": {
        "analysis": {
          "filter": {
            "my_synonym": {
              "type": "synonym",
              "synonyms": ["赵,钱,孙,李=>吴", "周=>王"]
            }
          },
          "analyzer": {
            "my_analyzer": {
              "tokenizer": "standard",
              "filter": ["my_synonym"]
            }
          }
        }
      }
    }
    GET test_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": ["赵,钱,孙,李", "周"]
    }

Case conversion

    #Convert to lowercase
    GET test_index/_analyze
    {
      "tokenizer": "standard",
      "filter": ["lowercase"],
      "text": ["AASD ASDA SDASD ASDASD"]
    }
    #Convert to uppercase
    GET test_index/_analyze
    {
      "tokenizer": "standard",
      "filter": ["uppercase"],
      "text": ["asdasd asd asg dsfg gfhjsdf asfdg g"]
    }
    #Convert to uppercase only the tokens shorter than 5 characters
    GET test_index/_analyze
    {
      "tokenizer": "standard",
      "filter": [
        {
          "type": "condition",
          "filter": ["uppercase"],
          "script": {
            "source": "token.getTerm().length() < 5"
          }
        }
      ],
      "text": ["asdasd asd asg dsfg gfhjsdf asfdg g"]
    }

Stop words

https://www.elastic.co/guide/en/elasticsearch/reference/7.10/analysis-stop-tokenfilter.html

    #Stop words
    # my_analyzer is a user-chosen name
    DELETE test_index
    PUT /test_index
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "my_analyzer": {
              "type": "standard",
              "stopwords": ["me", "you"]
            }
          }
        }
      }
    }
    GET test_index/_analyze
    {
      "analyzer": "my_analyzer",
      "text": ["Teacher me and you in the china"]
    }
    #####Returns: teacher, and, in, the, china

Official examples and the officially supported token filters:
https://www.elastic.co/guide/en/elasticsearch/reference/7.10/analysis-stop-tokenfilter.html

4 Tokenizer: splitting text into words

Default tokenizer: standard (splits on word boundaries, which for English mostly means whitespace and punctuation)
Chinese tokenizer: ik
https://www.elastic.co/guide/en/elasticsearch/reference/7.10/analysis-whitespace-tokenizer.html

Configuring built-in analyzers

Built-in analyzers can be used directly without any configuration. Their default settings can still be changed. For example, the standard analyzer can be configured with a stop word list:

    curl -X PUT "localhost:9200/my_index" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "std_english": {
              "type": "standard",
              "stopwords": "_english_"
            }
          }
        }
      },
      "mappings": {
        "properties": {
          "my_text": {
            "type": "text",
            "analyzer": "standard",
            "fields": {
              "english": {
                "type": "text",
                "analyzer": "std_english"
              }
            }
          }
        }
      }
    }'

In this example we define a std_english analyzer based on the standard analyzer and configure it to remove the predefined list of English stop words. The mapping then assigns the standard analyzer to the my_text field and the std_english analyzer to my_text.english. The two requests below therefore produce different results:

    curl -X POST "localhost:9200/my_index/_analyze" -H 'Content-Type: application/json' -d'
    {
      "field": "my_text",
      "text": "The old brown cow"
    }'
    curl -X POST "localhost:9200/my_index/_analyze" -H 'Content-Type: application/json' -d'
    {
      "field": "my_text.english",
      "text": "The old brown cow"
    }'

The first request uses the standard analyzer, so the result is:
[ the, old, brown, cow ]
The second request uses std_english, so the result is:
[ old, brown, cow ]

--------------------------Standard Analyzer (default)---------------------------

Unless another analyzer is specified, standard is the default analyzer. It provides grammar-based tokenization (based on the Unicode Text Segmentation algorithm) and works well for most languages. For example:

    curl -X POST "localhost:9200/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "standard",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

The text above produces the following terms:
[ the, 2, quick, brown, foxes, jumped, over, the, lazy, dog's, bone ]

-------------------Example 3---------------------

The standard analyzer accepts the following parameters:
max_token_length : the maximum token length; a token longer than this is split at max_token_length intervals. Defaults to 255
stopwords : a predefined stop word list such as _english_, or an array containing a list of stop words. Defaults to _none_
stopwords_path : the path to a file containing stop words

    curl -X PUT "localhost:9200/my_index" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "my_english_analyzer": {
              "type": "standard",
              "max_token_length": 5,
              "stopwords": "_english_"
            }
          }
        }
      }
    }'
    curl -X POST "localhost:9200/my_index/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "my_english_analyzer",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

This produces the following terms:
[ 2, quick, brown, foxes, jumpe, d, over, lazy, dog's, bone ]

---------------------Definition--------------------

The standard analyzer consists of two parts:
Tokenizer
    Standard Tokenizer
Token Filters
    Standard Token Filter (a no-op that was removed in Elasticsearch 7.0)
    Lower Case Token Filter
    Stop Token Filter (disabled by default)

You can also rebuild it as a custom analyzer:

    curl -X PUT "localhost:9200/standard_example" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "rebuilt_standard": {
              "tokenizer": "standard",
              "filter": ["lowercase"]
            }
          }
        }
      }
    }'

-------------------- Simple Analyzer---------------------------

The simple analyzer breaks text into terms whenever it encounters a character that is not a letter, and all terms are lowercased. For example:

    curl -X POST "localhost:9200/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "simple",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

The output is:
[ the, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]

5 Common tokenizers and analyzers:

standard analyzer: the default. Its support for Chinese is poor, because it splits Chinese text character by character.
keyword tokenizer: does no processing of the input text and emits the entire input as a single token.
pattern tokenizer: uses a regular expression to match separators and splits the text into terms at those matches.
simple pattern tokenizer: uses a regular expression to match the terms themselves; faster than the pattern tokenizer.
whitespace analyzer: splits on whitespace only (for example, Tim_cookie stays one token).

6 Custom analyzer: custom analyzer

char_filter: built-in or custom character filters.
token filter: built-in or custom token filters.
tokenizer: a built-in or custom tokenizer.

An analyzer consists of zero or more character filters, exactly one tokenizer, and zero or more token filters. Put simply, it takes a piece of text and outputs individual words.

    PUT custom_analysis
    {
      "settings": {
        "analysis": {
        }
      }
    }

The full example below fills in that skeleton in three steps:
Step 1, character filter (char_filter): receives the original text and can add, remove, or change characters, turning the string into one the later stages can recognize.
Step 2, tokenizer: defines the split points, cuts the text into individual tokens (words), and outputs a token stream. For example, a whitespace tokenizer converts "Quick brown fox!" into [Quick, brown, fox!]; my_tokenizer below instead splits on spaces, commas, periods, exclamation marks, and question marks.
Step 3, token filter (filter): receives the token stream (words, terms) and can add, remove, or change tokens. For example, the lowercase token filter converts every token to lowercase, the stop token filter removes common words, and the synonym token filter introduces synonyms into the token stream.
Setting "type" to "custom" tells Elasticsearch that the analyzer is assembled from these parts.

    #Custom analyzer
    DELETE custom_analysis
    PUT custom_analysis
    {
      "settings": {
        "analysis": {
          "char_filter": {
            "my_char_filter": {
              "type": "mapping",
              "mappings": [
                "& => and",
                "| => or"
              ]
            },
            "html_strip_char_filter": {
              "type": "html_strip",
              "escaped_tags": ["a"]
            }
          },
          "filter": {
            "my_stopword": {
              "type": "stop",
              "stopwords": ["is", "in", "the", "a", "at", "for"]
            }
          },
          "tokenizer": {
            "my_tokenizer": {
              "type": "pattern",
              "pattern": "[ ,.!?]"
            }
          },
          "analyzer": {
            "my_analyzer": {
              "type": "custom",
              "char_filter": ["my_char_filter", "html_strip_char_filter"],
              "filter": ["my_stopword", "lowercase"],
              "tokenizer": "my_tokenizer"
            }
          }
        }
      }
    }
    GET custom_analysis/_analyze
    {
      "analyzer": "my_analyzer",
      "text": ["What is ,as.df ssin ? & | is ! in the a at for "]
    }

------------------------------Custom example 2---------------------------------------------

The simple analyzer can be rebuilt as a custom analyzer from the lowercase tokenizer alone:

    curl -X PUT "localhost:9200/simple_example" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "rebuilt_simple": {
              "tokenizer": "lowercase",
              "filter": []
            }
          }
        }
      }
    }'

------------------------------Whitespace Analyzer------------------------------

The whitespace analyzer breaks text into terms whenever it encounters a whitespace character. Example:

    curl -X POST "localhost:9200/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "whitespace",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

The output is:
[ The, 2, QUICK, Brown-Foxes, jumped, over, the, lazy, dog's, bone. ]

------------------------------Stop Analyzer-----------------

The stop analyzer is very similar to the simple analyzer. The only difference is that the stop analyzer adds support for removing stop words. By default it uses the _english_ stop word list.

(PS: suppose we have the sentence "this is a apple" and suppose that "this" and "is" are stop words. The simple analyzer would output [ this, is, a, apple ], while the stop analyzer would output [ a, apple ]. That is the difference between the two: stop does not output stop words, meaning it does not treat a stop word as a term.)
(PS: in the output a stop word behaves somewhat like a separator, but strictly speaking it is a token that is removed after tokenization.)

    curl -X POST "localhost:9200/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "stop",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

The output is:
[ quick, brown, foxes, jumped, over, lazy, dog, s, bone ]

stop accepts the following parameters:
stopwords : a predefined stop word list (such as _english_) or an array containing stop words. Defaults to _english_
stopwords_path : the path to a file containing stop words, relative to the Elasticsearch config directory

    curl -X PUT "localhost:9200/my_index" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "my_stop_analyzer": {
              "type": "stop",
              "stopwords": ["the", "over"]
            }
          }
        }
      }
    }'

This configures a stop analyzer with two stop words: the and over.

    curl -X POST "localhost:9200/my_index/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "my_stop_analyzer",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

With this configuration, the request outputs:
[ quick, brown, foxes, jumped, lazy, dog, s, bone ]

------------------------------Pattern Analyzer------------------------------

    curl -X POST "localhost:9200/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "pattern",
      "text": "The 2 QUICK Brown-Foxes jumped over the lazy dog\u0027s bone."
    }'

Because the default is to split on non-word characters, the output is:
[ the, 2, quick, brown, foxes, jumped, over, the, lazy, dog, s, bone ]

The pattern analyzer accepts the following parameters:
pattern : a Java regular expression. Defaults to \W+
flags : Java regular expression flags, for example CASE_INSENSITIVE, COMMENTS
lowercase : whether to convert all terms to lowercase. Defaults to true
stopwords : a predefined stop word list, or an array containing stop words. Defaults to _none_
stopwords_path : the path to a stop word file

    curl -X PUT "localhost:9200/my_index" -H 'Content-Type: application/json' -d'
    {
      "settings": {
        "analysis": {
          "analyzer": {
            "my_email_analyzer": {
              "type": "pattern",
              "pattern": "\\W|_",
              "lowercase": true
            }
          }
        }
      }
    }'

The example above splits on non-word characters or underscores and lowercases the resulting terms.

    curl -X POST "localhost:9200/my_index/_analyze" -H 'Content-Type: application/json' -d'
    {
      "analyzer": "my_email_analyzer",
      "text": "John_Smith@foo-bar.com"
    }'

With this configuration, the example outputs:
[ john, smith, foo, bar, com ]

------------------------------Language Analyzers------------------------------

Language analyzers support text analysis for specific languages. The built-in (predefined) languages are: arabic, armenian, basque, bengali, brazilian, bulgarian, catalan, cjk, czech, danish, dutch, english, finnish, french, galician, german, greek, hindi, hungarian, indonesian, irish, italian, latvian, lithuanian, norwegian, persian, portuguese, romanian, russian, sorani, spanish, swedish, turkish, thai

7 Chinese analyzer: ik

Installation and deployment

ik download: https://github.com/medcl/elasticsearch-analysis-ik
GitHub accelerator: https://github.com/fhefh2015/Fast-GitHub
Create the plugin folder: cd your-es-root/plugins/ && mkdir ik
Unzip the plugin into the folder your-es-root/plugins/ik
Restart es

IK files

IKAnalyzer.cfg.xml: the IK configuration file
Main dictionary: main.dic
English stop words: stopword.dic; these words are not added to the inverted index
Special dictionaries:
    quantifier.dic: units of measurement and similar
    suffix.dic: administrative units
    surname.dic: common Chinese surnames
    preposition: interjections
Custom dictionaries: internet slang, trending words, coined words, and so on

The two analyzers provided by ik:

ik_max_word: splits text at the finest granularity. For example, it splits “中华人民共和国国歌” into “中华人民共和国,中华人民,中华,华人,人民共和国,人民,人,民,共和国,共和,和,国国,国歌”, exhausting every possible combination. Suited to Term Query.
ik_smart: splits text at the coarsest granularity. For example, it splits “中华人民共和国国歌” into “中华人民共和国,国歌”. Suited to Phrase queries.

Hot update

Remote dictionary file
Advantage: easy to get started.
Disadvantages:
    Dictionary management is inconvenient: every change means editing files on disk directly, and searching them is also cumbersome.
    File reads and writes have no dedicated optimization, so performance is poor.
    It adds one more layer of interface calls and network transfer.

ik access to a database

MySQL driver version compatibility:
https://dev.mysql.com/doc/connector-j/8.0/en/connector-j-versions.html
https://dev.mysql.com/doc/connector-j/5.1/en/connector-j-versions.html
Driver download:
https://mvnrepository.com/artifact/mysql/mysql-connector-java
Demo: download and installation
Extension dictionary: takes effect after restarting es

This article comes from cnblogs (博客园), author: 孙龙-程序员. When reposting, please cite the original link: https://www.cnblogs.com/sunlong88/p/17093708.html
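The three-stage flow behind the custom analyzer in section 6 (character filter, then tokenizer, then token filters in their listed order) can be imitated in plain Python. This is only a conceptual sketch, not Elasticsearch code. The function names are hypothetical, the HTML-strip step is omitted, and the stop filter is assumed to be case-sensitive.

```python
import re

# Hypothetical stand-ins for the three analyzer stages (plain Python, not Elasticsearch).
CHAR_MAPPINGS = {"&": "and", "|": "or"}             # mimics the "mapping" char filter
SPLIT_PATTERN = re.compile(r"[ ,.!?]")              # mimics the "pattern" tokenizer
STOPWORDS = {"is", "in", "the", "a", "at", "for"}   # mimics the "stop" token filter


def char_filter(text):
    """Character stream in, character stream out."""
    for old, new in CHAR_MAPPINGS.items():
        text = text.replace(old, new)
    return text


def tokenize(text):
    """Character stream in, list of tokens out; empty pieces are dropped."""
    return [piece for piece in SPLIT_PATTERN.split(text) if piece]


def stop_filter(tokens):
    """Drop tokens that exactly match a stop word (case-sensitive)."""
    return [t for t in tokens if t not in STOPWORDS]


def lowercase_filter(tokens):
    """Lowercase every token."""
    return [t.lower() for t in tokens]


def analyze(text, token_filters):
    """Run char filter, tokenizer, then each token filter in the given order."""
    tokens = tokenize(char_filter(text))
    for token_filter in token_filters:
        tokens = token_filter(tokens)
    return tokens


SAMPLE = "The fox & THE dog is in a box!"
stop_first = analyze(SAMPLE, [stop_filter, lowercase_filter])
lower_first = analyze(SAMPLE, [lowercase_filter, stop_filter])
print(stop_first)   # ['the', 'fox', 'and', 'the', 'dog', 'box']
print(lower_first)  # ['fox', 'and', 'dog', 'box']
```

The two calls differ only in filter order. With the stop filter first, as in the article's "filter": ["my_stopword", "lowercase"], capitalized forms such as "The" pass through the stop filter and are lowercased afterwards. Putting lowercase first removes them.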
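The masking rule from the Pattern Replace character filter example in section 2 can be tried outside Elasticsearch. The sketch below applies the same regular expression with Python's re module. The helper name mask_phone is hypothetical, and Python writes group references as \1 where Java writes $1.

```python
import re

# Same pattern as the article's pattern_replace char filter:
# keep the first 3 and last 4 digits, hide the 4 digits in between.
MASK_PATTERN = re.compile(r"(\d{3})\d{4}(\d{4})")


def mask_phone(text):
    """Replace the middle four digits of every 11-digit run with ****."""
    # Java's "$1****$2" corresponds to r"\1****\2" in Python.
    return MASK_PATTERN.sub(r"\1****\2", text)


masked = mask_phone("您的手机号是17611001200")
print(masked)  # 您的手机号是176****1200
```

Because a character filter runs before the tokenizer, the index only ever sees the masked text.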
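The difference between the simple and stop analyzers described above can be reproduced with a few lines of Python. This is an approximation under stated assumptions: a regular expression over runs of letters stands in for the real tokenizer, and the function names are hypothetical. The sample text and the stop words "the" and "over" come from the article.

```python
import re

# Runs of letters only: digits, underscores and punctuation act as separators.
LETTER_RUN = re.compile(r"[^\W\d_]+")


def simple_analyze(text):
    """Approximate the simple analyzer: split at non-letters, then lowercase."""
    return [term.lower() for term in LETTER_RUN.findall(text)]


def stop_analyze(text, stopwords):
    """Approximate the stop analyzer: simple analysis plus stop word removal."""
    return [term for term in simple_analyze(text) if term not in stopwords]


TEXT = "The 2 QUICK Brown-Foxes jumped over the lazy dog's bone."
simple_terms = simple_analyze(TEXT)
stop_terms = stop_analyze(TEXT, {"the", "over"})
print(simple_terms)
print(stop_terms)
```

On this sample text the two lists match the terms the article reports for the simple analyzer and for my_stop_analyzer.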