Comments (11)
如果是通过analyzer自定义的分词器,确实是存在问题的,如果用tokenizer是ok的
from elasticsearch-analysis-ansj.
通过analyzer自定义的分词器的问题,近期会考虑修复
from elasticsearch-analysis-ansj.
如图配置自定义词典,dic_finance 和 dic_d3 均未生效,只有dic生效了,是使用姿势不对吗
from elasticsearch-analysis-ansj.
配置没毛病,未生效,是根本就没加载,还是刷新不起作用?
from elasticsearch-analysis-ansj.
配置没毛病,未生效,是根本就没加载,还是刷新不起作用?
下面是我 _cat/ansj/config 的结果:
{ "synonyms": [ "synonyms" ], "dic_d3": "jdbc://jdbc:mysql://xxxx:6103/auto_es?useUnicode=true&characterEncoding=utf-8&zeroDateTimeBehavior=convertToNull|bip|bip@bip|select word as name,nature,freq from auto_es.ansj_dict where status=1", "ambiguity": [ "ambiguity" ], "stop": [ "stop" ], "crf": [ "crf" ], "dic_finance": "/config/elasticsearch-analysis-ansj/dict/finance_words.dic", "isQuantifierRecognition": "true", "isRealName": "false", "isNameRecognition": "true", "isNumRecognition": "true", "dic": [ "dic_finance", "dic_d3", "dic" ] }
我觉得应该是加载了,但是最后的 "dic":["dic_finance","dic_d3","dic"] 这个没生效,dic应该默认使用的只有dic这个key指定的词库列表,"dic_finance","dic_d3"两个key对应的词库列表里的词没生效。
@shiyuan
from elasticsearch-analysis-ansj.
"dic_finance","dic_d3"这俩词典,在分词器里使用了么,mapping是如何配置的?
from elasticsearch-analysis-ansj.
"dic_finance","dic_d3"这俩词典,在分词器里使用了么,mapping是如何配置的?
analyzer 用的默认的 index_ansj,没有单独设置其他配置
from elasticsearch-analysis-ansj.
不配置的话,用的默认的配置,dic只会取key为dic的词典
from elasticsearch-analysis-ansj.
dic只会取key为dic的词典
好的,我再试一下,感谢支持!
from elasticsearch-analysis-ansj.
from elasticsearch-analysis-ansj.
可以的,自定义分词器,使用多个词典
{
...
"dic": "dic,dic_0",
"stop": "stop,stop_0",
"ambiguity": "ambiguity,ambiguity_0",
"synonyms": "synonyms,synonyms_0",
...
}
from elasticsearch-analysis-ansj.
Related Issues (20)
- ansj.cfg.yml HOT 2
- 配置中文停用词不生效 HOT 1
- 如何实现短语屏蔽功能 HOT 6
- 2.4.5版本中flush/dic接口奇怪现象 HOT 9
- 除修改config/ansj.cfg.yml添加自定义词典,还有其他方式添加自定义词典吗? HOT 8
- 怎么在插件中加载自己训练的crf模型 HOT 1
- 使用自定义停用词库后报错 HOT 1
- 请问后续能支持8.4.1版本吗?8.3.3版本安装不上 HOT 4
- 8.3.3版本的包各种报错 HOT 1
- 7.10.x版本支持 HOT 1
- 7.17.9版本支持 HOT 2
- 8.7.0版本配置完自定义词典后,分词报error HOT 2
- 如何热更新词语 HOT 1
- 8.7.1版本 _analyze 报错 HOT 2
- es8.8.2配置分词不生效 HOT 1
- 8.11.3版本mvn构建出错(类文件具有错误的版本 55.0, 应为 52.0) HOT 1
- index模式(type=index_ansj)不符合预期 HOT 2
- 希望提供8.13.2版本 HOT 1
- 如何动态配置自定义词库
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from elasticsearch-analysis-ansj.