No stemming for language 'zh' #675

Open
zhjygit opened this issue Apr 25, 2024 · 4 comments

zhjygit commented Apr 25, 2024

I ran the following command on Ubuntu 18:
./kiwix-serve --port 8080 wikipedia_zh_all_maxi_2024-01.zim
My version is kiwix-tools_linux-x86_64-3.7.0.
image
image
As shown above, when I type “北京” into the remote web client, no search suggestions related to "北京" appear and no valid results are returned.
Meanwhile, the kiwix-serve command prints "No stemming for language 'zh'".

However, when I do the same in kiwix-desktop, everything works fine.
Is this a bug in kiwix-serve?

Author

zhjygit commented Apr 25, 2024

image
As shown above, when I type “北京” in kiwix-desktop, many suggestions for "北京" appear.

Contributor

kelson42 commented May 4, 2024

@zhjygit It is definitely strange that you get different results on Kiwix Desktop and on Kiwix Server. But I suspect the core of the problem is that Wikipedia ZIM files are still made with an older version of libzim. The libzim version in Kiwix Desktop is a bit older than the one in Kiwix Server, which I guess explains the discrepancy. I would not invest too much time investigating this as long as MWoffliner is not fixed.


tumuyan commented May 5, 2024

I have a similar error, but it may be due to different reasons.

I ran the command .\kiwix-search.exe .\wikipedia_zh_computer_nopic_2024-04.zim "computer", and it returned the following results (along with the error message):


No stemming for language 'zh'
Computer Modern
IEEE電腦先鋒獎
計算機系統製造商列表
iSCSI
计算 (计算机科学)
计算机辅助设计
计算机系统结构
次超级计算机
雷明頓蘭德公司
计算机科学

This is a Chinese ZIM file and it contains Chinese entries. But when I run .\kiwix-search.exe .\wikipedia_zh_computer_nopic_2024-04.zim "计算", it only prints No stemming for language 'zh' and returns no results.
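
To compare runs like the two above across queries or versions, it may help to separate the warning line from the actual result titles. The following is a minimal Python sketch, assuming kiwix-search prints the warning on a line of its own and one title per line, as in the output pasted above. The helper name `split_search_output` is made up for illustration and is not part of kiwix-tools.

```python
WARNING_PREFIX = "No stemming for language"


def split_search_output(text):
    """Split captured kiwix-search output into (warnings, titles)."""
    warnings, titles = [], []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(WARNING_PREFIX):
            warnings.append(line)
        else:
            titles.append(line)
    return warnings, titles


# Shortened copy of the output pasted above for the query "computer".
sample = """No stemming for language 'zh'
Computer Modern
iSCSI
计算机科学
"""

warnings, titles = split_search_output(sample)
print(len(warnings), len(titles))  # prints: 1 3
```

With this, an output that contains only the warning yields an empty title list, which is the failing case described for "计算".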


icesunx commented Nov 24, 2024

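I tried this out: version 3.5 and all earlier versions work normally, although they also print No stemming for language 'zh'. Starting with 3.6, searching for a single Chinese character still returns results, but a query longer than one character does not work properly. This is probably caused by some change introduced in 3.6.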
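
The single-character versus multi-character behaviour described above could be narrowed down by searching successively longer prefixes of one term and noting where results stop. Below is a minimal Python sketch, assuming the search is wrapped in a callable that returns a list of titles. `query_prefixes`, `first_failing_length`, and the stand-in `fake_search` are hypothetical names; `fake_search` only mimics the behaviour reported for 3.6 and does not call kiwix-search.

```python
def query_prefixes(term):
    """Return the prefixes of term, from one character up to the full term."""
    return [term[:n] for n in range(1, len(term) + 1)]


def first_failing_length(term, search):
    """Return the shortest prefix length that yields no results, or None."""
    for prefix in query_prefixes(term):
        if not search(prefix):
            return len(prefix)
    return None


def fake_search(query):
    # Stand-in for a real search call (assumption, not kiwix-tools code):
    # one character finds something, longer queries find nothing.
    return ["计算机科学"] if len(query) == 1 else []


print(query_prefixes("计算机"))                     # ['计', '计算', '计算机']
print(first_failing_length("计算机", fake_search))  # 2
```

In a real run, `search` would invoke kiwix-search with the ZIM file and the prefix, as in the commands earlier in this thread, once per installed version.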
