Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

非对齐文本语法纠错怎么做? #285

Closed
zhoukaiwei66 opened this issue Jun 16, 2022 · 6 comments
Closed

非对齐文本语法纠错怎么做? #285

zhoukaiwei66 opened this issue Jun 16, 2022 · 6 comments
Labels
question Further information is requested wontfix This will not be worked on

Comments

@zhoukaiwei66
Copy link

您好,对于文本非对齐任务,多字少字问题,我试了生成式unilm模型和序列标注的gector模型 ,效果都不好,请问有什么好一点的解决办法么或者处理这类问题的思路么?谢谢您!!

@zhoukaiwei66 zhoukaiwei66 added the question Further information is requested label Jun 16, 2022
@shibing624
Copy link
Owner

T5

@ImXunan
Copy link

ImXunan commented Nov 4, 2022

您好,我想问一下您对于文本不对齐的情况是如何计算PRF的?

@zhoukaiwei66
Copy link
Author

您好,我想问一下您对于文本不对齐的情况是如何计算PRF的?

您好,我想问一下您对于文本不对齐的情况是如何计算PRF的?

你好,我看的文本语法纠错 gector 是基于序列标注进行的,所以PRF和正常的计算方式一样,github有源码,如果是文本生成或者摘要这样的任务,建议看一下BLEU,ROUGE,METEOR,一般用这些方式作为评价指标

@shibing624 shibing624 changed the title 语法错误 非对齐文本语法纠错怎么做? Nov 4, 2022
@ImXunan
Copy link

ImXunan commented Nov 4, 2022

您好,目前我看的开源的代码基本上是英文的语法纠错,请问您找到什么关于中文语法纠错的开源代码,可以推荐一下吗?

@zhoukaiwei66
Copy link
Author

您好,目前我看的开源的代码基本上是英文的语法纠错,请问您找到什么关于中文语法纠错的开源代码,可以推荐一下吗?

看一下gector, t5, seq2seq,这些都是我看的别人的纠错方案里面常用的,https://github.com/HillZhang1999/MuCGEC,看一下这个,作者是ctc2021的冠军

@stale
Copy link

stale bot commented Apr 16, 2023

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.(由于长期不活动,机器人自动关闭此问题,如果需要欢迎提问)

@stale stale bot added the wontfix This will not be worked on label Apr 16, 2023
@stale stale bot closed this as completed Jun 2, 2023
@shibing624 shibing624 pinned this issue Aug 8, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested wontfix This will not be worked on
Projects
None yet
Development

No branches or pull requests

3 participants