
Commercial use issue, about prithivirajdamodaran/gramformer

PrithivirajDamodaran commented on May 22, 2024

As per our email exchange, the pre-release model (prithivida/grammar_error_correcter) was trained on filtered WikiEdits data plus a slice of WI&Locness. At the time, WI&Locness was available as a HuggingFace dataset with no license; in fact, the license was marked as "unknown" (the attached screenshots show this). I already mentioned this in the email thread, and you replied that it was probably an unintentional omission by the people who uploaded the dataset to HuggingFace. So, to reiterate: I had no intention of undermining anyone's academic work or violating a valid license policy. I merely used the dataset based on the license info shown ("Unknown") at that point in time.

(I can see that you have/had them update the license info recently.)
After you pointed out the possibly missing license info on the HuggingFace page, I acknowledged it in the email (and also mentioned that I was already gathering more WikiEdits data to train the subsequent models). I then did the following: A.) explicitly called out on GitHub that the pre-release model is not intended for commercial usage, B.) did the same in the HuggingFace readme, and C.) trained a brand new model excluding WI&Locness. That is the _V1 model.
The _V1 model (prithivida/grammar_error_correcter_v1) is trained using WikiEdit pairs and other synthetic pairs (refer to the readme for details).
Your script reports that the pre-release and V1 models are identical, probably because of an inadvertent oversight on my side in picking the right checkpoints while uploading to the v1 tag.
I have now refreshed the v1 tag with the right checkpoint files and double-checked them. See below.
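
For anyone who wants to repeat this kind of check on their own copies, one simple approach is to compare file hashes of the two downloaded checkpoints. The following is only a minimal sketch and is not the script mentioned above: the helper names `sha256_of_file` and `checkpoints_identical` are hypothetical, and it assumes both checkpoint files have already been downloaded to local paths.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large checkpoint files are not loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def checkpoints_identical(path_a, path_b):
    """Return True if both files have the same size and the same SHA-256 digest."""
    path_a, path_b = Path(path_a), Path(path_b)
    # Different sizes already prove the files differ, so skip hashing.
    if path_a.stat().st_size != path_b.stat().st_size:
        return False
    return sha256_of_file(path_a) == sha256_of_file(path_b)
```

Calling `checkpoints_identical("pre_release/pytorch_model.bin", "v1/pytorch_model.bin")` (both paths are placeholders) returns `False` once the two tags point at different weights. A byte-level comparison like this only shows whether the files are the same; it says nothing about which data either model was trained on.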

Also, to avoid any future unintentional non-compliance by consumers of the package at versions <= v1.0, which use the pre-release model (prithivida/grammar_error_correcter), I can remove that model from HuggingFace.

Thanks
