1 million Bulgarian words

Dimo Karaivanov 2024-01-31 12:34:50 +02:00 committed by GitHub
parent 6b31891fb6
commit 39a199cc7b
No known key found for this signature in database
GPG key ID: B5690EEEBB952194
2 changed files with 1131990 additions and 254957 deletions

@@ -8,6 +8,11 @@ Version: 9c91fe4
Source: https://github.com/michmech/lemmatization-lists/blob/master/lemmatization-bg.txt
License: https://github.com/michmech/lemmatization-lists/blob/master/LICENCE
Bulgarian wordlist 3 by chitanka
Source: https://rechnik.chitanka.info/about
Github: https://github.com/chitanka/rechko
License: Just "free download", so assuming public domain.
Also, used the wooorm's hunspell-compatible dictionary to determine which words need to start with a capital letter
Link: https://github.com/wooorm/dictionaries/tree/main/dictionaries/bg
Git commit: 13 Apr 2022 [0c78cc810c8aafb2e6f5140bb6dcd4026b247eb8]