1
0
Fork 0
tt9/docs/dictionaries/bgWordlistReadme.txt
Dimo Karaivanov 2510aba58a
Csv dictionary support (#145)
* the dictionary loader now supports word frequencies

* word frequency validation upon building

* added default word frequencies to all dictionaries

* updated documentation
2023-01-26 11:47:34 +02:00

13 lines
No EOL
773 B
Text

Bulgarian wordlist by: Miglen Georgiev
Version: f46eff1 (2022-04-26)
Source: https://github.com/miglen/bulgarian-wordlists/blob/master/wordlists/bg-words-validated-cyrillic.txt
License: https://github.com/miglen/bulgarian-wordlists/blob/master/LICENSE
Additionally cleaned up repeating words and added some missing ones.
Also, used the wooorm's hunspell-compatible dictionary to determine which words need to start with a capital letter
Link: https://github.com/wooorm/dictionaries/tree/main/dictionaries/bg
Git commit: 13 Apr 2022 [0c78cc810c8aafb2e6f5140bb6dcd4026b247eb8]
Word frequencies obtained from the "General" word frequency dictionary by the Department of Computational Linguistics of the Bulgarian Academy of Sciences.
Link: https://dcl.bas.bg/frequency.html