* Added Hindi language
* Removed the hardcoded special characters from language validation. Now reading them from the .yml
* improved method of hiding the letters on 0 and 1, when needed
* virtual keypad adjustments
* improved the single-letter validation during build time
* improved Devanagari validation script
* improved sorting when filters are on
17 lines
1.3 KiB
Text
Hindi word list 1 by: FreeDict
Version: 2017-12-02
Sources: http://freedict.org/, http://www.iiit.net/ltrc/Dictionaries/Dict_Frame.html
License: GPL

Conjunct consonants list and some more common words obtained from Wikipedia and Wiktionary
Version: 2024-12-05
Sources: https://en.wiktionary.org/wiki/Appendix:Common_Hindi_words, https://en.wikipedia.org/wiki/Devanagari_conjuncts
License: Creative Commons Attribution-ShareAlike 4.0 License

Hindi and Sanskrit word list and frequencies by: CC-100
Version: 2020
Source: https://data.statmt.org/cc-100/
References (PDF links are available in the source URL):
- Unsupervised Cross-lingual Representation Learning at Scale, Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL), p. 8440-8451, July 2020.
- CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data, Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave, Proceedings of the 12th Language Resources and Evaluation Conference (LREC), p. 4003-4012, May 2020.
Remark: Only the words that appear 3 times or more in each list were used.