Czech (#507)
--------- Co-authored-by: Lukáš Calda <lukas.calda@gmail.com>
This commit is contained in:
parent
a057e2a4e6
commit
83a94ab30b
3 changed files with 763668 additions and 0 deletions
13
app/languages/definitions/Czech.yml
Normal file
13
app/languages/definitions/Czech.yml
Normal file
|
|
@ -0,0 +1,13 @@
|
||||||
|
locale: cs-CZ
|
||||||
|
dictionaryFile: cz-utf8.csv
|
||||||
|
layout:
|
||||||
|
- [SPECIAL] # 0
|
||||||
|
- [PUNCTUATION_DE] # 1
|
||||||
|
- [a, b, c, á, č] # 2
|
||||||
|
- [d, e, f, é, ě, ď] # 3
|
||||||
|
- [g, h, i, í] # 4
|
||||||
|
- [j, k, l] # 5
|
||||||
|
- [m, n, o, ó, ň] # 6
|
||||||
|
- [p, q, r, s, ř, š] # 7
|
||||||
|
- [t, u, v, ť, ů, ú] # 8
|
||||||
|
- [w, x, y, z, ý, ž] # 9
|
||||||
763590
app/languages/dictionaries/cz-utf8.csv
Normal file
763590
app/languages/dictionaries/cz-utf8.csv
Normal file
File diff suppressed because it is too large
Load diff
65
docs/dictionaries/czWordlistReadme.txt
Normal file
65
docs/dictionaries/czWordlistReadme.txt
Normal file
|
|
@ -0,0 +1,65 @@
|
||||||
|
Czech wordlist 1 by svobodneslovniky
|
||||||
|
Version: cc65de64b640c8fbef605034869fbcdb8437a916 (2018-11-12)
|
||||||
|
Source: https://github.com/L10Ncz/svobodneslovniky/tree/master/en-cs
|
||||||
|
License: (GNU) https://github.com/L10Ncz/svobodneslovniky/blob/master/en-cs/LICENCE
|
||||||
|
|
||||||
|
Czech wordlist 2 compiled by Lukas Calda <lukas.calda@gmail.com>
|
||||||
|
Source: https://github.com/sspanak/tt9/pull/379/files
|
||||||
|
|
||||||
|
Czech wordlist 3:
|
||||||
|
Source: https://www.winedt.org/dict.html
|
||||||
|
License and more info see below:
|
||||||
|
|
||||||
|
Word frequencies from LatinIME dictionaries:
|
||||||
|
Source: https://android.googlesource.com/platform/packages/inputmethods/LatinIME
|
||||||
|
Version: 66093bf509ea92fa31d796326d5f30a8d9582ffe (2023-12-21)
|
||||||
|
License: https://android.googlesource.com/platform/packages/inputmethods/LatinIME/+/refs/heads/main/NOTICE
|
||||||
|
|
||||||
|
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
|
||||||
|
Wordlist 3 readme:
|
||||||
|
|
||||||
|
Soubor czech.dic obsahuje cca. 390000 slov. Ceska pismena jsou v kodovani
|
||||||
|
CP1250. Tak jako vsechny slovniky pro kontrolu pravopisu je neuplny a temer
|
||||||
|
jiste obsahuje nejake chyby, i kdyz by jich melo byt poskrovnu. V beznem,
|
||||||
|
neprilis odbornem textu je podil slov, ktera slovnik neobsahuje, ale
|
||||||
|
obsahovat by mel, vetsinou nizsi nez 1%. Pravidla, podle kterych slovnik byl
|
||||||
|
sestaven, jsou zhruba tato:
|
||||||
|
|
||||||
|
Slovnik
|
||||||
|
|
||||||
|
a) MA OBSAHOVAT:
|
||||||
|
|
||||||
|
-ceska slova vcetne cizich a odbornych vyrazu (napr. kytara, orbital, genom)
|
||||||
|
-krestni jmena (napr. Adela, Petruv, Ivane)
|
||||||
|
-oznaceni prislusniku narodu a obyvatelu kontinentu (Rumun, Africanka)
|
||||||
|
-nazvy ceskych mest, odvozenych oblasti a vetsich mest jinych statu (Plzen,
|
||||||
|
karlovarsky, Kutnohorsko, Berlin)
|
||||||
|
-nazvy statu a odvozena pridavna jmena (Turecko, cinsti)
|
||||||
|
-jmena nejdulezitejsich rek a pohori (Alpy, Ryn)
|
||||||
|
-nektere bezne zkratky
|
||||||
|
-oznaceni fyzikalnich velicin (km, cm, MHz)
|
||||||
|
-prednost je davana tvarum se z (univerzita) a dlouhym tvarum (perife'rie),
|
||||||
|
jsou-li pripustne
|
||||||
|
_________________
|
||||||
|
b) NEMA OBSAHOVAT:
|
||||||
|
|
||||||
|
-prijmeni (Dvorak, Novakova)
|
||||||
|
-obchodni znacky a nazvy firem (Mitsubishi, Canon)
|
||||||
|
-prechodniky (vypivsi, hledic)- pouzivaji se velmi malo, takze by zbytecne
|
||||||
|
zabiraly mnoho pameti
|
||||||
|
-hovorove vyrazy (jobovka, makat)
|
||||||
|
-tvary, ktere sice lze vytvorit, ale nemaji smysl (vytekali, poucni)
|
||||||
|
-infinitivy sloves koncici na -ti (az na vyjimky- casto uzivane tvary)
|
||||||
|
|
||||||
|
Slova, v nichz se vzdy pise velke pismeno, jsou ve slovniku s velkym
|
||||||
|
pismenem. Pokud se mohou psat i s malym pismenem, obsahuje slovnik variantu
|
||||||
|
s malym pismenem. Napriklad ve slovniku je: Tomas, Francie, avsak turecko
|
||||||
|
(turecko-recke spory) nebo polak (krome obyvatele Polska je to take druh
|
||||||
|
kachny).
|
||||||
|
|
||||||
|
Pokud pri pouzivani slovniku narazite na chyby nebo chybejici slova, muzete je
|
||||||
|
zaslat e-mailem na adresu: winedt.czdic@volny.cz . Casem budou nova slova do
|
||||||
|
slovniku pridana a chyby opraveny, takze postupne by mel byt cim dal tim
|
||||||
|
dokonalejsi.
|
||||||
|
|
||||||
|
Slovnik muzete pouzivat bez veskerych zaruk a omezeni.
|
||||||
Loading…
Add table
Add a link
Reference in a new issue