Skip to content

Latest commit

 

History

History
50 lines (43 loc) · 4.52 KB

README.md

File metadata and controls

50 lines (43 loc) · 4.52 KB

PL-Wiktionary-To-Dictionary

Parses Polish wiktionary and creates simple dictionaries of foreign languages (e.g. English) to Polish and vice versa.

Run:

python PLWiktionaryToDict.py <path to Polish wiktionary dump>
Main code is saving defined languages creating dictionary files form Polish and to Polish. e.g. python PLWiktionaryToDict.py plwiktionary-latest-pages-meta-current.xml

Last version of wiktionary dump can be downloaded from http://dumps.wikimedia.org/plwiktionary/latest/plwiktionary-latest-pages-meta-current.xml.bz2.

Dictionaries:

In dictionaries folder you can find some (in UTF-8) from 20120806 dump:

To Polish:

From Polish:

License: http://creativecommons.org/licenses/by-sa/3.0/deed