Locked History Actions

Diff for "NKJPNGrams"

Differences between revisions 3 and 4
Revision 3 as of 2012-08-01 10:29:27
Size: 375
Comment:
Revision 4 as of 2012-08-01 10:29:43
Size: 354
Comment:
Deletions are marked like this. Additions are marked like this.
Line 2: Line 2:

== Description ==

N-grams from balanced National Corpus of Polish

The resource is a set of N-grams extracted from balanced National Corpus of Polish for N from 1 to 5. Each unigram is maximum continuous chunk of non-whitespace lower-case characters. The resource contains all unique N-grams followed by number of occurrencies.

Download