|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectutils.CorpusManager
public class CorpusManager
Helper for corpus in tei format
| Constructor Summary | |
|---|---|
CorpusManager()
|
|
| Method Summary | |
|---|---|
static corpusapi.tei.TEICorpus |
getCorpusFromConfigFile(java.lang.String configFilePath)
Loads corpus, given corpus config file path |
static void |
getSampleFromCorpus(corpusapi.tei.TEICorpus c,
int textCount,
java.lang.String targetPath)
Samples corpus for a number of texts and saves them in a given directory |
static AnnotationStats |
getSenseStatisticsForCorpus(corpusapi.Corpus corpus,
corpusapi.tei.TEISenseInventory dict)
Calculates gold standard annotation in corpus |
static corpusapi.tei.TEICorpus |
getWypluwkaForDevelopment()
Gets development part of wypluwka |
static corpusapi.tei.TEICorpus |
getWypluwkaForFinalEvaluation()
Gets final evaluation part of wypluwka |
static void |
printCorpusStats(corpusapi.tei.TEICorpus corpus)
Prints some statistics about the corpus |
static void |
splitCorpus(corpusapi.tei.TEICorpus c,
float proportion,
java.lang.String targetPath1,
java.lang.String targetPath2)
Splits corpus into two |
| Methods inherited from class java.lang.Object |
|---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public CorpusManager()
| Method Detail |
|---|
public static corpusapi.tei.TEICorpus getCorpusFromConfigFile(java.lang.String configFilePath)
throws java.lang.Exception
configFilePath -
java.lang.Exception
public static corpusapi.tei.TEICorpus getWypluwkaForFinalEvaluation()
throws java.lang.Exception
java.lang.Exception
public static corpusapi.tei.TEICorpus getWypluwkaForDevelopment()
throws java.lang.Exception
java.lang.Exception
public static void getSampleFromCorpus(corpusapi.tei.TEICorpus c,
int textCount,
java.lang.String targetPath)
c - corpustextCount - number of texts to choosetargetPath - path to save texts
public static void splitCorpus(corpusapi.tei.TEICorpus c,
float proportion,
java.lang.String targetPath1,
java.lang.String targetPath2)
c - corpusproportion - should be between 0 and 1targetPath1 - path to save first parttargetPath2 - path to save second part
public static AnnotationStats getSenseStatisticsForCorpus(corpusapi.Corpus corpus,
corpusapi.tei.TEISenseInventory dict)
corpus - dict - dictionary of senses
public static void printCorpusStats(corpusapi.tei.TEICorpus corpus)
corpus -
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||