<?xml version="1.0" encoding="utf-8"?><!DOCTYPE article  PUBLIC '-//OASIS//DTD DocBook XML V4.4//EN'  'http://www.docbook.org/xml/4.4/docbookx.dtd'><article><articleinfo><title>TreebankWydzwieku</title><revhistory><revision><revnumber>20</revnumber><date>2018-08-13 14:22:59</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>19</revnumber><date>2018-08-13 14:22:19</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>18</revnumber><date>2018-08-13 14:21:24</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>17</revnumber><date>2018-08-11 18:36:19</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>16</revnumber><date>2018-08-11 18:35:43</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>15</revnumber><date>2018-08-11 18:27:35</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>14</revnumber><date>2018-08-11 18:25:52</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>13</revnumber><date>2018-08-11 18:24:06</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>12</revnumber><date>2018-08-11 18:23:25</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>11</revnumber><date>2018-08-11 18:22:18</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>10</revnumber><date>2018-08-11 18:21:48</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>9</revnumber><date>2018-08-11 18:21:13</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>8</revnumber><date>2018-08-11 18:20:48</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>7</revnumber><date>2018-08-11 18:20:02</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>6</revnumber><date>2017-03-11 16:47:46</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>5</revnumber><date>2017-03-11 16:45:12</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>4</revnumber><date>2017-03-11 16:13:16</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>3</revnumber><date>2017-03-11 16:12:41</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>2</revnumber><date>2017-03-11 16:12:07</date><authorinitials>AleksanderWawer</authorinitials></revision><revision><revnumber>1</revnumber><date>2017-03-11 16:11:25</date><authorinitials>AleksanderWawer</authorinitials></revision></revhistory></articleinfo><section><title>This is the home page of the Polish Sentiment Treebank</title><para>The dataset is a dependency treebank with sentiment annotations. It was parsed using the Polish dependency parser models available from <ulink url="http://zil.ipipan.waw.pl/PolishDependencyParser"/>. </para><para>For each sentence in the treebank, sentiment of each sub-phrase (sub-tree) has been assigned by a linguist. Sentiment of each leaf word has been labelled according to Polish sentiment dictionary, partially also verified manually. </para><para>Sentiment labels of both phrases and leaves include three classes: neutral, positive and negative. </para><para>Sentiment annotations for each token corresponds to the overall sentiment of the whole phrase under it and inclusive. Specifically: </para><itemizedlist><listitem><para>for every leaf token or word, its sentiment corresponds to this word or token's sentiment </para></listitem><listitem><para>for every non-leaf token or word (node that has non-empty set of children) sentiment field describes the sentiment of the whole phrase, formed by sub-tree starting at this token (that includes this token and all tokens below it) </para></listitem></itemizedlist><para>The treebank has been created specifically for the purpose of analysing compositional sentiment effects in Polish language. </para><section><title>Treebank Wydzwieku: version 1.0</title><itemizedlist><listitem><para>The first part of the treebank is composed from the sub-part of Skladnica treebank, namely from sentences that contain at least one sentiment-bearing word. This part consists of 235 sentences (1915 sentiment-annotated multiword phrases).  </para></listitem><listitem><para>The second part consits of 965 sentences from a product review corpus available from <ulink url="http://zil.ipipan.waw.pl/OPTA/"/>. The number of sentiment-annotated multiword phrases is 4640. </para></listitem></itemizedlist><para>Together, the first version of the treebank consisted of 6555 sentiment-annotated phrases from the parse trees of 1200 sentences. The resource described is the first freely available corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of sentiment in the Polish language. </para></section><section><title>Treebank Wydzwieku: version 2.0</title><para>In August 2018, as we have added many sentences, we have released a 2.0 version of the treebank! It contains following new parts: </para><itemizedlist><listitem><para>test sentences from <ulink url="http://2017.poleval.pl">PolEval 2017</ulink> sentiment task </para></listitem><listitem><para>2 x 500 sentences collected from various sources on the web, mostly difficult, mixed sentiments and negative </para></listitem></itemizedlist></section><section><title>Download</title><para>Releases: </para><itemizedlist><listitem><para><ulink url="http://zil.ipipan.waw.pl/TreebankWydzwieku?action=AttachFile&amp;do=get&amp;target=TreebankWydzwieku01.zip">version 1.0</ulink> </para></listitem><listitem><para><ulink url="http://zil.ipipan.waw.pl/TreebankWydzwieku?action=AttachFile&amp;do=get&amp;target=TreebankWydzwieku2.0.tar.gz">version 2.0 [new!]</ulink> </para></listitem></itemizedlist></section><section><title>Have questions or ideas?</title><para>Please contact me: axw at ipipan dot ... </para></section></section></article>