author%3A%22Barry%20Krissoff%22 | Pollux - Fachinformationsdienst Politikwissenschaft

In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words). Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.

Zugriff(Open Access)

BASE

Exportieren

Open Access#52014

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Kapočiūtė-Dzikienė, Jurgita; Šarkutė, Ligita; Utka, Andrius

In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words). Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.

Zugriff(Open Access)

BASE

Exportieren

Open Access#62014

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Kapočiūtė-Dzikienė, Jurgita; Šarkutė, Ligita; Utka, Andrius

In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words). Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.

Zugriff(Open Access)

BASE

Exportieren

Open Access#72014

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Kapočiūtė-Dzikienė, Jurgita; Šarkutė, Ligita; Utka, Andrius

In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words). Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.

Zugriff(Open Access)

BASE

Exportieren

Open Access#82014

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Kapočiūtė-Dzikienė, Jurgita; Šarkutė, Ligita; Utka, Andrius

In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words). Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.

Zugriff(Open Access)

BASE

Exportieren

Buch(gedruckt)#91999

Tarptautinė Mokslinė Konferencija Energetikos Decentralizavimas: Miestu̜ Energetikos Ateitis: 1999 04 22 - 24 d. Klaipėda

Paulauskas, S.; Tarptautinė Mokslinė Konferencija Energetikos Decentralizavimas: Miestu̜ Energetikos Ateitis

Verfügbarkeit

Verfügbarkeit an Ihrem Standort wird überprüft

Dieses Buch ist auch in Ihrer Bibliothek verfügbar:

Exportieren

Buch(gedruckt)#102003

Darbo teisė: [oficialiu̜ dokumentu̜ tekstai su pakeitimais ir papildymais iki 2003 m. gruodžio 22 d.]

Mirončikienė, Eglė

Verfügbarkeit

Verfügbarkeit an Ihrem Standort wird überprüft

Dieses Buch ist auch in Ihrer Bibliothek verfügbar:

Exportieren

Buch(gedruckt)#112008

Tremtis prie Manos upės: skiriama 1948-u̜j̜u̜ gegužės 22-osios Didžiosios lietuviu̜ tremties atminimui ; paroda "Tas nelaimingas Sibiras ...", 2007 m. birželio 14 - 20 d

In: Lietuvos Nacionalinio Muziejaus biblioteka 19

Genovaitė Nacickaitė, Vida; Paroda "Tas Nelaimingas Sibiras ..."; Lietuvos Nacionalinis Muziejus

Verfügbarkeit

Verfügbarkeit an Ihrem Standort wird überprüft

Dieses Buch ist auch in Ihrer Bibliothek verfügbar:

Exportieren

Open Access#122020

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Steponavičiūtė, Ramunė

This article analyses one element of corpus delicti of misappropriation of authorship, criminalised in Lithuanian Criminal Code Article 191 – the object (or the protected good) of a crime. The quality of Lithuanian national regulation and the scope of object of misappropriation of authorship, which affects the qualification of the crime, is evaluated by comparing it with other European Union countries' criminal legal regulation of intellectual property.

Zugriff(Open Access)

BASE

Exportieren

Open Access#132020

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Steponavičiūtė, Ramunė

This article analyses one element of corpus delicti of misappropriation of authorship, criminalised in Lithuanian Criminal Code Article 191 – the object (or the protected good) of a crime. The quality of Lithuanian national regulation and the scope of object of misappropriation of authorship, which affects the qualification of the crime, is evaluated by comparing it with other European Union countries' criminal legal regulation of intellectual property.

Zugriff(Open Access)

BASE

Exportieren

Open Access#142020

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Steponavičiūtė, Ramunė

This article analyses one element of corpus delicti of misappropriation of authorship, criminalised in Lithuanian Criminal Code Article 191 – the object (or the protected good) of a crime. The quality of Lithuanian national regulation and the scope of object of misappropriation of authorship, which affects the qualification of the crime, is evaluated by comparing it with other European Union countries' criminal legal regulation of intellectual property.

Zugriff(Open Access)

BASE

Exportieren

Open Access#152020

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Steponavičiūtė, Ramunė

This article analyses one element of corpus delicti of misappropriation of authorship, criminalised in Lithuanian Criminal Code Article 191 – the object (or the protected good) of a crime. The quality of Lithuanian national regulation and the scope of object of misappropriation of authorship, which affects the qualification of the crime, is evaluated by comparing it with other European Union countries' criminal legal regulation of intellectual property.

Zugriff(Open Access)

BASE

Exportieren

Filter

Format

Medientyp

Sprache

Weitere Sprachen

Jahre

Lietuvos laikinoji vyriausybė: (1941 06 22 - 08 05) ; monografija

Lietuviu̜ tautos sukilimas: 1941 m. birželio 22 - 28 d

"Molotovo-Ribentropo pakto pasekmės": 1997 m. rugpjūčio 22 - 23 d. konferencija Marijampolėje ; medžiaga

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams ; Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Tarptautinė Mokslinė Konferencija Energetikos Decentralizavimas: Miestu̜ Energetikos Ateitis: 1999 04 22 - 24 d. Klaipėda

Darbo teisė: [oficialiu̜ dokumentu̜ tekstai su pakeitimais ir papildymais iki 2003 m. gruodžio 22 d.]

Tremtis prie Manos upės: skiriama 1948-u̜j̜u̜ gegužės 22-osios Didžiosios lietuviu̜ tremties atminimui ; paroda "Tas nelaimingas Sibiras ...", 2007 m. birželio 14 - 20 d

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Autorystės pasisavinimo nusikaltimo objektas ir jo reikšmė veikai kvalifikuoti ; Object of misappropriation in an authorship crime and its meaning for qualification

Suchergebnisse

Filter

Format

Medientyp

Sprache

Weitere Sprachen

Jahre

Kontakt

Hilfe