ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the corpora are marked as belonging to the COVID-19 period (after October 2019), or being "reference" (before that date). The corpora have extensive metadata, including aspects of the parliament; the speakers (name, gender, MP status, party affiliation, party coalition/opposition); are structured into time-stamped terms, sessions and meetings; with speeches being marked by the speaker and their role (e.g. chair, regular speaker). The speeches also contain marked-up transcriber comments, such as gaps in the transcription, interruptions, applause, etc. Note that some corpora have further information, e.g. the year of birth of the speakers, links to their Wikipedia articles, their membership in various committees, etc. The corpora are encoded according to the Parla-CLARIN TEI recommendation (https://clarin-eric.github.io/parla-clarin/), but have been validated against the compatible, but much stricter ParlaMint schemas. This entry contains the ParlaMint TEI-encoded corpora with the derived plain text version of the corpus along with TSV metadata on the speeches. Also included is the 2.0 release of the data and scripts available at the GitHub repository of the ParlaMint project. Note that there also exists the linguistically marked-up version of the corpus, which is available at http://hdl.handle.net/11356/1405.
ParlaMint is a multilingual set of comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the corpora are marked as belonging to the COVID-19 period (after October 2019), or being "reference" (before that date). The corpora have extensive metadata, including aspects of the parliament; the speakers (name, gender, MP status, party affiliation, party coalition/opposition); are structured into time-stamped terms, sessions and meetings; with speeches being marked by the speaker and their role (e.g. chair, regular speaker). The speeches also contain marked-up transcriber comments, such as gaps in the transcription, interruptions, applause, etc. Note that some corpora have further information, e.g. the year of birth of the speakers, links to their Wikipedia articles, their membership in various committees, etc. The corpora are encoded according to the Parla-CLARIN TEI recommendation (https://clarin-eric.github.io/parla-clarin/), but have been validated against the compatible, but much stricter ParlaMint schemas. This entry contains the linguistically marked-up version of the corpus, while the text version is available at http://hdl.handle.net/11356/1388. The ParlaMint.ana linguistic annotation includes tokenization, sentence segmentation, lemmatisation, Universal Dependencies part-of-speech, morphological features, and syntactic dependencies, and the 4-class CoNLL-2003 named entities. Some corpora also have further linguistic annotations, such as PoS tagging or named entities according to language-specific schemes, with their corpus TEI headers giving further details on the annotation vocabularies and tools. The compressed files include the ParlaMint.ana XML TEI-encoded linguistically annotated corpus; the derived corpus in CoNLL-U with TSV speech metadata; and the vertical files (with registry file), suitable for use with CQP-based concordancers, such as CWB, noSketch Engine or KonText. Also included is the 2.0 release of the data and scripts available at the GitHub repository of the ParlaMint project.
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the corpora are marked as belonging to the COVID-19 period (after November 1st 2019), or being "reference" (before that date). The corpora have extensive metadata, including aspects of the parliament; the speakers (name, gender, MP status, party affiliation, party coalition/opposition); are structured into time-stamped terms, sessions and meetings; with speeches being marked by the speaker and their role (e.g. chair, regular speaker). The speeches also contain marked-up transcriber comments, such as gaps in the transcription, interruptions, applause, etc. Note that some corpora have further information, e.g. the year of birth of the speakers, links to their Wikipedia articles, their membership in various committees, etc. The corpora are encoded according to the Parla-CLARIN TEI recommendation (https://clarin-eric.github.io/parla-clarin/), but have been validated against the compatible, but much stricter ParlaMint schemas. This entry contains the ParlaMint TEI-encoded corpora with the derived plain text version of the corpus along with TSV metadata on the speeches. Also included is the 2.0 release of the data and scripts available at the GitHub repository of the ParlaMint project. Note that there also exists the linguistically marked-up version of the corpus, which is available at http://hdl.handle.net/11356/1431.
ParlaMint 2.1 is a multilingual set of 17 comparable corpora containing parliamentary debates mostly starting in 2015 and extending to mid-2020, with each corpus being about 20 million words in size. The sessions in the corpora are marked as belonging to the COVID-19 period (from November 1st 2019), or being "reference" (before that date). The corpora have extensive metadata, including aspects of the parliament; the speakers (name, gender, MP status, party affiliation, party coalition/opposition); are structured into time-stamped terms, sessions and meetings; with speeches being marked by the speaker and their role (e.g. chair, regular speaker). The speeches also contain marked-up transcriber comments, such as gaps in the transcription, interruptions, applause, etc. Note that some corpora have further information, e.g. the year of birth of the speakers, links to their Wikipedia articles, their membership in various committees, etc. The corpora are encoded according to the Parla-CLARIN TEI recommendation (https://clarin-eric.github.io/parla-clarin/), but have been validated against the compatible, but much stricter ParlaMint schemas. This entry contains the linguistically marked-up version of the corpus, while the text version is available at http://hdl.handle.net/11356/1432. The ParlaMint.ana linguistic annotation includes tokenization, sentence segmentation, lemmatisation, Universal Dependencies part-of-speech, morphological features, and syntactic dependencies, and the 4-class CoNLL-2003 named entities. Some corpora also have further linguistic annotations, such as PoS tagging or named entities according to language-specific schemes, with their corpus TEI headers giving further details on the annotation vocabularies and tools. The compressed files include the ParlaMint.ana XML TEI-encoded linguistically annotated corpus; the derived corpus in CoNLL-U with TSV speech metadata; and the vertical files (with registry file), suitable for use with CQP-based concordancers, such as CWB, noSketch Engine or KonText. Also included is the 2.1 release of the data and scripts available at the GitHub repository of the ParlaMint project. As opposed to the previous version 2.0, this version corrects some errors in various corpora and adds the information on upper / lower house for bicameral parliaments. The vertical files have also been changed to make them easier to use in the concordancers.
Begrepet siida er kjent over hele Sápmi. Så langt tilbake i tid vi kan vite, har siidaer vært der som noe som vi kan forstå som samiske lokalsamfunn. Det som i dag gjenstår av videreførte næringsutøvende og levemåtepraktiserende siidaer, er siidaer som har arbeidet med reinflokker som sitt sammenbindende element. Denne siidatradisjonen har vært levende frem til i dag spesielt i Kautokeino- og Karasjokområdet i Norge, det vil si i det området hvor det foreliggende studiet er gjort. Imidlertid lever den fortsatt i krysspresset mellom representasjonen av tidligere østlige siidaer i litteraturen og den forvaltningsmessige praktisering av reindriftsloven. Bildet av en fasttømret organisering i de østlige siidaer og opphør av de såkalte veidesiidaene har usynliggjort forbindelseslinjene mellom tidligere og dagens siidaer og også ført til oppfattelse av siidasystemet som ikke-eksisterende. Reindriftslovgivningen har på sin side skapt og utviklet et alternativt reinbeitedistriktssystem. Etter at siidaen i prinsippet ble anerkjent ved reindriftsloven av 2007, har problemstillinger og forvaltningspraksis med opphav i reinbeitedistriktssystemet allikevel fått lov til å prege tilnærmingen til siidasystemet. For å kunne rehabilitere siidaer og siidasystemet er det ingen veg utenom å plukke opp og ta utgangspunkt i forbindelseslinjene som ligger i videreføringen av siidaer gjennom tidene. Det går ut på å ta utgangspunkt i siidasystemets egne begreper, forståelsesformer og prioriteringer. I dag dreier det seg først og fremst om å ekstrahere og fremlegge siidaers felles begrepsbruk og tradisjonelle kunnskaper i arbeidet med reinflokker, men også å rette blikket mot tradisjonelle anskuelsesformer som ligger bak praksisene i siidautøvelsen. Blant annet ligger i dette et grunnlag for å synliggjøre settinger og tilpasning i forholdet mellom siidautøvelsen og dens naturgitte omgivelser, og også problemstillinger og sider ved avklaring av forhold mellom siidaer i prosessen mot rehabilitering av siidasystemet. Fastsettelse av ytre rammer for reinbeitedistrikter og siidaer har tematisk sett vært et møtepunkt mellom reineierne og offentlig forvaltning, et møte som så langt ikke har hatt siidaers tradisjonelle kunnskaper og tenkemåter som sitt utgangspunkt og sentrum for oppmerksomhet. Det foreliggende studiet tar for seg eksempler på tradisjonell samisk livsanskuelse og måter å nærme seg siidaer ut fra deres egne begreper og praksiser. Som reingjeteren forholder seg til kantene av flokken så forholder disse tekstene seg til betydningsfulle aspekter av siidaer generelt og ikke til det indre liv i noen spesifikk siida. ; Siidadoaba lea oahpis miehtá Sámi. Siiddat leat leamaš juo nu áigá go mii oba diehtit. Dat siiddat mat leat buolvvas bulvii doalahuvvon otnážii birgejumi ja eallinlági vuođđun, lea dat siiddat main boazoealut dahket siiddastallama. Dát siidavierut lea leamaš eallevaččat gitta otná rádjái, Norgga bealde erenoamážit Guovdageainnus ja Kárášjogas. Dat dattege ain almmolašvuođas cahket duvdahuvvan dilis, girjjálašvuođa dološ nuortalaš siiddaid ja boazodoallolága hálddahuslaš meannudeami gaskkas. Nuortalaš siiddaid čavga organiserema ja bivdosiiddaid bieđganeami govat leat jávkadan ovdalaš ja dálá siiddaid čanastagaid ja dagahan vel dan ipmárdusa ahte siidavuogádat ii leat šat doaimmas. Boazodoallolágaiguin fas lea ásahuvvon ja ovddiduvvon boazoguohtundistriktavuogádat dasa buohta. Maŋŋel go 2007 boazodoallolágain siida vuođđojurdagis dohkkehuvvui leat buorrin ieš sadjásis, leat aŋkke boazoguohtundistriktavuogádaga vuolggahan áššečoavdimušat ja hálddahuslaš doaibmavuogit beassan báidnit vugiid mainna lágiin ipmirdišgoahtit siidavuogádaga. Jus galgat máhttit fastain árvvosmuhttit siidda ja siidavuogádaga, de eat sáhte garvit bargamuša geaži gávdnat dain čanastagain mat leat doalahan siiddaid áiggiid čađa. Das ii leat earágo vuođđun atnit siidavuogádaga iežas doahpagiid, ipmárdusvugiid ja vuoruhemiid. Dálá dilis ovddemussan fertet lavdit ja ovdan bidjat siiddaid oktasaš doabageavaheami ja árbevirolaš máhtu eallogohkkema birra, muhto maiddái geahččalit čalbmat ipmárdusvugiid mat leat dagolaš siiddastallama duohken. Earret eará lea dás vejolaš oidnosii dahkat siiddastallama ja eatnamiid gaskavuođa meriid ja heivehusaid, ja nu maiddái siiddaid gaskavuođa gažaldagaid ja čilgejumi dárbbuid das go galgat bargagoahtit fastain árvvosmuhttit siidavuogádaga. Siiddaid ravdameriid čilgejupmi lea leamaš boazoeaiggádiid ja almmolaš hálddahusa deaivvadeami sáhkafáddán, deaivvadeapmi mas dán rádjái eai leat adnon siiddaid árbevirolaš máhtut ja jurddavuogit sáhkavuođđun. Čállagiin maid dás bijan ovdan, guorahalan ovdamearkkaid das maid sápmelaččat leat árbevirolaččat diđoštan ja vugiid beassat siiddaid lagabuidda alddeset doahpagiid ja doaibmavugiid bokte. Dáid čállagiid lean jurddahan lohkamuššan sihke siiddaid olbmuide ja earáide. Nugo guođoheaddjit bissot ealloravddain, de dáid tevsttaid guorahallamat nai bissot siidaravddaid diđošteapmin, eaige mana guđege siidda sisa. ; The concept of siida is well-known throughout the Sápmi. Far back in time there have been local Sámi communities called siidas. Siidas having reindeer herding as their unifying element are today the only ones representing a continuous practice of traditional siida livelihood and way of life. They represent an ongoing siida tradition up to now, especially the siidas in Guovdageaidnu and Kárášjohka in Norway, i.e. in the region where the present research has been carried out. This tradition, however, still lives under pressure from the representation of the eastern siidas in literature on the one side and the administrative way of practicing the Reindeer Husbandry Act on the other side. The image of a stable organization in the eastern siidas and the end of the hunting siidas has veiled the connecting lines between the past and present siidas and also led to an understanding of the siida system as non-existent. On the other hand, legislation on reindeer husbandry has led to construction and development of an alternate organization, the Reindeer Herding District system. The new Reindeer Husbandry Act now has acknowledged the siida as a basic unit, but issues and governing practices originating from the reindeer herding district system have still been allowed to shape the approach to the siida system. The process towards rehabilitation of siidas and the siida system contains no ways round the method of picking up the connecting lines between past and present siidas and taking them as our starting points. This implies taking the siida system's own concepts, premises of thought and priorities as our starting point. Today it means primarily to extract and present the siidas' shared concepts and traditional herding knowledge, but also to look towards the traditional world view behind the siida practices. This provides, among other things, a basis to elucidate settings and adaptation in the relationship between siida and its natural surroundings, and also issues and ways of settling different matters in the process of rehabilitation of the siida system. Determination of outer limits for reindeer herding districts and siidas has been a thematic meeting point between reindeer herders and government. So far these meetings have not taken the siidas traditional knowledge and ways of thinking as their primary starting point and focus of attention. The present research examines examples of traditional Sámi beliefs and ways to approach siidas based on their own concepts and practices. These texts are meant to be read by both siida members and others. As the reindeer herder primarily relates to the edges of the herd, likewise these texts relate to important aspects of all siidas and not to the inner life of any specific siida.