<?xml version='1.0' encoding='UTF-8'?><metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns="http://dublincore.org/documents/dcmi-terms/"><dcterms:title>PrevDistro</dcterms:title><dcterms:identifier>https://hdl.handle.net/21.15109/CONCORDA/XTVX3U</dcterms:identifier><dcterms:creator>Kalivoda, Ágnes</dcterms:creator><dcterms:publisher>ARP</dcterms:publisher><dcterms:issued>2021-06-21</dcterms:issued><dcterms:modified>2025-11-24T18:16:50Z</dcterms:modified><dcterms:description>PrevDistro (Preverb Distributions) is an open-source dataset containing 41.5 million corpus occurrences of 49 preverb-verb construction types. It consists of 10 columns which are as follows:</dcterms:description><dcterms:description>1st: ID</dcterms:description><dcterms:description>2nd: construction type</dcterms:description><dcterms:description>3rd: construction subtype</dcterms:description><dcterms:description>4th: preverb position</dcterms:description><dcterms:description>5th: preverb</dcterms:description><dcterms:description>6th: verb lemma</dcterms:description><dcterms:description>7th: intervening words (as lemmas)</dcterms:description><dcterms:description>8th: actual form</dcterms:description><dcterms:description>9th: document ID</dcterms:description><dcterms:description>10th: actual sentence from the Hungarian Gigaword Corpus, the actual form (KWIC) stands between &lt; ... ></dcterms:description><dcterms:subject>Arts and Humanities</dcterms:subject><dcterms:subject>linguistics</dcterms:subject><dcterms:subject>Hungarian language</dcterms:subject><dcterms:subject>preverb constructions</dcterms:subject><dcterms:language>Hungarian</dcterms:language><dcterms:isReferencedBy>Kalivoda, Ágnes (2021). Igekötős szerkezetek a magyarban [Preverb constructions in Hungarian]. PhD thesis. Pázmány Péter Catholic University, Budapest, Hungary. (to appear) (https://github.com/kagnes/phd_thesis)</dcterms:isReferencedBy><dcterms:date>2021-06-21</dcterms:date><dcterms:contributor>Kalivoda, Ágnes</dcterms:contributor><dcterms:dateSubmitted>2021-06-21</dcterms:dateSubmitted><dcterms:type>corpus data</dcterms:type><dcterms:source>PrevLex: https://github.com/kagnes/prevlex</dcterms:source><dcterms:source>Hungarian Gigaword Corpus (also known as HGC, MNSZ2): http://clara.nytud.hu/mnsz2-dev/</dcterms:source><dcterms:rights>This dataset is made available under a Creative Commons CC0 license with the following additional/modified terms and conditions: </dcterms:rights></metadata>