Data Sets
Both corpora are accessible via the krpoltext R package and a static data API.
The South Korean Party Statements Corpus is a collection of official statements issued by the two major parties in South Korea from 2003 to 2022. The dataset comprises 35,115 entries from the Conservative Party and 42,335 entries from the Progressive Party (82,723 total, 9 variables).
The South Korean Election Campaign Booklet Corpus is a collection of manifesto pamphlets from individual candidates who ran for office in single-member or multi-member districts in six National Assembly elections, six local elections, and five presidential elections in South Korea between 2000 and 2022. The dataset has 49,678 observations across 31 variables.
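For readers who work outside R, the static data API could be consumed roughly as sketched below. This is a minimal illustration and does not describe the documented interface. The URL, the CSV format, and the column name `party` are all assumptions, since this section does not specify them. The sketch downloads a file as text and counts rows per party, which is a quick way to check a download against the entry counts reported above.

```python
import csv
import io
from collections import Counter
from urllib.request import urlopen

# Hypothetical placeholder: the real static API URL is not given in this section.
DATA_URL = "https://example.org/krpoltext/statements.csv"


def fetch_csv_text(url: str) -> str:
    """Download a file and return its contents as text (assumes UTF-8 CSV)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8")


def count_by_column(csv_text: str, column: str) -> Counter:
    """Count rows per distinct value of `column` in CSV-formatted text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row[column] for row in reader)


# Tiny in-memory stand-in so the sketch runs without network access.
# The column names here are assumed, not taken from the corpus.
sample = (
    "party,date,title\n"
    "conservative,2003-01-02,A\n"
    "progressive,2003-01-03,B\n"
    "progressive,2003-01-04,C\n"
)
counts = count_by_column(sample, "party")
print(counts["conservative"], counts["progressive"])
```

With a real endpoint, `count_by_column(fetch_csv_text(DATA_URL), "party")` would replace the in-memory sample, once the actual URL and column name are substituted.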