[KS] KCNA and Rodong Sinmun Article Data
Frank Hoffmann
hoffmann at koreanstudies.com
Wed Jun 17 00:05:57 EDT 2020
Hi Scott:
Indeed, I am just soooo slow. My apologies. You don't actually
collected those articles from their Website/s to put them in a DB,
allowing us to search and read them -- instead you pushed all text,
into a simple DB, ordered by date, and then allow searches for phrases
and/or words. The output is a word count that is being graphically
displayed on a timeline worksheet.
Got confused as this is a Korean Studies list, had not expected this.
Thanks Scott.
Best,
Frank
On Tue, 16 Jun 2020 22:37:16 -0400, Scott Fisher wrote:
> Hi Frank,
>
> Are you on our website, or the Harvard site? For our site I checked
> and everything seems to be working:
> https://focusdataproject.com/north-korea/. From the screenshot it
> looks like it might be the Harvard site. For that one you need to
> request access to the files, then once we hit approve you can
> download the files as spreadsheets.
>
> Hope this helps. Thanks for the email.
>
> Scott
>
>
> On Tue, Jun 16, 2020 at 9:11 PM Frank Hoffmann
> <hoffmann at koreanstudies.com> wrote:
>> Thank you Scott.
>> I see it is just the English language editions (makes searching for
>> names a bit problematic).
>> Question: I am logged in, have marked the two NK Datasets, but all
>> searches get me zero hits, even if searching for terms like "Kim" or
>> "water" or "train." What am I doing wrong?
>>
>> Thanks.
>> Frank
>>
>>
>>
>>
>> On Tue, 16 Jun 2020 11:51:58 -0400, Scott Fisher wrote:
>>> Greetings,
>>>
>>> Thanks to a recent grant, we've been able to assemble databases of
>>> articles from the Korean Central News Agency (KCNA) and the Rodong
>>> Sinmun. For KCNA the articles run from 1 October 2008 to 27 Feb 2020,
>>> just over 85,000 articles. The Rodong Sinmun database is smaller,
>>> running from 2 Jan 2018 to 31 Dec 2019, just over 7,100 articles.
>>> Both represent all articles available on the respective websites at
>>> the time of the scrape/collection earlier this year.
>>>
>>> We added sentiment and topic analysis to the data, put everything
>>> into Tableau, and made both databases searchable on the affiliated
>>> project's website: https://focusdataproject.com/north-korea/. Note
>>> the interesting spike in reporting in Dec 2011. You can run searches
>>> using the Search Article Text feature - comparing KCNA sentiment
>>> regarding Trump and Moon is quite interesting.
>>>
>>> For those who would like access to the full databases, we set up a
>>> Harvard Dataverse:
>>> https://dataverse.harvard.edu/dataverse/focusdataproject.
>>>
>>> We are adding similar data for state media and foreign ministry
>>> postings from China, Russia, and Iran. The project and affiliated
>>> website (https://focusdataproject.com/) are new and just emerging
>>> from beta; please let me know of any technical or related issues.
>>>
>>> Happy to answer any questions. A colleague and I will also be
>>> presenting (virtually, unfortunately) on the databases and associated
>>> methodology at APSA in September.
>>>
>>> Be well,
>>>
>>> Scott
>>>
>>>
>>> Scott Fisher, PhD
>>> Assistant Professor, Professional Security Studies
>>> New Jersey City University
>>> sfisher1 at njcu.edu
>>>
>>>
>>>
>>>
>>
>> _______________________________
>> Frank Hoffmann
>> http://koreanstudies.com
_______________________________
Frank Hoffmann
http://koreanstudies.com
More information about the Koreanstudies
mailing list