“Using LIWC2015 for Computational Linguistic Analysis” will be presented from 1:30-3:30 p.m., Friday, Sept. 23, in Hale 401B. The Linguistic Inquiry and Word Count (LIWC) tool enables users to conduct computational linguistic analyses on single texts, collections of texts, and text corpora, in order to better understand the latent data.
This presentation will include an overview of the linguistic analysis sequence: research design, curation of text sets, data cleaning, exploration of texts and data analytics using LIWC, and common methods for reporting the research information.
Some LIWC2015 capabilities
LIWC (pronounced “luke”) enables the classic counts of word and punctuation types, often used as proxies for text complexity and structural composition. There are function word counts used for author stylometry.
- Then, there are descriptive composite scores (reported out as percentiles) based on years of text research: Analytic, Clout, Authentic, and Tone (sentiment, emotion).
- There are psychological constructs (of authors) based on affect, social, perceptual processes, and drives. These linguistic elements are built off of psychological models and psychological research.
- There are human-based constructs: biological processes, time orientation, relativity, personal concerns, informal language, and cognitive processes.
In all, there are over 90 output variables.
LIWC2015 does enable the incorporation of outside dictionaries (in various languages) and even researcher-created dictionaries. (The built-in dictionary, though, is not editable by users).
A brief history
LIWC originated in the early-to-mid 1990s to study the potential effect of writing on human health. Through the years, it has evolved (LIWC2001, LIWC2007, and now LIWC2015). In its current version, it is comprised of a processor and a built-in dictionary.
As a research instrument, it is well documented and solidly tested, with documented confidence levels for particular metrics (The Development and Psychometric Properties of LIWC2015 by J.W. Pennebaker, R.L. Boyd, K. Jordan, and K. Blackburn).
LIWC2015 has been used in a variety of ways and garnered wide usage. It is mentioned in numerous academic periodicals and books. Dr. James W. Pennebaker, one of the principals behind this tool, gave a TEDx (Austin) Talk titled “The Secret Life of Pronouns,” which is based on his book of the same title.
While LIWC2015 is well documented (to the point of near-transparency), it is a closed source tool (so no access to the code directly). In the research literature, this tool has been applied in creative ways, including remote author profiling, genre analysis, suicidality analysis, social status analysis, predictive analytics, and other applications.
The software tool itself costs approximately $90 for the (forever) educational license through Pennebaker Conglomerates, Inc. Currently, K-State does not have a site license for this software. This tool is not required per se to participate in the session.
No RSVP is needed for this event. K-State staff may enroll for this training through HRIS. If there are any questions, please contact firstname.lastname@example.org or call 785-532-5262.